atris commented on issue #754: LUCENE-8875: Introduce Optimized Collector For 
Large Number Of Hits
URL: https://github.com/apache/lucene-solr/pull/754#issuecomment-508914746
 
 
   @jpountz @tokee Thanks for your comments.
   
   I plan to maintain an ArrayList of ScoreDocs, collect numHits into it, 
then sort by score and return the top N, as Adrien suggested. This should 
address the performance bottleneck of the PQ allocating slots and 
prepopulating them with sentinel values.
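
A minimal sketch of that plan, for illustration only. It deliberately avoids the Lucene API: `ListTopHitsSketch`, `Hit`, `collect`, and `topN` are hypothetical names, with `Hit` standing in for ScoreDoc. The tie-break on ascending docID is an assumption, not something settled in this thread.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical stand-in for the proposed collector; not Lucene's API. */
class ListTopHitsSketch {

    /** Minimal score + docID pair, standing in for Lucene's ScoreDoc. */
    static final class Hit {
        final int doc;
        final float score;

        Hit(int doc, float score) {
            this.doc = doc;
            this.score = score;
        }
    }

    // Grows on demand: no slots are preallocated or filled with sentinels.
    private final List<Hit> hits = new ArrayList<>();

    /** Appends a hit; no ordering work is done at collection time. */
    void collect(int doc, float score) {
        hits.add(new Hit(doc, score));
    }

    /**
     * Sorts a copy of the collected hits by descending score, breaking
     * ties by ascending docID (an assumed policy), and returns the first n.
     */
    List<Hit> topN(int n) {
        List<Hit> sorted = new ArrayList<>(hits);
        sorted.sort(Comparator.comparingDouble((Hit h) -> h.score)
                .reversed()
                .thenComparingInt(h -> h.doc));
        int end = Math.max(0, Math.min(n, sorted.size()));
        return new ArrayList<>(sorted.subList(0, end));
    }
}
```

The trade-off in this sketch is that every collected hit is retained until the final sort, so memory grows with the number of hits collected rather than being bounded up front as with a fixed-size PQ.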
   
   I am inclined to pursue the idea of dropping ScoreDoc and representing 
score + docID without it in a separate issue, primarily because of the 
nuances involved in handling shard indices. That said, I feel shard indices 
should no longer be a major blocker, given that TopDocs.merge can now tie 
break on docIDs and ignore shard indices completely during tie breaking.
   
   Thoughts?

