[
https://issues.apache.org/jira/browse/LUCENE-8602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16720232#comment-16720232
]
ASF subversion and git services commented on LUCENE-8602:
---------------------------------------------------------
Commit 8f4e4a44f8d0eb78cfb2498a48c7ac6231c6ff3b in lucene-solr's branch
refs/heads/branch_7x from [~simonw]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=8f4e4a4 ]
LUCENE-8608: Extract utility class to iterate over terms docs
Today we re-implement the same algorithm in various places
when we want to consume all docs for a set/list of terms. This
caused serious slowdowns for instance in the case of applying
updates fixed in LUCENE-8602. This change extracts the common
usage and shares the interation code including logic to reuse
Terms and PostingsEnum instances as much as possble and adds
tests for it.
> Share TermsEnum if possible while applying DV updates
> ------------------------------------------------------
>
> Key: LUCENE-8602
> URL: https://issues.apache.org/jira/browse/LUCENE-8602
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Simon Willnauer
> Priority: Major
> Fix For: master (8.0), 7.7
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> Today we pull a new terms enum when we apply DV updates even though the
> field stays the same which is the common case. Benchmarking this on a
> larger term dictionary with a significant number of updates shows a
> 2x improvement in performance.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]