[ https://issues.apache.org/jira/browse/LUCENE-9003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16951203#comment-16951203 ]
ASF subversion and git services commented on LUCENE-9003:
---------------------------------------------------------
Commit e5ca494769ce31a25fb27b2f4e1153632843dd3f in lucene-solr's branch
refs/heads/branch_8x from Adrien Grand
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=e5ca494 ]
LUCENE-9003: Compute numDocs() lazily. (#939)
> Should FilterDirectoryReader compute numDocs lazily?
> ----------------------------------------------------
>
> Key: LUCENE-9003
> URL: https://issues.apache.org/jira/browse/LUCENE-9003
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Priority: Minor
> Time Spent: 1h 20m
> Remaining Estimate: 0h
>
> FilterDirectoryReader extends BaseCompositeReader, which computes both maxDoc
> and numDocs eagerly in its constructor by summing these values across all
> leaves.
>
> This is problematic for readers that hide additional documents. Computing
> numDocs on such leaf readers usually requires iterating over all live
> documents to count them. This makes creating a FilterDirectoryReader on top
> of them run in time linear in the number of documents, which has caused us
> several performance bugs over time. This is especially frustrating given
> that numDocs is a rarely used index statistic.
>
> I think computing numDocs lazily would be less surprising.
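
For readers skimming the thread, the idea in the description above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from the commit: `Leaf` and `LazyComposite` are hypothetical stand-ins rather than Lucene types, and `-1` is used here as an assumed "not computed yet" marker. maxDoc is summed in the constructor because it is cheap, while numDocs is summed on first request and then cached.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a leaf reader; not a Lucene type. */
interface Leaf {
    int maxDoc();   // assumed cheap: known without scanning documents
    int numDocs();  // possibly expensive: may iterate over all live documents
}

/** Hypothetical composite that sums maxDoc eagerly and numDocs lazily. */
final class LazyComposite {
    private final List<Leaf> leaves;
    private final int maxDoc;
    // -1 means "not computed yet"; volatile so a cached value is visible across threads.
    private volatile int numDocs = -1;

    LazyComposite(List<Leaf> leaves) {
        this.leaves = new ArrayList<>(leaves);
        int sum = 0;
        for (Leaf leaf : this.leaves) {
            sum += leaf.maxDoc(); // eager, but never calls numDocs()
        }
        this.maxDoc = sum;
    }

    int maxDoc() {
        return maxDoc;
    }

    int numDocs() {
        int n = numDocs;
        if (n == -1) {
            // First call: pay the cost of counting now, then cache the result.
            n = 0;
            for (Leaf leaf : leaves) {
                n += leaf.numDocs();
            }
            // Two threads may both compute the sum; they store the same value,
            // so the race is harmless as long as the leaves are immutable.
            numDocs = n;
        }
        return n;
    }
}
```

With this shape, constructing the wrapper costs time proportional to the number of leaves rather than the number of documents, and callers that never ask for numDocs never trigger the count.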