Adrien Grand created LUCENE-9003:
------------------------------------

             Summary: Should FilterDirectoryReader compute numDocs lazily?
                 Key: LUCENE-9003
                 URL: https://issues.apache.org/jira/browse/LUCENE-9003
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Adrien Grand


FilterDirectoryReader extends BaseCompositeReader, which computes both maxDoc 
and numDocs eagerly in its constructor by summing up these values across all 
sub leaves.

This is problematic for readers that hide additional documents. Computing 
numDocs on such leaf readers usually requires iterating over all live documents 
to count them. This makes creating a FilterDirectoryReader on top run in linear 
time, which has caused several performance bugs to us over time. This is 
especially frustrating given that numDocs is a rarely used index statistic.

I think computing numDocs lazily would be less surprising?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to