Hi,

Tests show that TermEnum.docFreq() returns sum of all docs, including
the deleted ones. Which seems to (indirectly) contradict the javadoc

This frequency count is used to compute uninverted index
(DocTermOrds.uninvert()). The code goes like:

      final int df = te.docFreq();
      if (df <= maxTermDocFreq) {


So, if I happen to have many deleted documents, and maxTermDocFreq is
low, then the term will be excluded (even if the freq of the livedocs
is OK). Most likely, the cache will be incomplete.

Can it be considered a feature? Or is it a bug?

Thanks,

  roman

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to