[ 
https://issues.apache.org/jira/browse/LUCENE-3515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13126820#comment-13126820
 ] 

Robert Muir commented on LUCENE-3515:
-------------------------------------

it looks like the bug will especially affect any directory that uses 
bufferedindexinput (NIOFS/SimpleFS).

The problem is multitermsenum doesnt reuse the sub-docs&positionsenums, so for 
each term*segment we clone the input, and
bufferedindexinput.clone() sets the clone's buffer to null.

so across lots of low freq-terms we re-read 4096 bytes (MERGE_BUFFER_SIZE) to 
refill the buffer on each one...

mmapdirectory is less affected because it has no buffer to re-read but seems 
like fixing the reusing would even help it...
                
> Possible slowdown of indexing/merging on 3.x vs trunk
> -----------------------------------------------------
>
>                 Key: LUCENE-3515
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3515
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: core/index
>            Reporter: Michael McCandless
>             Fix For: 3.5, 4.0
>
>         Attachments: LUCENE-index-34.patch, LUCENE-index-40.patch, 
> TestGenerationTime.java.3x, TestGenerationTime.java.40, 
> stdout-snow-leopard.tar.gz
>
>
> Opening an issue to pursue the possible slowdown Marc Sturlese uncovered.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to