Re: [I] Significant increase in IOPS during segment merge in Lucene 10 [lucene]

via GitHub Tue, 09 Jun 2026 23:07:42 -0700


amalatlas commented on issue #16213:
URL: https://github.com/apache/lucene/issues/16213#issuecomment-4667086176


   When digging deeper into the code, I realised `fieldsStream =
       d.openInput(fieldsStreamFN, context.withHints(FileTypeHint.DATA, 
DataAccessHint.RANDOM));` is a no-op in the Merge path and that `NORMAL` is 
actually being used when opening the file. The profiling indicates 
`Lucene90CompressingStoredFieldsWriter.merge` is the culprit, so I believe this 
could be originating from OpenSearch codebase than Lucene itself. 
   My current hypothesis is that OpenSearch is setting the `madv` to 
`SEQUENTIAL` which is probably still not correct, as the merge path would read 
to the end for checksum calculation and then moves back to read the stored 
fields in the merge path. When it moves back kernel may have already discarded 
the read pages due to aggressive `SEQUENTIAL` advise.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [I] Significant increase in IOPS during segment merge in Lucene 10 [lucene]

Reply via email to