[ 
https://issues.apache.org/jira/browse/LUCENE-5127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13720856#comment-13720856
 ] 

Adrien Grand commented on LUCENE-5127:
--------------------------------------

This is a very nice cleanup! In FixedGapTermsIndexWriter, I think we could 
improve the buffering of offsets and addresses by directly buffering into a 
MonotonicBlockPackedWriter over a RamOutputStream, and then copy the raw 
content of the RamOutputStream to the IndexOutput? This would avoid an extra 
encoding/decoding step.
                
> FixedGapTermsIndex should use monotonic compression
> ---------------------------------------------------
>
>                 Key: LUCENE-5127
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5127
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Robert Muir
>         Attachments: LUCENE-5127.patch, LUCENE-5127.patch, LUCENE-5127.patch
>
>
> for the addresses in the big in-memory byte[] and disk blocks, we could save 
> a good deal of RAM here.
> I think this codec just never got upgraded when we added these new packed 
> improvements, but it might be interesting to try to use for the terms data of 
> sorted/sortedset DV implementations.
> patch works, but has nocommits and currently ignores the divisor. The 
> annoying problem there being that we have the shared interface with 
> "get(int)" for PackedInts.Mutable/Reader, but no equivalent base class for 
> monotonics get(long)... 
> Still its enough that we could benchmark/compare for now.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to