[ 
https://issues.apache.org/jira/browse/LUCENE-5188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13757641#comment-13757641
 ] 

Adrien Grand commented on LUCENE-5188:
--------------------------------------

These bytes can be shared because they are write-only, kind of like /dev/null. 
Having this on DataInput to be able to skip an entire decompression would be 
nice but unfortunately with the current design, the field numbers are stored in 
the compressed stream, so you need to decompress anyway to know whether you 
should skip (StoredFieldVisitor allows to skip based on the FieldInfo, that my 
StoredFieldReader computes from the field number). But your idea is something I 
would like to explore for the next StoredFieldsFormat, along with preset 
dictionaries.
                
> Make CompressingStoredFieldsFormat more friendly to StoredFieldVisitors
> -----------------------------------------------------------------------
>
>                 Key: LUCENE-5188
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5188
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Assignee: Adrien Grand
>            Priority: Minor
>         Attachments: LUCENE-5188.patch
>
>
> The way CompressingStoredFieldsFormat works is that it first decompresses 
> data and then consults the StoredFieldVisitor. This is a bit wasteful in case 
> documents are big and only the first field of a document is of interest so 
> maybe we could decompress and consult the StoredFieldVicitor in a more 
> streaming fashion.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to