[ 
https://issues.apache.org/jira/browse/ACCUMULO-3592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328087#comment-14328087
 ] 

Josh Elser commented on ACCUMULO-3592:
--------------------------------------

bq. If you're suggesting that this change will not require future RFile version 
number bumps when we add stuff to this structure, then yeah, let's do it now

Right. Things that we can naturally represent inside this schema-drifting data 
structure (and, with proper use by the reader) would allow old readers to read 
new files. For example, we add some new value to the header, hypothetically: 
some density value for key-values in the file. The reader implementation can 
use the density score to perform more optimized seeks/reads. The old readers 
can still function without this new attribute in the header (and would parse it 
without error), while new readers can read it (when present) and leverage it to 
do things more efficiently.

It obviously doesn't protect us from any/all changes, but it should reduce the 
cases in which we'd otherwise need to bump the RFile version.

> Improvements to RFile
> ---------------------
>
>                 Key: ACCUMULO-3592
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3592
>             Project: Accumulo
>          Issue Type: Sub-task
>            Reporter: Josh Elser
>             Fix For: 1.7.0
>
>
> We have a small "header" at the beginning of the RFile index at the end of 
> the file. This is currently just a few writable things thrown together -- it 
> would be nice to convert this into some pb/thrift message that we can 
> add/remove things easily.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to