If you have doc value fields A-Z, and you know that fields A, T, and U are commonly accessed together, they could be put into the same family and therefore the same file. Wouldn't this give you a small I/O gain, since their values would be contiguous on disk? That said, it could very well be that the overhead of additional files is not worth it.

On 3/14/13 3:51 PM, Robert Muir wrote:
What's the advantage of using more files?

FYI, if you wanted to do this, you can abuse PerFieldDocValuesFormat
today to do it, e.g.:

     iwc.setCodec(new Lucene42Codec() {

       final DocValuesFormat group1 = new Lucene42DocValuesFormat();
       final DocValuesFormat group2 = new Lucene42DocValuesFormat();
       ...

       @Override
       public DocValuesFormat getDocValuesFormatForField(String field) {
         if (field in some list) {
           return group1;
         } else {
           return group2;
         } ...
       }
     });
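
The routing decision in the snippet above comes down to a lookup from field name to group. Below is a minimal, Lucene-free sketch of that lookup. It assumes a hypothetical `FieldFamilyRouter` class and made-up family names (`family1`, `default`), none of which come from Lucene or from the patch under discussion. In a real codec, each family name would presumably correspond to one `DocValuesFormat` instance returned from `getDocValuesFormatForField`.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper: maps field names to a family name, with a fallback family. */
public class FieldFamilyRouter {
    private final Map<String, String> familyByField = new HashMap<>();
    private final String defaultFamily;

    public FieldFamilyRouter(String defaultFamily) {
        this.defaultFamily = defaultFamily;
    }

    /** Assigns every field in the given list to the given family. */
    public FieldFamilyRouter addFamily(String family, List<String> fields) {
        for (String field : fields) {
            familyByField.put(field, family);
        }
        return this;
    }

    /** Returns the family for a field, or the default family if none was configured. */
    public String familyFor(String field) {
        return familyByField.getOrDefault(field, defaultFamily);
    }

    public static void main(String[] args) {
        // Fields A, T, and U share a family; every other field falls back to "default".
        FieldFamilyRouter router = new FieldFamilyRouter("default")
            .addFamily("family1", Arrays.asList("A", "T", "U"));
        System.out.println("T -> " + router.familyFor("T"));
        System.out.println("B -> " + router.familyFor("B"));
    }
}
```

Keeping the mapping in one place means the `if`/`else` chain in the codec override could shrink to a single lookup, whatever the number of groups.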

On Thu, Mar 14, 2013 at 3:43 PM, David Arthur <[email protected]> wrote:
I have an experimental patch that adds support for field families for doc
values. The idea is taken from various BigTable implementations, where a set
of fields can be configured to appear in the same physical file. Rather than
putting all docvalue fields into a single file, fields that are commonly
accessed together can be grouped into a file of their own.

Would this be something worth fleshing out and contributing?

-David

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
