If you have doc value fields A-Z, and you know that fields A, T, and U
are commonly accessed together, they could be put into the same family
and therefor the same file. Wouldn't this give you some small I/O gain
since they are contiguous? It could very well be that the overhead of
additional files is not worth it.
On 3/14/13 3:51 PM, Robert Muir wrote:
Whats the advantage of using more files?
fyi If you wanted to do this, you can abuse PerFieldDocValuesFormat
today to do it, e.g.:
iwc.setCodec(new Lucene42Codec() {
final DocValuesFormat group1 = new Lucene42DocValuesFormat();
final DocValuesFormat group2 = new Lucene42DocValuesFormat();
...
@Override
public DocValuesFormat getDocValuesFormatForField(String field) {
if (field in some list) {
return group1;
} else {
return group2;
} ...
}
});
On Thu, Mar 14, 2013 at 3:43 PM, David Arthur <[email protected]> wrote:
I have an experimental patch that adds support for field families for doc
values. The idea is taken from various BigTable implementations where a set
of fields can be configured to appear in the same physical file. The idea
is, rather than putting all docvalue fields into a single file, they can be
grouped together if they are commonly accessed together.
Would this be something worth fleshing out and contributing?
-David
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]