Hi~
We are trying to build an OLAP database based on lucene, and we heavily use
lucene's DocValues (as our column store).
We try to use DocValues to store the array type field. For example, if we want
to store the field1 and feild2 in this json document into DocValues
respectively,
Depending on what you use the field for, you can use BinaryDocValuesField
which encodes a byte[] and lets you store the data however you want. But
how are you using these fields later at search time?
On Tue, Jun 28, 2022 at 3:46 PM linfeng lu wrote:
> Hi~
>
> We are trying to build an OLAP
I think it depends on what information we actually want to get here. If it’s
just finding which fields matched in which document, then running Matches over
the top-k results is fine. If you want to get some kind of aggregate data, as
in you want to get a list of fields that matched in *any*
I finished the second prototype. With a few exceptions, almost all existing
issues were successfully migrated into the test repo. You can browse/search
them.
https://github.com/mocobeta/sandbox-lucene-10557/issues
Some limitations in the first prototype have been addressed. For example,
we can