Re: documents with large numbers of fields

Mike Klaas Fri, 18 May 2007 14:40:08 -0700

On 18-May-07, at 1:01 PM, charlie w wrote:

So now I have the idea to invert the field name and value thusly:
foo=tag     ^2
bar=tag     ^1.2
foobar=tag    ^1.8
and search "foo:tag".
Intuitively, I would expect Lucene to be optimized for searching the values of fields, and not really the names of fields. In a somewhat large index, say 10 million documents, will Lucene search performance continue to be
acceptable if I load up documents with many fields like this?

Perhaps not. Storing a field with norms occupies O(N) space per field, where N is the number of documents in the index, regardless of the number of documents with non-zero norms. There might be too much data for the OS to cache and Lucene to process efficiently.
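
To put rough numbers on that, here is a back-of-the-envelope sketch. It assumes one byte of norm data per document per field with norms enabled; the helper name norms_bytes and the field counts are only illustrative, not taken from the question.

```python
def norms_bytes(num_docs, num_fields_with_norms, bytes_per_norm=1):
    """Estimate norm storage: every field with norms costs space for
    every document in the index, whether or not the document has that field.

    bytes_per_norm=1 is an assumption (one byte per document per field).
    """
    return num_docs * num_fields_with_norms * bytes_per_norm


if __name__ == "__main__":
    num_docs = 10_000_000  # index size mentioned in the question
    # Hypothetical numbers of distinct field names across the index.
    for num_fields in (10, 100, 1000):
        size_mb = norms_bytes(num_docs, num_fields) / 1_000_000
        print(f"{num_fields:5d} fields with norms: about {size_mb:,.0f} MB")
```

The point of the sketch is that the cost scales with the number of distinct field names times the index size, not with how many documents actually use each field.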

Is there an upper limit on the number of fields comprising a document, and
if so what is it?


There is not. Fields are relatively costless if norms are omitted (omitNorms=true).

Or, is there some way to make my original approach work after all?

The experimental Payloads feature allows an optional boost to be stored along with each term position. This is its intended use case.
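
As a conceptual illustration only, the idea can be modelled as a single field whose postings each carry their own boost. The sketch below is a toy in-memory model, not the Lucene Payloads API; the class ToyPayloadIndex and its methods are hypothetical, and it assumes a single tag field in which each term has a per-document boost.

```python
from collections import defaultdict


class ToyPayloadIndex:
    """Toy model of one field whose postings carry a per-term boost.

    This is not Lucene code; it only mimics the idea of storing a boost
    next to each term occurrence instead of creating one field per term.
    """

    def __init__(self):
        # term -> list of (doc_id, boost) postings
        self.postings = defaultdict(list)

    def add(self, doc_id, term_boosts):
        """Index a document given a mapping of term -> boost."""
        for term, boost in term_boosts.items():
            self.postings[term].append((doc_id, boost))

    def search(self, term):
        """Return (doc_id, boost) hits for a term, highest boost first."""
        hits = self.postings.get(term, [])
        return sorted(hits, key=lambda hit: hit[1], reverse=True)


if __name__ == "__main__":
    index = ToyPayloadIndex()
    # Boost values reuse the ones from the question.
    index.add(1, {"foo": 2.0, "bar": 1.2, "foobar": 1.8})
    index.add(2, {"foo": 1.5})
    print(index.search("foo"))
```

With this layout there is only one field, so the per-field norm cost does not grow with the number of distinct tags.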


-Mike

