Re: [jira] Commented: (LUCENE-831) Complete overhaul of FieldCache API/Implementation

robert engels Sun, 07 Dec 2008 07:56:23 -0800

One thing to keep in mind about using the field cache for filtercaching.

The filter bitset cache at worst holds 8 documents per byte (and withbitset compression this can be even more efficient).

Using the field cache is going to rather be bytes per document, mostlikely at least an order of magnitude greater, and maybe 2 (if usingstrings fields, due to object overhead, length of string, plus themanagement fields...).

This is why I think for many users the field cache is not the bestsolution. If you have lots of documents but searchers that returnrelatively few, then using filters and sorting the results usingstored fields is far more efficient.

It seems to me that the field cache is only appropriate when thedocuments have very few fields in play (1-3 ?), otherwise cachedrange filters will work better. If we also have partitioned (triequery) and compressed filters, then the cache is only useful forsorting.

The most important use for the field cache seems to be the case wherea query returns lots of documents, say by date range, AND you wantthe most recent ones to score higher (needing the sort) - basicallyusing the cache for the selection and the sort.


On Dec 7, 2008, at 3:42 AM, Michael McCandless wrote:

Mark Miller wrote:
MultiSearcher has a few aspects I don't like.
Do you mean the score differences vs IndexSearcher(MultiReader),or is there something else?
And rewrite does not work properly. And to get 30 docs over 3indexes, you ask for 90. And sort twice.
I'm thinking we stick with MultiReader, but improve it so that whensorting by fields can use a [new FieldCache like] API such thatgives it the benefits that MultiSearcher has. Ie, best of bothworlds.
Mike

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Commented: (LUCENE-831) Complete overhaul of FieldCache API/Implementation

Reply via email to