Re: Slow response times using *:*

Mike Klaas Thu, 31 Jan 2008 16:19:19 -0800

On 31-Jan-08, at 9:41 AM, Andy Blower wrote:

Yonik Seeley wrote:
This surprises me because the filter query submitted has usually
already been submitted along with a normal query, and so should be
cached in the filter cache. Surely all solr needs to do is return a
handful of fields for the first 100 records in the list from the
cache - or so I thought.

To calculate the DocSet (the set of all documents matching *:* and
your filters), Solr can just use its caches as long as *:* and the
filters have been used before.
*But*, to retrieve the top 10 documents matching *:* and your filters,
the query must be re-run.  That is probably where the time is being
spent. Since you aren't looking for relevancy scores at all, but just
faceting, it seems like we could potentially optimize this in Solr.

I'm actually retrieving the first 100 in my tests, which will be
necessary in one of the two scenarios we use blank queries for. The
other scenario doesn't require any docs at all - just the facets, and
I've not put that in my tests. What would the situation be if I
specified a sort order for the facets and/or retrieved no docs at all?
I'd be sorting the facets alphabetically, which is currently done by
my app rather than the search engine. (since I sometimes have to merge
facets from more than one field)

First question: What is the use of retrieving 100 documents if there
is no defined sort order?

The situation could be optimized in Solr, but there is a related case
that _is_ optimized that should be almost as fast. If you


a) don't ask for document score in field list (fl)
b) enable <useFilterForSortedQuery> in solrconfig.xml
c) specify _some_ sort order other than score

Then Solr will do cached bitset intersections only. It will also do
sorting, but that may not be terribly expensive. If it is close to
the desired performance, it would be relatively easy to patch solr to
not do that step.


(Note: this is query sort, not facet sort).
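
As a rough illustration of conditions (a) and (c), here is a minimal
Python sketch that builds the request parameters for such a query. The
helper name and the example field names (id, type) are hypothetical,
not part of Solr. Condition (b) is a server-side setting in
solrconfig.xml and cannot be expressed in the request itself.

```python
from urllib.parse import urlencode


def build_filter_only_query(filters, sort_field, rows=100, fields=("id",)):
    """Build a query string for a match-all request that avoids scoring.

    Hypothetical helper: field names are placeholders for your schema.
    """
    # (a) do not ask for the document score in the field list
    if "score" in fields:
        raise ValueError("leave 'score' out of fl")
    # (c) require some sort order other than score
    if not sort_field.strip() or sort_field.split()[0].lower() == "score":
        raise ValueError("sort on a field other than score")

    params = [("q", "*:*")]
    params += [("fq", f) for f in filters]  # one fq parameter per filter
    params.append(("fl", ",".join(fields)))
    params.append(("sort", sort_field))
    params.append(("rows", str(rows)))
    # (b) is not a request parameter: it assumes
    # <useFilterForSortedQuery>true</useFilterForSortedQuery>
    # is set in solrconfig.xml.
    return urlencode(params)


query_string = build_filter_only_query(["type:book"], "id asc")
```

The resulting string would be appended to the select handler URL of
your own Solr instance.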

I had assumed that no doc would be considered more relevant than any
other without any query terms - i.e. filter query terms wouldn't
affect relevance. This seems sensible to me, but maybe that's only
because our current search engine works that way.

It won't, but it will still try to calculate the score if you ask it
to (all docs will score the same, though).

Regarding optimization, I certainly think that being able to access
all facets for subsets of the indexed data (defined by the filter
query) is an incredibly useful feature. My search engine usage may not
be very common though. What it means to us is that we can drive all
aspects of our sites from the search engine, not just the obvious
search forms.

I also use this feature. It would be useful to optimize the case
where rows=0.
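
For reference, a facets-only request (rows=0) and the client-side
alphabetical merge described earlier in the thread could be sketched
in Python as below. Both function names and the field names are
hypothetical. The merge assumes the facet counts have already been
parsed into flat [value, count, value, count, ...] lists per field,
and that summing counts for a value seen in several fields is what
the application wants; neither assumption comes from this thread.

```python
from urllib.parse import urlencode


def build_facet_only_query(filters, facet_fields):
    """Query string for a match-all request that returns facets only."""
    params = [("q", "*:*"), ("rows", "0"), ("facet", "true")]
    params += [("fq", f) for f in filters]
    params += [("facet.field", f) for f in facet_fields]
    return urlencode(params)


def merge_facets(facet_fields):
    """Merge facet counts from several fields and sort by value.

    facet_fields maps a field name to a flat list of alternating
    values and counts (assumed input shape).
    """
    merged = {}
    for flat in facet_fields.values():
        # Pair each value (even index) with its count (odd index).
        for value, count in zip(flat[0::2], flat[1::2]):
            merged[value] = merged.get(value, 0) + count
    # Tuples sort by their first element, i.e. alphabetically by value.
    return sorted(merged.items())


merged = merge_facets({
    "author": ["smith", 2, "adams", 1],
    "editor": ["smith", 3],
})
```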


-Mike
