Faceting optimization

Sébastien Lamy Mon, 07 Sep 2009 03:38:26 -0700

Hi

I'm currently trying to optimize the response time of my solr server.
I found one aberration and hope you may be able to help me solve it:

If, considering the whole document index, there is a lot of possiblevalues for a field, asking for facet on that field dramatically increaseresponse time. Even if the search returns only one document, with onlyone facet value for that field. This is shown by the three requests atthe bottom of this mail.

It seems to me that solr looks at all the possible values in the wholeindex for the faceted field. Whereas it should look at the possiblevalues only for the documents in the results, wich would be a lotfaster. Is there a way asking him to do so?



---
Let's look at this three requests:

1- This request returns only one document and take 3ms
http://localhost:8983/solr/select/?
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22)+AND+type_s:[T0+TO+T99999]

2- This request returns one document, and its facets for one field. Ittakes about 1000ms. The facet on a_10_alpha_sort returns only one value:"air du temps". But overall the whole index, there is a lot of values(>10 000) for a_10_alpha_sort.

http://localhost:8983/solr/select/?
facet=true&
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22)+AND+type_s:[T0+TO+T99999]&
facet.field=a_10_alpha_sort&
f.a_10_alpha_sort.facet.mincount=1&
f.a_10_alpha_sort.facet.sort=true&
f.a_10_alpha_sort.facet.limit=8&

3- This request includes the value "air du temps" in the search string.It takes 3ms

http://localhost:8983/solr/select/?
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22+AND+a_10_alpha_sort:"air+du+temps")+AND+type_s:[T0+TO+T99999]

Here is the description of the faceted field in my schema: this is asingle-valued field, with no tokens.

<dynamicField name="*_alpha_sort" type="alphaOnlySort" indexed="true"stored="false" multivalued="false"/><fieldType name="alphaOnlySort" class="solr.TextField"sortMissingLast="true" omitNorms="true">

 <analyzer>
   <!-- KeywordTokenizer does no actual tokenizing, so the entire
        input string is preserved as a single token -->
   <tokenizer class="solr.KeywordTokenizerFactory"/>
   <filter class="solr.ISOLatin1AccentFilterFactory" />
   <filter class="solr.LowerCaseFilterFactory" />
   <filter class="solr.TrimFilterFactory" />
 </analyzer>
</fieldType>

Faceting optimization

Reply via email to