[jira] [Commented] (SOLR-4763) Performance issue when using group.facet=true
[ https://issues.apache.org/jira/browse/SOLR-4763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14681683#comment-14681683 ]

Ovidiu Mihalcea commented on SOLR-4763:
---

Would really need some good news with this. We really need result grouping with faceting and this is really slowing our site... :(

Performance issue when using group.facet=true
---
Key: SOLR-4763
URL: https://issues.apache.org/jira/browse/SOLR-4763
Project: Solr
Issue Type: Bug
Affects Versions: 4.2
Reporter: Alexander Koval
Assignee: Erick Erickson
Fix For: 5.3, Trunk
Attachments: SOLR-4763.patch, SOLR-4763.patch, SOLR-4763.patch

I do not know whether this is a bug or not, but calculating facets with {{group.facet=true}} is too slow.

I have a query that returns:
{code}
matches: 730597,
ngroups: 24024,
{code}

1. All queries with {{group.facet=true}}:
{code}
QTime: 5171
facet: {
    time: 4716
{code}

2. Without {{group.facet}}:
* First query:
{code}
QTime: 3284
facet: {
    time: 3104
{code}
* Next queries:
{code}
QTime: 230,
facet: {
    time: 76
{code}

So I think that with {{group.facet=true}} Solr doesn't use a cache to calculate facets: every such query takes about 5 seconds, while without it only the first query is slow.

Is it possible to improve the performance of facets when {{group.facet=true}}?

--
This message was sent by Atlassian JIRA (v6.3.4#6332)

To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
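The comparison in the report above (the same query with and without {{group.facet}}) can be reproduced with two request URLs that differ only in that one parameter. The following is a minimal sketch, not taken from the issue: the host, collection name, and the field names `product_id` and `brand` are assumptions for illustration, and `debugQuery=true` is added on the assumption that the per-component `facet: { time: ... }` figures come from Solr's debug timing output.

```python
from urllib.parse import parse_qs, urlencode, urlparse

# Hypothetical endpoint; the issue does not name a host or collection.
BASE_URL = "http://localhost:8983/solr/my_collection/select"


def build_query(group_facet):
    """Build a grouped, faceted select URL, optionally with group.facet=true."""
    params = [
        ("q", "*:*"),
        ("wt", "json"),
        ("group", "true"),
        ("group.field", "product_id"),  # hypothetical grouping field
        ("group.ngroups", "true"),      # report ngroups, as in the issue
        ("facet", "true"),
        ("facet.field", "brand"),       # hypothetical facet field
        ("debugQuery", "true"),         # assumed source of the timing block
    ]
    if group_facet:
        params.append(("group.facet", "true"))
    return BASE_URL + "?" + urlencode(params)


# Show that the two URLs differ only in the group.facet parameter.
for flag in (False, True):
    url = build_query(flag)
    print(parse_qs(urlparse(url).query).get("group.facet"), url)
```

Sending each URL several times and comparing QTime on the repeated requests would show whether the second and later runs get faster, which is the behaviour the reporter sees only without {{group.facet}}.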
[jira] [Commented] (SOLR-7036) Faster method for group.facet
[ https://issues.apache.org/jira/browse/SOLR-7036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14680217#comment-14680217 ]

Ovidiu Mihalcea commented on SOLR-7036:
---

Hello guys,

Would really need some good news with this. We really need result grouping with faceting and this is really slowing our site...

Thanks a lot!

Faster method for group.facet
---
Key: SOLR-7036
URL: https://issues.apache.org/jira/browse/SOLR-7036
Project: Solr
Issue Type: Improvement
Components: faceting
Affects Versions: 4.10.3
Reporter: Jim Musil
Assignee: Erick Erickson
Fix For: 5.3, Trunk
Attachments: SOLR-7036.patch, SOLR-7036.patch

This is a patch that speeds up the performance of requests made with group.facet=true. The original code that collects and counts unique facet values for each group does not use the same improved field cache methods that have been added for normal faceting in recent versions.

Specifically, this approach leverages the UninvertedField class, which provides a much faster way to look up docs that contain a term. I've also added a simple grouping map so that when a term is found for a doc, it can quickly look up the group to which it belongs.

Group faceting was very slow for our data set, and when the number of docs or terms was high, the latency spiked to multi-second requests. This solution provides better overall performance -- from an average of 54ms to 32ms. It also dropped our slowest performing queries way down -- from 6012ms to 991ms.

I also added a few tests.

I added an additional parameter so that you can choose to use this method or the original. Add group.facet.method=fc to use the improved method, or group.facet.method=original, which is the default if not specified.
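To make the idea of the grouping map concrete, here is a small sketch of group faceting in plain Python. It is not the patch's code and does not use Solr's UninvertedField; the function name, the dictionaries, and the toy data are all assumptions for illustration. It only shows the counting rule: a facet term is counted once per distinct group that contains it, and the doc-to-group map makes that lookup a constant-time step.

```python
from collections import defaultdict


def group_facet_counts(matching_docs, doc_to_group, doc_to_terms):
    """For each facet term, count the distinct groups that contain it."""
    groups_per_term = defaultdict(set)
    for doc in matching_docs:
        group = doc_to_group[doc]  # grouping map: doc id -> group value
        for term in doc_to_terms.get(doc, ()):
            groups_per_term[term].add(group)
    return {term: len(groups) for term, groups in groups_per_term.items()}


# Toy index (hypothetical): four docs in three groups.
doc_to_group = {0: "g1", 1: "g1", 2: "g2", 3: "g3"}
doc_to_terms = {0: ["red"], 1: ["red", "blue"], 2: ["red"], 3: ["green"]}

counts = group_facet_counts([0, 1, 2, 3], doc_to_group, doc_to_terms)
print(counts)
```

In this toy data "red" appears in three documents but only two groups, so plain faceting would report 3 while group faceting reports 2.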
[jira] [Commented] (SOLR-7495) Unexpected docvalues type NUMERIC when grouping by a int facet
[ https://issues.apache.org/jira/browse/SOLR-7495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14661934#comment-14661934 ]

Ovidiu Mihalcea commented on SOLR-7495:
---

Any news on when this will be solved? For now, Solr 5 seems much, much more buggy than 4.x...

Unexpected docvalues type NUMERIC when grouping by a int facet
---
Key: SOLR-7495
URL: https://issues.apache.org/jira/browse/SOLR-7495
Project: Solr
Issue Type: Bug
Affects Versions: 5.1
Reporter: Fabio Batista da Silva

Hey All,

After upgrading from Solr 4.10 to 5.1 with SolrCloud, I'm getting an IllegalStateException when I try to facet on an int field.

IllegalStateException: unexpected docvalues type NUMERIC for field 'year' (expected=SORTED). Use UninvertingReader or index with docvalues.

schema.xml
{code}
<?xml version="1.0" ?>
<schema name="schema" version="1.2">
  <fields>
    <!-- SolrCloud version field -->
    <field name="_version_" type="long" indexed="true" stored="true"/>
    <!-- Common fields -->
    <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/>
    <field name="index_type" type="string" indexed="true" stored="true" multiValued="false" required="true"/>
    <field name="year" type="int" indexed="true" stored="true"/>
    <field name="model" type="string" indexed="true" stored="true"/>
    <field name="year_make_model" type="string" indexed="true" stored="true"/>
  </fields>
  <!-- Field Types -->
  <types>
    <fieldType name="string" class="solr.StrField" sortMissingLast="true" />
    <fieldType name="boolean" class="solr.BoolField" sortMissingLast="true"/>
    <fieldType name="int" class="solr.TrieIntField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="float" class="solr.TrieFloatField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="long" class="solr.TrieLongField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="double" class="solr.TrieDoubleField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="date" class="solr.TrieDateField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="text_ngram" class="solr.TextField" positionIncrementGap="100">
      <analyzer type="index">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15"/>
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>
    <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">
      <analyzer type="index">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15"/>
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>
    <fieldType name="location_rpt" class="solr.SpatialRecursivePrefixTreeFieldType" geo="true" distErrPct="0.025" maxDistErr="0.09" units="degrees" />
  </types>
  <uniqueKey>id</uniqueKey>
  <defaultSearchField>name</defaultSearchField>
  <solrQueryParser defaultOperator="OR"/>
</schema>
{code}

query :
{code}
http://solr.dev:8983/solr/my_collection/select?wt=json&fl=id&fq=index_type:foobar&group=true&group.field=year_make_model&group.facet=true&facet=true&facet.field=year
{code}

Exception :
{code}
null:org.apache.solr.common.SolrException: Exception during facet.field: year
    at org.apache.solr.request.SimpleFacets$3.call(SimpleFacets.java:627)
    at org.apache.solr.request.SimpleFacets$3.call(SimpleFacets.java:612)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at org.apache.solr.request.SimpleFacets$2.execute(SimpleFacets.java:566)
    at org.apache.solr.request.SimpleFacets.getFacetFieldCounts(SimpleFacets.java:637)
    at
[jira] [Commented] (SOLR-7495) Unexpected docvalues type NUMERIC when grouping by a int facet
[ https://issues.apache.org/jira/browse/SOLR-7495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14538019#comment-14538019 ]

Ovidiu Mihalcea commented on SOLR-7495:
---

I'm having the same problem on Solr 5.1, without using SolrCloud. The problem seemed to go away when I made the field I was trying to facet on multivalued, but then the logs were full of:

Error creating FieldCache for field: brand_id

We wanted to upgrade to Solr 5.1, but we cannot get past this bug.

Unexpected docvalues type NUMERIC when grouping by a int facet
---
Key: SOLR-7495
URL: https://issues.apache.org/jira/browse/SOLR-7495
Project: Solr
Issue Type: Bug
Affects Versions: 5.1
Reporter: Fabio Batista da Silva

Hey All,

After upgrading from Solr 4.10 to 5.1 with SolrCloud, I'm getting an IllegalStateException when I try to facet on an int field.

IllegalStateException: unexpected docvalues type NUMERIC for field 'year' (expected=SORTED). Use UninvertingReader or index with docvalues.

schema.xml
{code}
<?xml version="1.0" ?>
<schema name="schema" version="1.2">
  <fields>
    <!-- SolrCloud version field -->
    <field name="_version_" type="long" indexed="true" stored="true"/>
    <!-- Common fields -->
    <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/>
    <field name="index_type" type="string" indexed="true" stored="true" multiValued="false" required="true"/>
    <field name="year" type="int" indexed="true" stored="true"/>
    <field name="model" type="string" indexed="true" stored="true"/>
    <field name="year_make_model" type="string" indexed="true" stored="true"/>
  </fields>
  <!-- Field Types -->
  <types>
    <fieldType name="string" class="solr.StrField" sortMissingLast="true" />
    <fieldType name="boolean" class="solr.BoolField" sortMissingLast="true"/>
    <fieldType name="int" class="solr.TrieIntField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="float" class="solr.TrieFloatField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="long" class="solr.TrieLongField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="double" class="solr.TrieDoubleField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="date" class="solr.TrieDateField" precisionStep="0" positionIncrementGap="0"/>
    <fieldType name="text_ngram" class="solr.TextField" positionIncrementGap="100">
      <analyzer type="index">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15"/>
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>
    <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">
      <analyzer type="index">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15"/>
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>
    <fieldType name="location_rpt" class="solr.SpatialRecursivePrefixTreeFieldType" geo="true" distErrPct="0.025" maxDistErr="0.09" units="degrees" />
  </types>
  <uniqueKey>id</uniqueKey>
  <defaultSearchField>name</defaultSearchField>
  <solrQueryParser defaultOperator="OR"/>
</schema>
{code}

query :
{code}
http://solr.dev:8983/solr/my_collection/select?wt=json&fl=id&fq=index_type:foobar&group=true&group.field=year_make_model&group.facet=true&facet=true&facet.field=year
{code}

Exception :
{code}
null:org.apache.solr.common.SolrException: Exception during facet.field: year
    at org.apache.solr.request.SimpleFacets$3.call(SimpleFacets.java:627)
    at org.apache.solr.request.SimpleFacets$3.call(SimpleFacets.java:612)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)