Re: Must QueryComponent always be on and other Design Questions

Grant Ingersoll Mon, 20 Oct 2008 17:01:38 -0700

For completeness, here's the NPE:
SEVERE: java.lang.NullPointerException
        at org.apache.solr.common.util.StrUtils.splitSmart(StrUtils.java:37)

atorg.apache.solr.search.OldLuceneQParser.parse(LuceneQParserPlugin.java:104)

        at org.apache.solr.search.QParser.getQuery(QParser.java:88)

atorg.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:82)atorg.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:149)atorg.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)atorg.apache.solr.handler.clustering.ClusteringComponentTest.testComponent(ClusteringComponentTest.java:70)

Don't worry about the ClusteringComponentTest yet, I haven't postedthat code yet.


On Oct 20, 2008, at 7:56 PM, Grant Ingersoll wrote:

I've run into this a couple of times now and I feel like it warrantsa discussion
For both the SpellCheckComponent (SCC) and now for the newClusteringComponent (SOLR-769) I think there are cases where theQueryComponent (QC) is not required. In the SpellCheckComponentcase it is when building the spelling index. In theClusteringComponent, it is possible to ask for document clusterswithout running any query (it also will be possible to get clusters_with_ a query as well, and it also is distinguished from thehandling of search results clustering, too). Thus, it seems reallyweird to have to pass in a dummy query, yet that is what one has todo in order to avoid getting an NPE in the QC.
Now, I suppose these pieces could be modeled as something else orit's possible to split the two functionalities into separate things(1 ReqHandler, 1 SearchComp). In fact, the said functionality isnot really "search" functionality, or SearchComponent functionality,yet much of the rest of the functionality in the code in question is"search" functionality and logically belongs as a SearchComponent.In the case of the SCC build, it's akin to an indexing operation.In the clustering case, it's a query, albeit a non-traditional one.In some sense, this kind of document clustering is like non-querybased faceting which leads to more navigation/browsing instead ofsearching.
The quick fix is to just put in null checks into the QC or pass in adummy query with rows=0, but I'm not sure if there isn't a slightlybigger picture here that needs adjusting in terms ofSearchComponents. Namely, must the QC always be on? And, should wethink a little more about components that don't require a query inorder to function and how they play in the scheme of things?
Thoughts?  Recommendations?

-Grant

Re: Must QueryComponent always be on and other Design Questions

Reply via email to