[ 
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated SOLR-2894:
---------------------------

    Attachment: SOLR-2894_cloud_test.patch

bq. I'm not able to reproduce this. Could you tell me a little more about your 
setup?

trunk, with patch applied, build the example and then run the [Simple Two-Shard 
Cluster|https://cwiki.apache.org/confluence/display/solr/Getting+Started+with+SolrCloud#GettingStartedwithSolrCloud-SimpleTwo-ShardClusterontheSameMachine]
 ...

{noformat}
hossman@frisbee:~/lucene/dev/solr$ cp -r example node1
hossman@frisbee:~/lucene/dev/solr$ cp -r example node2

# in term1...
hossman@frisbee:~/lucene/dev/solr/node1$ java -DzkRun -DnumShards=2 
-Dbootstrap_confdir=./solr/collection1/conf -Dcollection.configName=myconf -jar 
start.jar

# wait for node1 startup, then in term2...
hossman@frisbee:~/lucene/dev/solr/node2$ java -Djetty.port=7574 
-DzkHost=localhost:9983 -jar start.jar

# wait for node2 startup, then in term3...
hossman@frisbee:~/lucene/dev/solr/example/exampledocs$ java -jar post.jar *.xml
SimplePostTool version 1.5
Posting files to base url http://localhost:8983/solr/update using content-type 
application/xml..
...
14 files indexed.
COMMITting Solr index changes to http://localhost:8983/solr/update..
Time spent: 0:00:01.763
hossman@frisbee:~/lucene/dev/solr/example/exampledocs$ curl 
'http://localhost:8983/solr/select?q=*:*&sort=id+desc&rows=2&facet=true&facet.pivot=cat,manu_+id_s,inStock&facet.limit=3'
 > /dev/null

# watch the logs in term1 and term2 go spinning like mad
{noformat}

----

bq. While the size of the shard parameters may not strictly be as efficient as 
possible, is it such that we can run with that for now and circle back to this 
at a later point, or are you uncomfortable with including the parameters as is 
in the initial commit?

Hmm... not sure how i feel about it w/o more testing - from what i was seeing, 
with non-trivial field names, term values, and facet.limit the refinements 
requests were getting *HUGE* so I suspect it's something we're going to want to 
tackle before releasing -- but refactoring it to be smaller definitely seems 
like something that should be a lower priority to some of the correctness 
related issues we're finding, and adding more tests (so we can be confident the 
refactoring is correct)

----

I'm attaching a "SOLR-2894_cloud_test.patch" that contains a new cloud based 
randomized test i've been working at off and on over the last few days (I 
created it as a standalone patch because i didn't want to conflict with 
anything Brett might be in the middle of, and it was easy to do - kept me 
focused on the test and not dabbling with the internals).  

The test builds up a bunch of random docs, then does a handfull of random pivot 
facet queries.  For each pivot query, it recursively walks the pivot response 
executing verification queries using "fq" params it builds up from the pivot 
constraints -- so if pivot.facet=a,b,c says that "a" has a term "x" with 4 
matching docs, it adds an "fq=a:x" to the original query and checks the count; 
then it looks a the pivot terms for field "b" under "a:x" and also executes a 
query for each of them with another fq added, etc...

As is, the patch currently passes, but that's only because of a few nocommits...

* randomization of mincount is disabled due to the refinement bug i mentioned 
before
* it's currently only doing pivots on 2 string fields (one multivalued and one 
single valued) ... any attempts at pivot faceting the numeric/date/boolean 
fields (already included in the docs) causes an NPE in the SolrJ QueryResponse 
class (i haven't investigated why yet)



> Implement distributed pivot faceting
> ------------------------------------
>
>                 Key: SOLR-2894
>                 URL: https://issues.apache.org/jira/browse/SOLR-2894
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erik Hatcher
>             Fix For: 4.9, 5.0
>
>         Attachments: SOLR-2894-reworked.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894_cloud_test.patch, dateToObject.patch, pivot_mincount_problem.sh
>
>
> Following up on SOLR-792, pivot faceting currently only supports 
> undistributed mode.  Distributed pivot faceting needs to be implemented.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to