[ 
https://issues.apache.org/jira/browse/SOLR-10678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16008598#comment-16008598
 ] 

Dawid Weiss commented on SOLR-10678:
------------------------------------

Clustering should run once, after the search results have been collected. In 
non-distributed mode this happens in {{process}} (as the tests show). In 
distributed mode (this is my assumption, partially confirmed by running the 
distributed tests) the search request is routed to the shards, each shard 
calls {{process}}, and the results are then merged in {{finishStage}}. Since 
clustering is only needed on the merged results and not on each shard, I 
think we can skip clustering in {{process}} if we're sure the request is part 
of a distributed request that will eventually be finalized in 
{{finishStage}}.

But all of the above is based on my vague understanding of how distributed 
search works internally, so I'd like to get some confirmation that it's 
actually correct before I try to change the code.
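
To make the proposed control flow concrete, here is a rough sketch in plain 
Java. It is a toy model, not Solr code: the Request and Component classes, 
the isShardRequest flag and the simulate* helpers are all hypothetical 
stand-ins, and it assumes (as described above, unconfirmed) that a shard can 
tell it is serving part of a distributed request.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model only: every class and field here is a hypothetical stand-in,
// not part of Solr's API.
final class ClusteringSketch {

    /** Minimal stand-in for a search request. */
    static final class Request {
        // Assumption: true when a coordinator forwarded this request to a shard.
        final boolean isShardRequest;
        final List<String> docs = new ArrayList<>();

        Request(boolean isShardRequest) {
            this.isShardRequest = isShardRequest;
        }
    }

    /** Minimal stand-in for the clustering component. */
    static final class Component {
        int clusteringRuns = 0;

        /** Called on every node that executes the search. */
        void process(Request req) {
            if (req.isShardRequest) {
                // Skip: the merged results are clustered later in finishStage.
                return;
            }
            cluster(req.docs);
        }

        /** Called once on the coordinator after shard results are merged. */
        void finishStage(Request merged) {
            cluster(merged.docs);
        }

        private void cluster(List<String> docs) {
            clusteringRuns++; // placeholder for the actual clustering work
        }
    }

    /** Simulates one distributed request; returns how often clustering ran. */
    static int simulateDistributed(int shardCount) {
        Component component = new Component();
        Request merged = new Request(false);
        for (int i = 0; i < shardCount; i++) {
            Request shardRequest = new Request(true);
            shardRequest.docs.add("doc-from-shard-" + i);
            component.process(shardRequest);   // clustering is skipped here
            merged.docs.addAll(shardRequest.docs);
        }
        component.finishStage(merged);         // clustering runs exactly once
        return component.clusteringRuns;
    }

    /** Simulates one non-distributed request; returns how often clustering ran. */
    static int simulateSingleNode() {
        Component component = new Component();
        Request request = new Request(false);
        request.docs.add("doc-0");
        component.process(request);            // clustering runs in process
        return component.clusteringRuns;
    }

    public static void main(String[] args) {
        System.out.println("distributed runs: " + simulateDistributed(3));
        System.out.println("single-node runs: " + simulateSingleNode());
    }
}
```

In this model clustering runs exactly once per request in both modes. Without 
the early return in process, the distributed case would cluster once per 
shard plus once more on the merged results.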

> Clustering can be executed multiple times in distributed mode
> -------------------------------------------------------------
>
>                 Key: SOLR-10678
>                 URL: https://issues.apache.org/jira/browse/SOLR-10678
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Dawid Weiss
>            Assignee: Dawid Weiss
>            Priority: Minor
>
> As reported on SO: 
> http://stackoverflow.com/questions/43877284/how-does-solr-clustering-component-work/43937064#43937064



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
