[
https://issues.apache.org/jira/browse/SOLR-10678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16008598#comment-16008598
]
Dawid Weiss commented on SOLR-10678:
------------------------------------
Clustering should run when the search results are collected. In normal mode,
this happens in {{process}} (as the tests show), but in distributed mode (as I
assume and partially confirmed by running the distrib. tests) the search
request is routed to collect the results, each shard calls {{process}} and then
everything is collected in {{finishStage}}. Since we don't need to run
clustering on each shard, only once the results are collected, I think we can
skip clustering in {{process}} if we're sure it's part of a distributed request
that will eventually be finalized in {{finishStage}}.
But all the above is based on my vague understanding of how it works
internally, so I'd like to get some confirmation that it's actually correct
before I try to change the code.
> Clustering can be executed multiple times in distributed mode
> -------------------------------------------------------------
>
> Key: SOLR-10678
> URL: https://issues.apache.org/jira/browse/SOLR-10678
> Project: Solr
> Issue Type: Bug
> Security Level: Public(Default Security Level. Issues are Public)
> Reporter: Dawid Weiss
> Assignee: Dawid Weiss
> Priority: Minor
>
> As reported on SO:
> http://stackoverflow.com/questions/43877284/how-does-solr-clustering-component-work/43937064#43937064
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]