[
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13894547#comment-13894547
]
Trey Grainger commented on SOLR-2894:
-------------------------------------
FYI, the last distributed pivot facet patch functionally works, but there are
some sub-optimal data structures being used and some unnecessary duplicate
processing of values. As a result, we found that for certain worst-case
scenarios (i.e. data is not randomly distributed across Solr cores and requires
significant refinement) pivot facets with multiple levels could take over a
minute to aggregate and process results. This was using a dataset of several
hundred million documents and dozens of pivot facets across 120 Solr cores
distributed over 20 servers, so it is a more extreme use-case than most will
encounter.
Nevertheless, we've refactored the code and data structures and brought the
processing time from over a minute down to less than a second using the above
configuration. We plan to post the patch within the next week.
> Implement distributed pivot faceting
> ------------------------------------
>
> Key: SOLR-2894
> URL: https://issues.apache.org/jira/browse/SOLR-2894
> Project: Solr
> Issue Type: Improvement
> Reporter: Erik Hatcher
> Fix For: 4.7
>
> Attachments: SOLR-2894-reworked.patch, SOLR-2894.patch,
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch,
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch,
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch,
> SOLR-2894.patch
>
>
> Following up on SOLR-792, pivot faceting currently only supports
> undistributed mode. Distributed pivot faceting needs to be implemented.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]