[
https://issues.apache.org/jira/browse/SOLR-8965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joel Bernstein updated SOLR-8965:
---------------------------------
Summary: Add Path reduce operation to aggregate paths in a session (was:
Add Path reduce operation to aggregate paths taken in a session)
> Add Path reduce operation to aggregate paths in a session
> ---------------------------------------------------------
>
> Key: SOLR-8965
> URL: https://issues.apache.org/jira/browse/SOLR-8965
> Project: Solr
> Issue Type: New Feature
> Reporter: Joel Bernstein
>
> One of the things it's tricky to do at scale is session aggregation.
> MapReduce of course makes this easy. Now that we have MapReduce it would be
> good to add some session aggregations to the base library.
> The Path reduce operation can be used with the *reduce* function to
> concatenate the path taken in a session into a single field. These path
> records can then be added to another SolrCloud collection using the update
> stream. Once they have been consolidated in another collection aggregations
> can be run on the paths using the RollupStream.
> A HashRollupStream could also be developed to aggregate the paths as they are
> reduced. The HashRollupStream would keep all the paths in a hash map during
> the aggregation so it would not require the paths to be received in order.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]