[ 
https://issues.apache.org/jira/browse/SOLR-8965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joel Bernstein updated SOLR-8965:
---------------------------------
    Summary: Add Path reduce operation to aggregate paths in a session  (was: 
Add Path reduce operation to aggregate paths taken in a session)

> Add Path reduce operation to aggregate paths in a session
> ---------------------------------------------------------
>
>                 Key: SOLR-8965
>                 URL: https://issues.apache.org/jira/browse/SOLR-8965
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Joel Bernstein
>
> Session aggregation is one of the things that is tricky to do at scale. 
> MapReduce, of course, makes this easy. Now that we have MapReduce, it would 
> be good to add some session aggregations to the base library. 
> The Path reduce operation can be used with the *reduce* function to 
> concatenate the path taken in a session into a single field. These path 
> records can then be added to another SolrCloud collection using the update 
> stream. Once they have been consolidated in that collection, aggregations 
> can be run on the paths using the RollupStream.
> A HashRollupStream could also be developed to aggregate the paths as they are 
> reduced. The HashRollupStream would keep all the paths in a hash map during 
> the aggregation, so it would not require the paths to be received in order.
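
The two ideas in the description can be illustrated outside Solr with a minimal Python sketch. It is not the Streaming API and not the proposed implementation: the function names (reduce_paths, hash_rollup), the field names (session_id, page), and the "/" separator are all hypothetical, chosen only for illustration. The first function assumes events arrive sorted by session, as a reduce step would; the second keeps every path in a hash map, so its input order does not matter.

```python
from collections import Counter
from itertools import groupby
from operator import itemgetter


def reduce_paths(events, session_field="session_id", page_field="page", sep="/"):
    """Concatenate the pages visited in each session into a single 'path' field.

    Assumes events are already sorted by session (and by time within a
    session), since groupby only merges adjacent records with the same key.
    """
    for session, group in groupby(events, key=itemgetter(session_field)):
        yield {session_field: session, "path": sep.join(e[page_field] for e in group)}


def hash_rollup(path_records):
    """Count how often each path occurs, holding all paths in a hash map.

    Because every path is kept in memory, records may arrive in any order.
    """
    return Counter(record["path"] for record in path_records)


# Hypothetical click events, sorted by session.
events = [
    {"session_id": "s1", "page": "home"},
    {"session_id": "s1", "page": "search"},
    {"session_id": "s1", "page": "cart"},
    {"session_id": "s2", "page": "home"},
    {"session_id": "s2", "page": "search"},
    {"session_id": "s2", "page": "cart"},
    {"session_id": "s3", "page": "home"},
]

paths = list(reduce_paths(events))
counts = hash_rollup(paths)
```

Here paths holds one record per session, and counts maps each distinct path to the number of sessions that followed it. The trade-off matches the one in the description: the hash-based rollup needs no sorted input but must fit all distinct paths in memory.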


