Joel Bernstein created SOLR-8965:
------------------------------------
Summary: Add Path reduce operation to aggregate paths taken in a
session
Key: SOLR-8965
URL: https://issues.apache.org/jira/browse/SOLR-8965
Project: Solr
Issue Type: New Feature
Reporter: Joel Bernstein
One of the things it's tricky to do at scale is session aggregation. MapReduce
of course makes this easy. Now that we have MapReduce it would be good to add
some session aggregations to the base library.
The Path reduce operation can be used with the *reduce* function to concatenate
the path taken in a session into a single field. These path records can then be
added to another SolrCloud collection using the update stream. Once they have
been consolidated in another collection aggregations can be run on the paths
using the RollupStream.
A HashRollupStream could also be developed to aggregate the paths as they are
reduced. The HashRollupStream would keep all the paths in a hash map during the
aggregation so it would not require the paths to be received in order.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]