[ 
https://issues.apache.org/jira/browse/HADOOP-5060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12676065#action_12676065
 ] 

Ari Rabkin commented on HADOOP-5060:
------------------------------------

There's substantial interest at Berkeley in doing some sort of in-collector 
aggregation to get very short latencies. It would be a big help if the code 
written for this JIRA were modular enough to be pulled out of the map-reduce 
framework and run separately. It's already pretty straightforward to extract 
Records in the collector.

> Create a generic aggregator for Chukwa
> --------------------------------------
>
>                 Key: HADOOP-5060
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5060
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: contrib/chukwa
>            Reporter: Jerome Boulon
>            Assignee: Jerome Boulon
>
> Create a generic way to compute aggregations on top of chukwaRecords based on 
> a config file.
> Should be able to:
> - Work on several Chukwa streams
> - Aggregate by time period
> - Group by values for specific keys
> - Provide a predefined list of functions (AVG, MIN, MAX, Counter->Rate 
> conversion...)
> - Work with new functions
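
To make the requirements above concrete, here is a minimal sketch of a config-driven aggregator in Python. It is not Chukwa code and does not use any Chukwa API: records are modeled as plain dicts, and every name in it (`FUNCTIONS`, `register`, `counter_to_rate`, `aggregate`, and the config keys `period_seconds`, `group_by`, `metrics`) is hypothetical. It only illustrates how time bucketing, grouping by key values, a predefined function list, and user-registered functions could fit together outside the map-reduce framework.

```python
from collections import defaultdict
from itertools import chain

# Hypothetical registry: function name -> callable over [(timestamp_s, value), ...].
FUNCTIONS = {
    "AVG": lambda samples: sum(v for _, v in samples) / len(samples),
    "MIN": lambda samples: min(v for _, v in samples),
    "MAX": lambda samples: max(v for _, v in samples),
}


def register(name, fn):
    """Add a new aggregation function without touching the aggregator itself."""
    FUNCTIONS[name] = fn


def counter_to_rate(samples):
    """Turn a monotonically increasing counter into a per-second rate."""
    ordered = sorted(samples)
    (t0, v0), (t1, v1) = ordered[0], ordered[-1]
    # A single sample (or zero elapsed time) carries no rate information.
    return (v1 - v0) / (t1 - t0) if t1 > t0 else 0.0


register("RATE", counter_to_rate)


def aggregate(records, config):
    """Bucket records by time window and group-by keys, then apply functions."""
    period = config["period_seconds"]
    group_by = config["group_by"]
    buckets = defaultdict(list)
    for rec in records:
        # Align each timestamp to the start of its window.
        window = rec["timestamp"] - rec["timestamp"] % period
        key = (window,) + tuple(rec[k] for k in group_by)
        buckets[key].append(rec)

    results = {}
    for key, recs in buckets.items():
        row = {}
        for field, fn_name in config["metrics"].items():
            samples = [(r["timestamp"], r[field]) for r in recs if field in r]
            if samples:
                row[f"{fn_name}({field})"] = FUNCTIONS[fn_name](samples)
        results[key] = row
    return results


# Two illustrative streams with made-up data, merged before aggregation.
stream_a = [
    {"timestamp": 0, "host": "h1", "cpu": 10.0, "bytes": 100},
    {"timestamp": 30, "host": "h1", "cpu": 30.0, "bytes": 400},
]
stream_b = [
    {"timestamp": 70, "host": "h1", "cpu": 50.0, "bytes": 900},
]
config = {
    "period_seconds": 60,
    "group_by": ["host"],
    "metrics": {"cpu": "AVG", "bytes": "RATE"},
}
result = aggregate(chain(stream_a, stream_b), config)
```

Because `aggregate` accepts any iterable of records, the same function could in principle be fed from a batch job or from a streaming source such as a collector; that separation is the design choice the sketch is meant to show.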

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
