[ 
https://issues.apache.org/jira/browse/CHUKWA-311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721998#action_12721998
 ] 

Ari Rabkin commented on CHUKWA-311:
-----------------------------------

Are we talking only about rolling the post-demux records, or also the raw 
chunks?
Being able to archive chunks is a fairly high priority for me.  See CHUKWA-317.

> Re-implement Hourly & Daily rolling
> -----------------------------------
>
>                 Key: CHUKWA-311
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-311
>             Project: Hadoop Chukwa
>          Issue Type: Improvement
>            Reporter: Jerome Boulon
>
> Hourly and Daily rolling are currently done using a M/R but all spill files 
> are already sorted so it's just a Merged sort.
> Doing that from a standalone application will be more efficient than using a 
> M/R.
> Another way to implement this will be to take advantage of the latest version 
> of Pig (multiple queries optimization) and do the rolling once a day at the 
> same time as we are computing daily metrics (Since the data has already been 
> loaded by pig).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to