Re-implement Hourly & Daily rolling
-----------------------------------

                 Key: CHUKWA-311
                 URL: https://issues.apache.org/jira/browse/CHUKWA-311
             Project: Hadoop Chukwa
          Issue Type: Improvement
            Reporter: Jerome Boulon


Hourly and Daily rolling are currently done using a M/R but all spill files are 
already sorted so it's just a Merged sort.
Doing that from a standalone application will be more efficient than using a 
M/R.

Another way to implement this will be to take advantage of the latest version 
of Pig (multiple queries optimization) and do the rolling once a day at the 
same time as we are computing daily metrics (Since the data has already been 
loaded by pig).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to