Separating hadoop metrics log types at source for improving primary key lookup
------------------------------------------------------------------------------

                 Key: CHUKWA-497
                 URL: https://issues.apache.org/jira/browse/CHUKWA-497
             Project: Chukwa
          Issue Type: Improvement
          Components: data collection
    Affects Versions: 0.4.0, 0.3.0, 0.1.2
         Environment: Java 6, RHEL 5.x
            Reporter: Eric Yang


Chukwa Hadoop metrics log4j appender is using HadoopLog as a generic type to 
capture all metrics from various metrics sources. i.e. JobTracker, TaskTracker, 
Shuffle, Map, Reduce, NameNode, and DataNode, etc.  It is best to make 
separation between those types, hence each data type can have it's own properly 
assigned key in demux phase by annotation.  

- Shuffle, Map, Reduce, have key associated with task attempts or Job.
- JobTracker or TaskTracker have key associated with a host.
- NameNode or DataNode have key associated with a  host.

HadoopMetricsProcessor should be deprecated in favor of separated demux classes.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to