Steaming infrastructures should provide statisics about job
-----------------------------------------------------------

                 Key: HADOOP-2241
                 URL: https://issues.apache.org/jira/browse/HADOOP-2241
             Project: Hadoop
          Issue Type: Improvement
            Reporter: arkady borkovsky


This should include
-- the commands (mapper and reducer commands) executed
-- time information (e.g. min, max, and avg start time, end time, elapsed time 
for tasks, total elapsed time )
-- sizes -- bytes and records, min, max, avg per task and total, input and 
output
-- information about input and output data sets (all output data sets, if there 
are several)
-- all user counters (when they are implemented for streaming)

the information should be stored in a file -- e.g. in the working directory 
from where the job was launched, with a name derived from the job name




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to