Steaming infrastructures should provide statisics about job -----------------------------------------------------------
Key: HADOOP-2241 URL: https://issues.apache.org/jira/browse/HADOOP-2241 Project: Hadoop Issue Type: Improvement Reporter: arkady borkovsky This should include -- the commands (mapper and reducer commands) executed -- time information (e.g. min, max, and avg start time, end time, elapsed time for tasks, total elapsed time ) -- sizes -- bytes and records, min, max, avg per task and total, input and output -- information about input and output data sets (all output data sets, if there are several) -- all user counters (when they are implemented for streaming) the information should be stored in a file -- e.g. in the working directory from where the job was launched, with a name derived from the job name -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.