[
https://issues.apache.org/jira/browse/MAPREDUCE-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071061#comment-14071061
]
Antonio Piccolboni commented on MAPREDUCE-1767:
-----------------------------------------------
Such as?
> Steaming infrastructures should provide statisics about job
> -----------------------------------------------------------
>
> Key: MAPREDUCE-1767
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1767
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: contrib/streaming
> Reporter: arkady borkovsky
>
> This should include
> -- the commands (mapper and reducer commands) executed
> -- time information (e.g. min, max, and avg start time, end time, elapsed
> time for tasks, total elapsed time )
> -- sizes -- bytes and records, min, max, avg per task and total, input and
> output
> -- information about input and output data sets (all output data sets, if
> there are several)
> -- all user counters (when they are implemented for streaming)
> the information should be stored in a file -- e.g. in the working directory
> from where the job was launched, with a name derived from the job name
--
This message was sent by Atlassian JIRA
(v6.2#6252)