Need a standalone JobHistory log anonymizer
-------------------------------------------
Key: MAPREDUCE-778
URL: https://issues.apache.org/jira/browse/MAPREDUCE-778
Project: Hadoop Map/Reduce
Issue Type: New Feature
Reporter: Hong Tang
Job history logs contain a rich set of information that can help understand and
characterize cluster workload and individual job execution. Examples of work
that parses or utilizes job history include HADOOP-3585, MAPREDUCE-534,
HDFS-459, MAPREDUCE-728, and MAPREDUCE-776. Some of the parsing tools developed
in previous work already contain a component to anonymize the logs. It would
be nice to combine these efforts into a common standalone tool that anonymizes
job history logs while preserving much of the structure of the files, so that
existing tools built on top of job history logs continue to work without
modification.
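
As a rough illustration of what structure-preserving anonymization could look
like, here is a minimal sketch. It is not taken from any of the tools cited
above. It assumes history lines consist of KEY="value" pairs, and the set of
sensitive keys, the salt, and the function names are all hypothetical. Each
sensitive value is replaced with a salted hash, so the same input always maps
to the same pseudonym and the line layout is left untouched.

```python
import hashlib
import re

# Hypothetical list of keys whose values should be hidden; a real tool
# would need to decide this from the actual job history format.
SENSITIVE_KEYS = {"USER", "JOBNAME", "JOBCONF", "HOSTNAME"}

# Assumed line layout: KEY="value" pairs, with backslash escapes allowed
# inside the quoted value.
PAIR = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')


def pseudonym(value, salt="example-salt"):
    """Map a value to a stable, opaque token using a salted SHA-256 hash."""
    digest = hashlib.sha256((salt + value).encode("utf-8")).hexdigest()
    return "anon_" + digest[:12]


def anonymize_line(line, salt="example-salt"):
    """Replace sensitive values in one line; keep keys, order, and layout."""
    def replace(match):
        key, value = match.group(1), match.group(2)
        if key in SENSITIVE_KEYS:
            value = pseudonym(value, salt)
        return '%s="%s"' % (key, value)
    return PAIR.sub(replace, line)


if __name__ == "__main__":
    sample = 'Job JOBID="job_1" USER="alice" JOBNAME="wordcount" .'
    print(anonymize_line(sample))
```

Because the mapping is deterministic for a given salt, records belonging to
the same user or job name stay correlated after anonymization, which is what
lets downstream parsers and workload analyses keep working on the output.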