Improvement of RandomWriter example to use custom InputFormat, OutputFormat,
and Counters
-----------------------------------------------------------------------------------------
Key: HADOOP-1040
URL: https://issues.apache.org/jira/browse/HADOOP-1040
Project: Hadoop
Issue Type: Improvement
Components: examples
Affects Versions: 0.11.2
Reporter: Owen O'Malley
Assigned To: Owen O'Malley
Fix For: 0.12.0
Attachments: random-writer.patch
It would be good if the RandomWriter example used a custom InputFormat and
OutputFormat rather than creating temporary files in DFS. This is not only
faster, it also gives users a better example of how to handle programs that
don't have any input. (It still uses FileSplits rather than custom InputSplits,
since FileSplits do what I need.)
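
To illustrate the "no input" idea, here is a minimal sketch in plain Java. It
is not the attached patch and does not use the Hadoop API: PlaceholderSplit,
SyntheticInputFormat, and the "placeholder-split-" naming are hypothetical
stand-ins. The point it shows is that an input format can invent one split per
map task, and hand each task a single record, without any file existing.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a file-based split: a path plus a byte range. */
final class PlaceholderSplit {
    final String path;
    final long start;
    final long length;

    PlaceholderSplit(String path, long start, long length) {
        this.path = path;
        this.start = start;
        this.length = length;
    }
}

/** Hypothetical input format that invents its splits instead of listing input files. */
final class SyntheticInputFormat {
    /** Returns one placeholder split per requested map task; no file is read or created. */
    static List<PlaceholderSplit> getSplits(String outputDir, int numSplits) {
        List<PlaceholderSplit> splits = new ArrayList<>();
        for (int i = 0; i < numSplits; i++) {
            // The path is only a label; nothing is ever opened at this location.
            splits.add(new PlaceholderSplit(outputDir + "/placeholder-split-" + i, 0L, 1L));
        }
        return splits;
    }

    /** Each split yields exactly one record (its own path), enough to start one map task. */
    static List<String> readRecords(PlaceholderSplit split) {
        List<String> records = new ArrayList<>();
        records.add(split.path);
        return records;
    }
}
```

Under these assumptions, SyntheticInputFormat.getSplits("out", 3) describes
three map tasks, and each task sees exactly one record, which it can use as
the trigger to start generating random data.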
I also added Counters to the example to count records and bytes written.
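
As a rough picture of what counting records and bytes written involves, here
is a small plain-Java sketch. WriteCounter and CounterTally are hypothetical
names, not classes from Hadoop or from the patch; the sketch only assumes that
counters are named by an enum and incremented once per write.

```java
import java.util.EnumMap;
import java.util.Map;

/** Hypothetical counter names for the two quantities mentioned above. */
enum WriteCounter { RECORDS_WRITTEN, BYTES_WRITTEN }

/** Hypothetical per-task tally keyed by enum constant. */
final class CounterTally {
    private final Map<WriteCounter, Long> totals = new EnumMap<>(WriteCounter.class);

    /** Adds amount to the named counter, starting from zero if it is unset. */
    void increment(WriteCounter counter, long amount) {
        totals.merge(counter, amount, Long::sum);
    }

    /** Returns the current total, or zero if the counter was never incremented. */
    long get(WriteCounter counter) {
        return totals.getOrDefault(counter, 0L);
    }

    /** Records one written key/value pair and its combined size in bytes. */
    void recordWrite(int keyBytes, int valueBytes) {
        increment(WriteCounter.RECORDS_WRITTEN, 1L);
        increment(WriteCounter.BYTES_WRITTEN, (long) keyBytes + valueBytes);
    }
}
```

In this sketch, a task would call recordWrite once for every key/value pair it
emits, so the two totals stay in step with the output actually produced.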