SequenceFileAsBinaryOutputFormat
--------------------------------
Key: HADOOP-3460
URL: https://issues.apache.org/jira/browse/HADOOP-3460
Project: Hadoop Core
Issue Type: New Feature
Components: mapred
Reporter: Koji Noguchi
Priority: Minor
Add an OutputFormat to write raw bytes as keys and values to a SequenceFile.
In C++ Pipes, we're using SequenceFileAsBinaryInputFormat to read SequenceFiles.
However, we currently don't have a way to *write* a SequenceFile efficiently
without going through extra (de)serializations.
I'd like to store the correct class names for the keys and values in the file,
but use BytesWritable to do the actual writing, so that the next Java or Pig
job can still read this SequenceFile.
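
To make the idea concrete, here is a rough sketch in plain Java. It does not use
the Hadoop API and does not reproduce the real SequenceFile layout; the class
RawRecordSketch, its methods, and the toy header format are all hypothetical.
It only illustrates the two halves of the request: the header records the real
key/value class names, while the record bytes are copied through untouched.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical toy container, NOT the real SequenceFile format.
final class RawRecordSketch {

    // Writes a header naming the logical key/value classes, then one record
    // whose key and value are copied as opaque, length-prefixed bytes.
    // Nothing is deserialized on the way through.
    static byte[] writeOne(String keyClass, String valueClass,
                           byte[] rawKey, byte[] rawValue) {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buf)) {
            out.writeUTF(keyClass);      // class a later reader should use for keys
            out.writeUTF(valueClass);    // class a later reader should use for values
            out.writeInt(rawKey.length);
            out.write(rawKey);           // already-serialized key bytes
            out.writeInt(rawValue.length);
            out.write(rawValue);         // already-serialized value bytes
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buf.toByteArray();
    }

    // Reads back only the two class names from the toy header.
    static String[] readHeader(byte[] file) {
        try (DataInputStream in =
                 new DataInputStream(new ByteArrayInputStream(file))) {
            return new String[] { in.readUTF(), in.readUTF() };
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

A reader that trusts the header can then instantiate the named classes (for
example org.apache.hadoop.io.Text) and deserialize the records normally, even
though the writer only ever handled byte arrays.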