[ 
https://issues.apache.org/jira/browse/HADOOP-3788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12630701#action_12630701
 ] 

Pete Wyckoff commented on HADOOP-3788:
--------------------------------------

bq. could imagine the deserializers being handed a special kind of stream that 
counts down the remaining bytes and then signals EOF.

One problem i ran into related to this is what if your deserializer does 
buffering?  For EOF, one can return null from deserialize or in theory throw 
EOFException :(, but for getProgress, RecordReader's use getPos which is off if 
the deserializer has its own buffers. e.g., if you were to implement 
LineReaderDeserializer using LineRecordReader.LineReader.

-- pete


> Add serialization for Protocol Buffers
> --------------------------------------
>
>                 Key: HADOOP-3788
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3788
>             Project: Hadoop Core
>          Issue Type: Wish
>          Components: examples, mapred
>    Affects Versions: 0.19.0
>            Reporter: Tom White
>            Assignee: Alex Loddengaard
>             Fix For: 0.19.0
>
>         Attachments: hadoop-3788-v1.patch, hadoop-3788-v2.patch, 
> protobuf-java-2.0.1.jar
>
>
> Protocol Buffers (http://code.google.com/p/protobuf/) are a way of encoding 
> data in a compact binary format. This issue is to write a 
> ProtocolBuffersSerialization to support using Protocol Buffers types in 
> MapReduce programs, including an example program. This should probably go 
> into contrib. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to