Please reply-all so the mailing list stays CC'd.  I don't know anything
about the libraries you are using so I can't really help you further.  Maybe
someone else can.

On Thu, Feb 18, 2010 at 12:46 PM, Yang <teddyyyy...@gmail.com> wrote:

> thanks Kenton,
>
> I thought about the same,
> what I did was that I use a splitter stream, and split the actual input
> stream into 2, dumping out one for debugging, and feeding the other one to
> PB.
>
>
> my code for Hadoop is
>
> Writable.readFields( Datainput in ) {
>
>     SplitterInputStream ios = new SplitterInputStream(in);
>
>     pb_object = MyPBClass.parseFrom(ios);
> }
>
> SplitterInputStream dumps out the actual bytes, and the resulting byte
> stream is
> indeed (decimal)
>
> 10 2 79 79  16 1  ... repeating 20 times\
>
> which is 20 records of
> message {
>   1: string name ;  // taking a value of "yy"
>   2: i32     Id;   //taking a value of 1
> }
>
>
>
> indeed, in compression or non-compression mode, the dumped out bytestream
> is the same.
>
>
>
> On Thu, Feb 18, 2010 at 12:03 PM, Kenton Varda <ken...@google.com> wrote:
>
>> You should verify that the bytes that come out of the InputStream really
>> are the exact same bytes that were written by the serializer to the
>> OutputStream originally.  You could do this by computing a checksum at both
>> ends and printing it, then inspecting visually.  You'll probably find that
>> the bytes differ somehow, or don't end at the same point.
>>
>> On Thu, Feb 18, 2010 at 2:48 AM, Yang <teddyyyy...@gmail.com> wrote:
>>
>>> I tried to use protocol buffer in hadoop,
>>>
>>> so far it works fine with SequenceFile, after I hook it up with a simple
>>> wrapper,
>>>
>>> but after I put in a compressor in sequenceFile, it fails, because it
>>> read all the messages and yet still wants to advance the read pointer, and
>>> then readTag() returns 0, so the mergeFrom() returns a message with no
>>> fields set.
>>>
>>> anybody familiar with both SequenceFile and protocol buffer has an idea
>>> why it fails like this?
>>> I find it difficult to understand because the InputStream is simply the
>>> same, whether it comes through a compressor or not
>>>
>>>
>>> thanks
>>> Yang
>>>
>>> --
>>> You received this message because you are subscribed to the Google Groups
>>> "Protocol Buffers" group.
>>> To post to this group, send email to proto...@googlegroups.com.
>>> To unsubscribe from this group, send email to
>>> protobuf+unsubscr...@googlegroups.com<protobuf%2bunsubscr...@googlegroups.com>
>>> .
>>> For more options, visit this group at
>>> http://groups.google.com/group/protobuf?hl=en.
>>>
>>
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"Protocol Buffers" group.
To post to this group, send email to proto...@googlegroups.com.
To unsubscribe from this group, send email to 
protobuf+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/protobuf?hl=en.

Reply via email to