Hi Radu,

none of Flink's serializers adds checksums to ensure data integrity.
It would be possible to implement a wrapping serializer that adds a
checksum to each record, but that would be at the cost of performance.
Not sure if this is done at some point in Flink, maybe for savepoints.

Best, Fabian

2018-02-23 14:44 GMT+01:00 Radu Tudoran <radu.tudo...@huawei.com>:

> Hi,
>
>
>
> I was using the RowSerializer (package org.apache.flink.api.java.
> typeutils.runtime;) recently to serialize Rows to file (for reading them
> back in the future).
>
> I observed a strange behavior that I would like to double check with you
> in case this is a serious problem to be addressed:
>
> When the rowserializer is used to convert data back, there is no check for
> the consistency of the data (e.g., size of the object that was serialized,
> checksum…). This leads to situation that for random reads of bytes it can
> happen that inconsistent objects are deserialized, which of course can lead
> to inconsistent data.
>
>
>
> For example if we serialize object of the form (Int, Long, Double, String
> String) – if we have available only 1/3 of the bytes, we can end up reading
> back objects  as (0,0,0,null, null) – this is not the only example of how
> the object can be incorrectly deserialized …rather than having an error for
> this process.
>
>
>
> Hence, I wanted to double check if this is an intended behavior for some
> reason and if we should consider fixing the rowserializer to guarantee
> integrity of the objects that are deserialized.
>
>
>
> Best regards,
>
>
>
> Dr. Radu Tudoran
>
> Staff Research Engineer - Big Data Expert
>
> IT R&D Division
>
>
>
> [image: cid:image007.jpg@01CD52EB.AD060EE0]
>
> HUAWEI TECHNOLOGIES Duesseldorf GmbH
>
> German Research Center
>
> Munich Office
>
> Riesstrasse 25, 80992
> <https://maps.google.com/?q=Riesstrasse+25,+80992&entry=gmail&source=g>
> München
>
>
>
> E-mail: *radu.tudo...@huawei.com <radu.tudo...@huawei.com>*
>
> Mobile: +49 15209084330 <+49%201520%209084330>
>
> Telephone: +49 891588344173 <+49%2089%201588344173>
>
>
>
> HUAWEI TECHNOLOGIES Duesseldorf GmbH
> Hansaallee 205, 40549 Düsseldorf, Germany
> <https://maps.google.com/?q=Hansaallee+205,+40549+D%C3%BCsseldorf,+Germany&entry=gmail&source=g>,
> www.huawei.com
> Registered Office: Düsseldorf, Register Court Düsseldorf, HRB 56063,
> Managing Director: Bo PENG, Qiuen Peng, Shengli Wang
> Sitz der Gesellschaft: Düsseldorf, Amtsgericht Düsseldorf, HRB 56063,
> Geschäftsführer: Bo PENG, Qiuen Peng, Shengli Wang
>
> This e-mail and its attachments contain confidential information from
> HUAWEI, which is intended only for the person or entity whose address is
> listed above. Any use of the information contained herein in any way
> (including, but not limited to, total or partial disclosure, reproduction,
> or dissemination) by persons other than the intended recipient(s) is
> prohibited. If you receive this e-mail in error, please notify the sender
> by phone or email immediately and delete it!
>
>
>

Reply via email to