Prasanth J created HIVE-6382:
--------------------------------

             Summary: PATCHED_BLOB encoding in ORC will corrupt the data in 
some cases
                 Key: HIVE-6382
                 URL: https://issues.apache.org/jira/browse/HIVE-6382
             Project: Hive
          Issue Type: Bug
          Components: Serializers/Deserializers
    Affects Versions: 0.13.0
            Reporter: Prasanth J
            Assignee: Prasanth J


In PATCHED_BLOB encoding, gapVsPatchList is an array of long that stores gap 
between the values that are patched (g) and the patch value (p). The maximum 
distance of gap can be 511 that require 8 bits to encode. And patch values can 
take more than 56 bits. When patch values take more than 56 bits, p + g will 
become > 64 bits which cannot be packed to a long. This will result in data 
corruption under the case where patch values are > 56 bits. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to