Siddharth Seth created HIVE-14800:

             Summary: Handle off by 3 in ORC split generation based on split 
strategy used
                 Key: HIVE-14800
             Project: Hive
          Issue Type: Bug
            Reporter: Siddharth Seth

BI will apparently generate splits starting at offset 0.
ETL will skip the ORC header and generate a split starting at offset 3.

There's a workaround in the HiveSplitGenreator to handle this for consistent 
splits. Ideally, Orc split generation should take care of this.

cc [~prasanth_j], [~gopalv]

This message was sent by Atlassian JIRA

Reply via email to