Timothy Chen created HIVE-6143:
----------------------------------

             Summary: Refactor Orc file format parsing logic to be shared
                 Key: HIVE-6143
                 URL: https://issues.apache.org/jira/browse/HIVE-6143
             Project: Hive
          Issue Type: Bug
            Reporter: Timothy Chen


Currently the Orc file format parsing logic is hidden in private methods in 
reader and record reader classes, for example footer parsing, stream loading, 
etc.

For the Orc file format to be a more reusable file format outside of Hive, I 
suggest refactor these generic logic into a shared class.

The current interface of reading per serialized as objects is not suffice as 
for columnar execution engines such as Drill/Impala, it's much more efficient 
to load in columnar data into its own columnar in memory formats.





--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to