Timothy Chen created HIVE-6143: ---------------------------------- Summary: Refactor Orc file format parsing logic to be shared Key: HIVE-6143 URL: https://issues.apache.org/jira/browse/HIVE-6143 Project: Hive Issue Type: Bug Reporter: Timothy Chen
Currently the Orc file format parsing logic is hidden in private methods in reader and record reader classes, for example footer parsing, stream loading, etc. For the Orc file format to be a more reusable file format outside of Hive, I suggest refactor these generic logic into a shared class. The current interface of reading per serialized as objects is not suffice as for columnar execution engines such as Drill/Impala, it's much more efficient to load in columnar data into its own columnar in memory formats. -- This message was sent by Atlassian JIRA (v6.1.5#6160)