Csaba Ringhofer created IMPALA-9579:
---------------------------------------
Summary: Read support for binary in ORC
Key: IMPALA-9579
URL: https://issues.apache.org/jira/browse/IMPALA-9579
Project: IMPALA
Issue Type: Sub-task
Components: Backend
Reporter: Csaba Ringhofer
ORC has its own BINARY type, which has some differences compared to
STRING/VARCHAR/CHAR, as BINARY:
- doesn't use dictionary encoding
- doesn't store min/max values in the statistics
The c++ library uses the same ColumnReader as for STRING, so the implementation
efforts should be minimal:
https://github.com/apache/orc/blob/a9ec6a2e39ed71ef8a2d874df14700956aa847be/c%2B%2B/src/ColumnReader.cc#L1752
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]