Quanlong Huang has posted comments on this change.
Change subject: IMPALA-5717: Support for ORC data files
Patch Set 3:
PS3, Line 15: Instead of linking the orc-reader as a third party library, it's
> Which version or commit hash of ORC did you import?
The version is 1.2.3. It's a little old because we started the Impala-ORC
project a year ago. I can update it to the latest version if you think that's
essential. As for using it as a third-party library, I'm also concerned about
the memory paradigm gap between the ORC lib and Impala. Alternatively, I can
try to reduce the size of this patch, e.g. by removing unused code such as
ColumnPrinter and reusing as much Impala code as possible, e.g. the RLE
decoders and int128. My goal is to make this reader no more complex than the
Parquet column readers.
If you ultimately still decide to add it to the native-toolchain project, could
you point me to some docs on how to submit a code review for that project? Is
the process the same as for Impala?
PS3, Line 25: tests.
> We should also add ORC for TPC-H and TPC-DS so that we have some larger dat
Sure. We have actually tested them, but I forgot to add this to the patch.
To view, visit http://gerrit.cloudera.org:8080/9134
Gerrit-Owner: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Reviewer: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Reviewer: Tim Armstrong <tarmstr...@cloudera.com>
Gerrit-Comment-Date: Sat, 10 Feb 2018 00:34:49 +0000