Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15418 )

Change subject: IMPALA-9490 [DOCS] State support for reading Apache Hudi 
optimized table
......................................................................


Patch Set 1:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_file_formats.xml
File docs/topics/impala_file_formats.xml:

http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_file_formats.xml@189
PS1, Line 189: LOAD DATA
LOAD DATA doesn't work for Hudi because a Hudi table contains hidden files
that LOAD DATA ignores.

I think in production users would typically create an external table in Impala
and set the location of the table to point to the directory of the Hudi table,
or just create the Hudi table with Hive.
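
As a rough illustration of the hidden-file problem, here is a minimal Python
sketch. It assumes that a loader treats names starting with "." or "_" as
hidden (the exact rule Impala applies is an assumption here), and the
directory listing is hypothetical, made up to resemble one Hudi partition.

```python
def is_hidden(name: str) -> bool:
    """Assumed rule: names starting with '.' or '_' count as hidden."""
    return name.startswith((".", "_"))


def split_listing(names):
    """Split a directory listing into (visible, hidden) file names."""
    visible = [n for n in names if not is_hidden(n)]
    hidden = [n for n in names if is_hidden(n)]
    return visible, hidden


# Hypothetical listing of one Hudi partition directory.
listing = [
    ".hoodie_partition_metadata",
    "f1a2_0-1-1_20200312133706.parquet",
    "f1a2_0-2-2_20200312140000.parquet",
]

visible, hidden = split_listing(listing)
print("picked up by a loader that skips hidden files:", visible)
print("left behind:", hidden)
```

Under these assumptions only the data files would be moved and the Hudi
metadata file would stay behind, which is why pointing an external table at
the existing directory seems like the safer route.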


http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_hudi.xml
File docs/topics/impala_hudi.xml:

http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_hudi.xml@39
PS1, Line 39: >Hudi support in Impala</indexterm> Impala supports reading
            :       optimized Hudi
Using the Hudi project's terminology, I think it would be something like:
'Impala supports "Read Optimized Queries" on Hudi tables.'

Maybe we could then add this from the Hudi docs:
"Read Optimized Queries see the latest snapshot of table as of a given 
commit/compaction action. Exposes only the base/columnar files in latest file 
slices and guarantees the same columnar query performance compared to a 
non-hudi columnar table."


http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_hudi.xml@41
PS1, Line 41: Hudi ReadOptimized Table
Hudi doesn't have different kinds of tables; it provides different views of
the same table, and one of those views is "Read Optimized".



--
To view, visit http://gerrit.cloudera.org:8080/15418
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If210cd545a8deb059e66fd36e62e0df4402fc96c
Gerrit-Change-Number: 15418
Gerrit-PatchSet: 1
Gerrit-Owner: Kristine Hahn <[email protected]>
Gerrit-Reviewer: Impala Public Jenkins <[email protected]>
Gerrit-Reviewer: Tim Armstrong <[email protected]>
Gerrit-Reviewer: Zoltan Borok-Nagy <[email protected]>
Gerrit-Comment-Date: Thu, 12 Mar 2020 13:37:06 +0000
Gerrit-HasComments: Yes
