Zoltan Borok-Nagy has posted comments on this change. ( http://gerrit.cloudera.org:8080/15418 )
Change subject: IMPALA-9490 [DOCS] State support for reading Apache Hudi optimized table ...................................................................... Patch Set 1: (3 comments) http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_file_formats.xml File docs/topics/impala_file_formats.xml: http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_file_formats.xml@189 PS1, Line 189: LOAD DATA LOAD DATA doesn't work for Hudi because it has hidden files that LOAD DATA ignores. I think in production they'd typically create an external table in Impala and set the location of the table to point to the directory of the Hudi table. Or just create the Hudi table with Hive. http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_hudi.xml File docs/topics/impala_hudi.xml: http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_hudi.xml@39 PS1, Line 39: >Hudi support in Impala</indexterm> Impala supports reading : optimized Hudi Using the Hudi project's terminology I think it would be something like "Impala supports "Read Optimized Queries" on Hudi tables." Maybe we could then add this from the Hudi docs: "Read Optimized Queries see the latest snapshot of table as of a given commit/compaction action. Exposes only the base/columnar files in latest file slices and guarantees the same columnar query performance compared to a non-hudi columnar table." http://gerrit.cloudera.org:8080/#/c/15418/1/docs/topics/impala_hudi.xml@41 PS1, Line 41: Hudi ReadOptimized Table Hudi doesn't have different kind of tables, but provides different views of that table, and one view is "Read optimized". -- To view, visit http://gerrit.cloudera.org:8080/15418 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: comment Gerrit-Change-Id: If210cd545a8deb059e66fd36e62e0df4402fc96c Gerrit-Change-Number: 15418 Gerrit-PatchSet: 1 Gerrit-Owner: Kristine Hahn <[email protected]> Gerrit-Reviewer: Impala Public Jenkins <[email protected]> Gerrit-Reviewer: Tim Armstrong <[email protected]> Gerrit-Reviewer: Zoltan Borok-Nagy <[email protected]> Gerrit-Comment-Date: Thu, 12 Mar 2020 13:37:06 +0000 Gerrit-HasComments: Yes
