This is an automated email from the ASF dual-hosted git repository. leesf pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git
The following commit(s) were added to refs/heads/asf-site by this push: new 349be47 [MINOR] add impala release and spark partition discovery (#1651) 349be47 is described below commit 349be47d8830489bc8c3d130683ad561ea8005ca Author: Gary Li <yanjia.gary...@gmail.com> AuthorDate: Wed May 20 21:44:35 2020 -0700 [MINOR] add impala release and spark partition discovery (#1651) --- docs/_docs/0.5.2/2_3_querying_data.cn.md | 2 +- docs/_docs/0.5.2/2_3_querying_data.md | 2 +- docs/_docs/1_1_quick_start_guide.cn.md | 2 ++ docs/_docs/1_1_quick_start_guide.md | 2 ++ docs/_docs/2_3_querying_data.cn.md | 2 +- docs/_docs/2_3_querying_data.md | 2 +- 6 files changed, 8 insertions(+), 4 deletions(-) diff --git a/docs/_docs/0.5.2/2_3_querying_data.cn.md b/docs/_docs/0.5.2/2_3_querying_data.cn.md index 77ad2d7..d37d8f2 100644 --- a/docs/_docs/0.5.2/2_3_querying_data.cn.md +++ b/docs/_docs/0.5.2/2_3_querying_data.cn.md @@ -176,7 +176,7 @@ scala> sqlContext.sql("select count(*) from hudi_rt where datestr = '2016-10-02' Presto是一种常用的查询引擎,可提供交互式查询性能。 Hudi RO表可以在Presto中无缝查询。 这需要在整个安装过程中将`hudi-presto-bundle` jar放入`<presto_install>/plugin/hive-hadoop2/`中。 -## Impala(此功能还未正式发布) +## Impala (3.4 or later) ### 读优化表 diff --git a/docs/_docs/0.5.2/2_3_querying_data.md b/docs/_docs/0.5.2/2_3_querying_data.md index 9d17e72..00c8a48 100644 --- a/docs/_docs/0.5.2/2_3_querying_data.md +++ b/docs/_docs/0.5.2/2_3_querying_data.md @@ -171,7 +171,7 @@ Additionally, `HoodieReadClient` offers the following functionality using Hudi's Presto is a popular query engine, providing interactive query performance. Presto currently supports snapshot queries on COPY_ON_WRITE and read optimized queries on MERGE_ON_READ Hudi tables. This requires the `hudi-presto-bundle` jar to be placed into `<presto_install>/plugin/hive-hadoop2/`, across the installation. -## Impala (Not Officially Released) +## Impala (3.4 or later) ### Snapshot Query diff --git a/docs/_docs/1_1_quick_start_guide.cn.md b/docs/_docs/1_1_quick_start_guide.cn.md index f20e212..9137f91 100644 --- a/docs/_docs/1_1_quick_start_guide.cn.md +++ b/docs/_docs/1_1_quick_start_guide.cn.md @@ -70,6 +70,8 @@ val roViewDF = spark. read. format("org.apache.hudi"). load(basePath + "/*/*/*/*") + //load(basePath) 如果使用 "/partitionKey=partitionValue" 文件夹命名格式,Spark将自动识别分区信息 + roViewDF.registerTempTable("hudi_ro_table") spark.sql("select fare, begin_lon, begin_lat, ts from hudi_ro_table where fare > 20.0").show() spark.sql("select _hoodie_commit_time, _hoodie_record_key, _hoodie_partition_path, rider, driver, fare from hudi_ro_table").show() diff --git a/docs/_docs/1_1_quick_start_guide.md b/docs/_docs/1_1_quick_start_guide.md index 3e088dd..62939cb 100644 --- a/docs/_docs/1_1_quick_start_guide.md +++ b/docs/_docs/1_1_quick_start_guide.md @@ -92,6 +92,7 @@ val tripsSnapshotDF = spark. read. format("hudi"). load(basePath + "/*/*/*/*") +//load(basePath) use "/partitionKey=partitionValue" folder structure for Spark auto partition discovery tripsSnapshotDF.createOrReplaceTempView("hudi_trips_snapshot") spark.sql("select fare, begin_lon, begin_lat, ts from hudi_trips_snapshot where fare > 20.0").show() @@ -297,6 +298,7 @@ tripsSnapshotDF = spark. \ read. \ format("hudi"). \ load(basePath + "/*/*/*/*") +# load(basePath) use "/partitionKey=partitionValue" folder structure for Spark auto partition discovery tripsSnapshotDF.createOrReplaceTempView("hudi_trips_snapshot") diff --git a/docs/_docs/2_3_querying_data.cn.md b/docs/_docs/2_3_querying_data.cn.md index 1fa91d1..0aeb104 100644 --- a/docs/_docs/2_3_querying_data.cn.md +++ b/docs/_docs/2_3_querying_data.cn.md @@ -175,7 +175,7 @@ scala> sqlContext.sql("select count(*) from hudi_rt where datestr = '2016-10-02' Presto是一种常用的查询引擎,可提供交互式查询性能。 Hudi RO表可以在Presto中无缝查询。 这需要在整个安装过程中将`hudi-presto-bundle` jar放入`<presto_install>/plugin/hive-hadoop2/`中。 -## Impala(此功能还未正式发布) +## Impala (3.4 or later) ### 读优化表 diff --git a/docs/_docs/2_3_querying_data.md b/docs/_docs/2_3_querying_data.md index 3e6a436..568d3ba 100644 --- a/docs/_docs/2_3_querying_data.md +++ b/docs/_docs/2_3_querying_data.md @@ -170,7 +170,7 @@ Additionally, `HoodieReadClient` offers the following functionality using Hudi's Presto is a popular query engine, providing interactive query performance. Presto currently supports snapshot queries on COPY_ON_WRITE and read optimized queries on MERGE_ON_READ Hudi tables. This requires the `hudi-presto-bundle` jar to be placed into `<presto_install>/plugin/hive-hadoop2/`, across the installation. -## Impala (Not Officially Released) +## Impala (3.4 or later) ### Snapshot Query