[incubator-hudi] branch asf-site updated: [MINOR] add impala release and spark partition discovery (#1651)

leesf Wed, 20 May 2020 21:45:22 -0700

This is an automated email from the ASF dual-hosted git repository.

leesf pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git



The following commit(s) were added to refs/heads/asf-site by this push:
     new 349be47  [MINOR] add impala release and spark partition discovery 
(#1651)
349be47 is described below

commit 349be47d8830489bc8c3d130683ad561ea8005ca
Author: Gary Li <yanjia.gary...@gmail.com>
AuthorDate: Wed May 20 21:44:35 2020 -0700

    [MINOR] add impala release and spark partition discovery (#1651)
---
 docs/_docs/0.5.2/2_3_querying_data.cn.md | 2 +-
 docs/_docs/0.5.2/2_3_querying_data.md    | 2 +-
 docs/_docs/1_1_quick_start_guide.cn.md   | 2 ++
 docs/_docs/1_1_quick_start_guide.md      | 2 ++
 docs/_docs/2_3_querying_data.cn.md       | 2 +-
 docs/_docs/2_3_querying_data.md          | 2 +-
 6 files changed, 8 insertions(+), 4 deletions(-)

diff --git a/docs/_docs/0.5.2/2_3_querying_data.cn.md 
b/docs/_docs/0.5.2/2_3_querying_data.cn.md
index 77ad2d7..d37d8f2 100644
--- a/docs/_docs/0.5.2/2_3_querying_data.cn.md
+++ b/docs/_docs/0.5.2/2_3_querying_data.cn.md
@@ -176,7 +176,7 @@ scala> sqlContext.sql("select count(*) from hudi_rt where 
datestr = '2016-10-02'
 Presto是一种常用的查询引擎，可提供交互式查询性能。 Hudi RO表可以在Presto中无缝查询。
 这需要在整个安装过程中将`hudi-presto-bundle` jar放入`<presto_install>/plugin/hive-hadoop2/`中。
 
-## Impala(此功能还未正式发布)
+## Impala (3.4 or later)
 
 ### 读优化表
 
diff --git a/docs/_docs/0.5.2/2_3_querying_data.md 
b/docs/_docs/0.5.2/2_3_querying_data.md
index 9d17e72..00c8a48 100644
--- a/docs/_docs/0.5.2/2_3_querying_data.md
+++ b/docs/_docs/0.5.2/2_3_querying_data.md
@@ -171,7 +171,7 @@ Additionally, `HoodieReadClient` offers the following 
functionality using Hudi's
 Presto is a popular query engine, providing interactive query performance. 
Presto currently supports snapshot queries on COPY_ON_WRITE and read optimized 
queries 
 on MERGE_ON_READ Hudi tables. This requires the `hudi-presto-bundle` jar to be 
placed into `<presto_install>/plugin/hive-hadoop2/`, across the installation.
 
-## Impala (Not Officially Released)
+## Impala (3.4 or later)
 
 ### Snapshot Query
 
diff --git a/docs/_docs/1_1_quick_start_guide.cn.md 
b/docs/_docs/1_1_quick_start_guide.cn.md
index f20e212..9137f91 100644
--- a/docs/_docs/1_1_quick_start_guide.cn.md
+++ b/docs/_docs/1_1_quick_start_guide.cn.md
@@ -70,6 +70,8 @@ val roViewDF = spark.
     read.
     format("org.apache.hudi").
     load(basePath + "/*/*/*/*")
+    //load(basePath) 如果使用 "/partitionKey=partitionValue" 文件夹命名格式，Spark将自动识别分区信息
+
 roViewDF.registerTempTable("hudi_ro_table")
 spark.sql("select fare, begin_lon, begin_lat, ts from  hudi_ro_table where 
fare > 20.0").show()
 spark.sql("select _hoodie_commit_time, _hoodie_record_key, 
_hoodie_partition_path, rider, driver, fare from  hudi_ro_table").show()
diff --git a/docs/_docs/1_1_quick_start_guide.md 
b/docs/_docs/1_1_quick_start_guide.md
index 3e088dd..62939cb 100644
--- a/docs/_docs/1_1_quick_start_guide.md
+++ b/docs/_docs/1_1_quick_start_guide.md
@@ -92,6 +92,7 @@ val tripsSnapshotDF = spark.
   read.
   format("hudi").
   load(basePath + "/*/*/*/*")
+//load(basePath) use "/partitionKey=partitionValue" folder structure for Spark 
auto partition discovery
 tripsSnapshotDF.createOrReplaceTempView("hudi_trips_snapshot")
 
 spark.sql("select fare, begin_lon, begin_lat, ts from  hudi_trips_snapshot 
where fare > 20.0").show()
@@ -297,6 +298,7 @@ tripsSnapshotDF = spark. \
   read. \
   format("hudi"). \
   load(basePath + "/*/*/*/*")
+# load(basePath) use "/partitionKey=partitionValue" folder structure for Spark 
auto partition discovery
 
 tripsSnapshotDF.createOrReplaceTempView("hudi_trips_snapshot")
 
diff --git a/docs/_docs/2_3_querying_data.cn.md 
b/docs/_docs/2_3_querying_data.cn.md
index 1fa91d1..0aeb104 100644
--- a/docs/_docs/2_3_querying_data.cn.md
+++ b/docs/_docs/2_3_querying_data.cn.md
@@ -175,7 +175,7 @@ scala> sqlContext.sql("select count(*) from hudi_rt where 
datestr = '2016-10-02'
 Presto是一种常用的查询引擎，可提供交互式查询性能。 Hudi RO表可以在Presto中无缝查询。
 这需要在整个安装过程中将`hudi-presto-bundle` jar放入`<presto_install>/plugin/hive-hadoop2/`中。
 
-## Impala(此功能还未正式发布)
+## Impala (3.4 or later)
 
 ### 读优化表
 
diff --git a/docs/_docs/2_3_querying_data.md b/docs/_docs/2_3_querying_data.md
index 3e6a436..568d3ba 100644
--- a/docs/_docs/2_3_querying_data.md
+++ b/docs/_docs/2_3_querying_data.md
@@ -170,7 +170,7 @@ Additionally, `HoodieReadClient` offers the following 
functionality using Hudi's
 Presto is a popular query engine, providing interactive query performance. 
Presto currently supports snapshot queries on COPY_ON_WRITE and read optimized 
queries 
 on MERGE_ON_READ Hudi tables. This requires the `hudi-presto-bundle` jar to be 
placed into `<presto_install>/plugin/hive-hadoop2/`, across the installation.
 
-## Impala (Not Officially Released)
+## Impala (3.4 or later)
 
 ### Snapshot Query

[incubator-hudi] branch asf-site updated: [MINOR] add impala release and spark partition discovery (#1651)

Reply via email to