This is an automated email from the ASF dual-hosted git repository.
bhavanisudha pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/hudi.git
The following commit(s) were added to refs/heads/asf-site by this push:
new 561fd0aeb16 [MINOR][DOCS] updated notes for spark-hive integrated local environments (#10201)
561fd0aeb16 is described below
commit 561fd0aeb1625fa55ed9ce05b92c20fd936f74bb
Author: Sagar Lakshmipathy <[email protected]>
AuthorDate: Thu Dec 7 11:45:44 2023 -0800
[MINOR][DOCS] updated notes for spark-hive integrated local environments (#10201)
* updated notes for spark-hive locally integrated environments
* moved note to sql_ddl page and reframed it
---
website/docs/quick-start-guide.md | 7 +++++--
website/docs/sql_ddl.md | 9 +++++++++
website/versioned_docs/version-0.14.0/quick-start-guide.md | 7 +++++--
website/versioned_docs/version-0.14.0/sql_ddl.md | 9 +++++++++
4 files changed, 28 insertions(+), 4 deletions(-)
diff --git a/website/docs/quick-start-guide.md b/website/docs/quick-start-guide.md
index 6d3d161eba0..1ba7cb877cb 100644
--- a/website/docs/quick-start-guide.md
+++ b/website/docs/quick-start-guide.md
@@ -203,7 +203,10 @@ values={[
</TabItem>
<TabItem value="sparksql">
-
+:::note NOTE:
+For users who have Spark-Hive integration in their environment, this guide assumes that you have the appropriate
+settings configured to allow Spark to create tables and register in Hive Metastore.
+:::
Here is an example of creating a Hudi table.
@@ -220,7 +223,7 @@ CREATE TABLE hudi_table (
PARTITIONED BY (city);
```
-For more options for creating Hudi tables, please refer to [SQL DDL](/docs/next/sql_ddl) reference guide.
+For more options for creating Hudi tables or if you're running into any issues, please refer to [SQL DDL](/docs/next/sql_ddl) reference guide.
</TabItem>
diff --git a/website/docs/sql_ddl.md b/website/docs/sql_ddl.md
index 5b5fafe3962..e1f659f4c88 100644
--- a/website/docs/sql_ddl.md
+++ b/website/docs/sql_ddl.md
@@ -27,6 +27,15 @@ CREATE TABLE [IF NOT EXISTS] [db_name.]table_name
[AS select_statement];
```
+:::note NOTE:
+For users running this tutorial locally and have a Spark-Hive(HMS) integration in their environment: If you use
+`default` database or if you don't provide `[LOCATION path]` with the DDL statement, Spark will return
+`java.io.IOException: Mkdirs failed to create file:/user/hive/warehouse/hudi_table/.hoodie` error.
+To get around this, you can follow either of the two options mentioned below:
+1. Create a database i.e. `CREATE DATABASE hudidb;` and use it i.e. `USE hudidb;` before running the DDL statement.
+2. Or provide a path using `LOCATION` keyword to persist the data with the DDL statement.
+:::
+
### Create non-partitioned table
Creating a non-partitioned table is as simple as creating a regular table.
diff --git a/website/versioned_docs/version-0.14.0/quick-start-guide.md b/website/versioned_docs/version-0.14.0/quick-start-guide.md
index d05fb84b990..1512bab6e05 100644
--- a/website/versioned_docs/version-0.14.0/quick-start-guide.md
+++ b/website/versioned_docs/version-0.14.0/quick-start-guide.md
@@ -203,7 +203,10 @@ values={[
</TabItem>
<TabItem value="sparksql">
-
+:::note NOTE:
+For users who have Spark-Hive integration in their environment, this guide assumes that you have the appropriate
+settings configured to allow Spark to create tables and register in Hive Metastore.
+:::
Here is an example of creating a Hudi table.
@@ -220,7 +223,7 @@ CREATE TABLE hudi_table (
PARTITIONED BY (city);
```
-For more options for creating Hudi tables, please refer to [SQL DDL](/docs/sql_ddl) reference guide.
+For more options for creating Hudi tables or if you're running into any issues, please refer to [SQL DDL](/docs/sql_ddl) reference guide.
</TabItem>
diff --git a/website/versioned_docs/version-0.14.0/sql_ddl.md b/website/versioned_docs/version-0.14.0/sql_ddl.md
index 0fda74d957b..2211a5b88c4 100644
--- a/website/versioned_docs/version-0.14.0/sql_ddl.md
+++ b/website/versioned_docs/version-0.14.0/sql_ddl.md
@@ -27,6 +27,15 @@ CREATE TABLE [IF NOT EXISTS] [db_name.]table_name
[AS select_statement];
```
+:::note NOTE:
+For users running this tutorial locally and have a Spark-Hive(HMS) integration in their environment: If you use
+`default` database or if you don't provide `[LOCATION path]` with the DDL statement, Spark will return
+`java.io.IOException: Mkdirs failed to create file:/user/hive/warehouse/hudi_table/.hoodie` error.
+To get around this, you can follow either of the two options mentioned below:
+1. Create a database i.e. `CREATE DATABASE hudidb;` and use it i.e. `USE hudidb;` before running the DDL statement.
+2. Or provide a path using `LOCATION` keyword to persist the data with the DDL statement.
+:::
+
### Create non-partitioned table
Creating a non-partitioned table is as simple as creating a regular table.