This is an automated email from the ASF dual-hosted git repository.
danny0405 pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/hudi.git
The following commit(s) were added to refs/heads/asf-site by this push:
new dbd66363028 [DOCS] Update Trino engine support (#8769)
dbd66363028 is described below
commit dbd66363028137ba71a9c4acce56bd05f1180152
Author: Sagar Sumit <[email protected]>
AuthorDate: Mon May 22 06:51:31 2023 +0530
[DOCS] Update Trino engine support (#8769)
---
website/docs/querying_data.md | 44 ++++++++++++++--------
.../version-0.12.0/query_engine_setup.md | 37 +++++++++++++++---
.../version-0.12.1/query_engine_setup.md | 37 +++++++++++++++---
.../version-0.12.2/query_engine_setup.md | 41 ++++++++++++++------
.../version-0.12.3/query_engine_setup.md | 41 ++++++++++++++------
.../version-0.13.0/query_engine_setup.md | 41 ++++++++++++++------
6 files changed, 177 insertions(+), 64 deletions(-)
diff --git a/website/docs/querying_data.md b/website/docs/querying_data.md
index 8b12595e19a..5e7b62c2cdd 100644
--- a/website/docs/querying_data.md
+++ b/website/docs/querying_data.md
@@ -320,25 +320,37 @@ Beginning query by connecting hive metastore with presto
client. The presto clie
```
## Trino
+Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using [Hive](https://trino.io/docs/current/connector/hive.html)
connector or the native
+[Hudi](https://trino.io/docs/current/connector/hudi.html) connector (available
since version 398). However, since version 411, Hive connector redirects to
Hudi catalog for reading Hudi tables.
-Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using Hive connector or the native
-Hudi connector. If you're on Trino version **398** or higher, it is
recommended to use the Hudi connector. To learn more
-about the usage of Hudi connector, please check out
-the [connector
documentation](https://trino.io/docs/current/connector/hudi.html). Both the
connectors are on par in
-terms of query support, i.e. 'Snapshot' queries for Copy-On-Write tables and
-'Read Optimized' queries for Merge-On-Read tables.
+### Hive Connector
-:::note
-[Trino](https://trino.io/) (formerly PrestoSQL) was forked off of PrestoDB a
few years ago. Hudi supports 'Snapshot'
-queries for Copy-On-Write tables and 'Read Optimized' queries for
Merge-On-Read tables. This is through the initial
-input format based integration in PrestoDB (pre forking). This approach has
known performance limitations with very
-large tables, which has been since fixed on PrestoDB. We recommend using the
new Hudi connector in Trino (released since
-Trino version 398).
-:::
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+| > = 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. **Redirection to Hudi catalog also supported.** |
+| > = 411 | NA | Snapshot querying on COW tables. Read optimized
querying on MOR tables. Hudi tables can be **only** queried by [table
redirection](https://trino.io/docs/current/connector/hive.html#table-redirection).
|
-To query Hudi tables on Trino using the Hive connector, place
-the
[hudi-trino-bundle](https://mvnrepository.com/artifact/org.apache.hudi/hudi-trino-bundle)
jar into the Hive
-connector installation `<trino_install>/plugin/hive`.
+If you are using Trino version 411 or greater, and also using Hive connector
to query Hudi tables, please set the below config to support table redirection.
+```
+hive.hudi-catalog-name=hudi
+```
+It is recommended to use `hudi-trino-bundle` version 0.12.2 or later for
optimal query performance with Hive connector.
+
+### Hudi Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 398 | NA - can only use Hive connector to query Hudi tables |
Same as that of Hive connector version < 406. |
+| > = 398 | NA - no need to place bundle jars manually, as they are
compile-time dependency | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+
+To learn more about the usage of Hudi connector, please check out
+the [connector
documentation](https://trino.io/docs/current/connector/hudi.html). Both the
connectors are on par in
+terms of query support, i.e. 'Snapshot' queries for COW tables and 'Read
Optimized' queries for MOR
+tables. We have an active [PR](https://github.com/trinodb/trino/pull/16034)
under review to bring the performance of
+Hudi connector on par with Hive connector. Furthermore, we
+expect [MOR table snapshot query](https://github.com/trinodb/trino/pull/14786)
support will soon be added to the Hudi
+connector.
## Impala (3.4 or later)
diff --git a/website/versioned_docs/version-0.12.0/query_engine_setup.md
b/website/versioned_docs/version-0.12.0/query_engine_setup.md
index a581337d0d3..79dfaf81233 100644
--- a/website/versioned_docs/version-0.12.0/query_engine_setup.md
+++ b/website/versioned_docs/version-0.12.0/query_engine_setup.md
@@ -70,13 +70,38 @@ Beginning query by connecting hive metastore with presto
client. The presto clie
```
## Trino
-:::note
-[Trino](https://trino.io/) (formerly PrestoSQL) was forked off of PrestoDB a
few years ago. Hudi supports 'Snapshot' queries for Copy-On-Write tables and
'Read Optimized' queries
-for Merge-On-Read tables. This is through the initial input format based
integration in PrestoDB (pre forking). This approach has
-known performance limitations with very large tables, which has been since
fixed on PrestoDB. We are working on replicating the same fixes on Trino as
well.
-:::
-To query Hudi tables on Trino, please place the `hudi-trino-bundle` jar into
the Hive connector installation `<trino_install>/plugin/hive-hadoop2`.
+Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using [Hive](https://trino.io/docs/current/connector/hive.html)
connector or the native
+[Hudi](https://trino.io/docs/current/connector/hudi.html) connector (available
since version 398). However, since version 411, Hive connector redirects to
Hudi catalog for reading Hudi tables.
+
+### Hive Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+| > = 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. **Redirection to Hudi catalog also supported.** |
+| > = 411 | NA | Snapshot querying on COW tables. Read optimized
querying on MOR tables. Hudi tables can be **only** queried by [table
redirection](https://trino.io/docs/current/connector/hive.html#table-redirection).
|
+
+If you are using Trino version 411 or greater, and also using Hive connector
to query Hudi tables, please set the below config to support table redirection.
+```
+hive.hudi-catalog-name=hudi
+```
+It is recommended to use `hudi-trino-bundle` version 0.12.2 or later for
optimal query performance with Hive connector.
+
+### Hudi Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 398 | NA - can only use Hive connector to query Hudi tables |
Same as that of Hive connector version < 406. |
+| > = 398 | NA - no need to place bundle jars manually, as they are
compile-time dependency | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+
+To learn more about the usage of Hudi connector, please check out
+the [connector
documentation](https://trino.io/docs/current/connector/hudi.html). Both the
connectors are on par in
+terms of query support, i.e. 'Snapshot' queries for COW tables and 'Read
Optimized' queries for MOR
+tables. We have an active [PR](https://github.com/trinodb/trino/pull/16034)
under review to bring the performance of
+Hudi connector on par with Hive connector. Furthermore, we
+expect [MOR table snapshot query](https://github.com/trinodb/trino/pull/14786)
support will soon be added to the Hudi
+connector.
## Hive
diff --git a/website/versioned_docs/version-0.12.1/query_engine_setup.md
b/website/versioned_docs/version-0.12.1/query_engine_setup.md
index 0e79d2cb7d8..03a5c0ebe8c 100644
--- a/website/versioned_docs/version-0.12.1/query_engine_setup.md
+++ b/website/versioned_docs/version-0.12.1/query_engine_setup.md
@@ -70,13 +70,38 @@ Beginning query by connecting hive metastore with presto
client. The presto clie
```
## Trino
-:::note
-[Trino](https://trino.io/) (formerly PrestoSQL) was forked off of PrestoDB a
few years ago. Hudi supports 'Snapshot' queries for Copy-On-Write tables and
'Read Optimized' queries
-for Merge-On-Read tables. This is through the initial input format based
integration in PrestoDB (pre forking). This approach has
-known performance limitations with very large tables, which has been since
fixed on PrestoDB. We are working on replicating the same fixes on Trino as
well.
-:::
-To query Hudi tables on Trino, please place the `hudi-trino-bundle` jar into
the Hive connector installation `<trino_install>/plugin/hive-hadoop2`.
+Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using [Hive](https://trino.io/docs/current/connector/hive.html)
connector or the native
+[Hudi](https://trino.io/docs/current/connector/hudi.html) connector (available
since version 398). However, since version 411, Hive connector redirects to
Hudi catalog for reading Hudi tables.
+
+### Hive Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+| > = 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. **Redirection to Hudi catalog also supported.** |
+| > = 411 | NA | Snapshot querying on COW tables. Read optimized
querying on MOR tables. Hudi tables can be **only** queried by [table
redirection](https://trino.io/docs/current/connector/hive.html#table-redirection).
|
+
+If you are using Trino version 411 or greater, and also using Hive connector
to query Hudi tables, please set the below config to support table redirection.
+```
+hive.hudi-catalog-name=hudi
+```
+It is recommended to use `hudi-trino-bundle` version 0.12.2 or later for
optimal query performance with Hive connector.
+
+### Hudi Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 398 | NA - can only use Hive connector to query Hudi tables |
Same as that of Hive connector version < 406. |
+| > = 398 | NA - no need to place bundle jars manually, as they are
compile-time dependency | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+
+To learn more about the usage of Hudi connector, please check out
+the [connector
documentation](https://trino.io/docs/current/connector/hudi.html). Both the
connectors are on par in
+terms of query support, i.e. 'Snapshot' queries for COW tables and 'Read
Optimized' queries for MOR
+tables. We have an active [PR](https://github.com/trinodb/trino/pull/16034)
under review to bring the performance of
+Hudi connector on par with Hive connector. Furthermore, we
+expect [MOR table snapshot query](https://github.com/trinodb/trino/pull/14786)
support will soon be added to the Hudi
+connector.
## Hive
diff --git a/website/versioned_docs/version-0.12.2/query_engine_setup.md
b/website/versioned_docs/version-0.12.2/query_engine_setup.md
index 5df8c7623b8..d12c3f426cd 100644
--- a/website/versioned_docs/version-0.12.2/query_engine_setup.md
+++ b/website/versioned_docs/version-0.12.2/query_engine_setup.md
@@ -71,20 +71,37 @@ Beginning query by connecting hive metastore with presto
client. The presto clie
## Trino
-Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using Hive connector or the native
-Hudi connector. If you're on Trino version **398** or higher, it is
recommended to use the Hudi connector. To learn more
-about the usage of Hudi connector, please check out the [connector
documentation](https://trino.io/docs/current/connector/hudi.html).
-Both the connectors are on par in terms of query support, i.e. 'Snapshot'
queries for Copy-On-Write tables and
-'Read Optimized' queries for Merge-On-Read tables.
+Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using [Hive](https://trino.io/docs/current/connector/hive.html)
connector or the native
+[Hudi](https://trino.io/docs/current/connector/hudi.html) connector (available
since version 398). However, since version 411, Hive connector redirects to
Hudi catalog for reading Hudi tables.
-:::note
-[Trino](https://trino.io/) (formerly PrestoSQL) was forked off of PrestoDB a
few years ago. Hudi supports 'Snapshot' queries for Copy-On-Write tables and
'Read Optimized' queries
-for Merge-On-Read tables. This is through the initial input format based
integration in PrestoDB (pre forking). This approach has
-known performance limitations with very large tables, which has been since
fixed on PrestoDB.
-We recommend using the new Hudi connector in Trino (released since Trino
version 398).
-:::
+### Hive Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+| > = 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. **Redirection to Hudi catalog also supported.** |
+| > = 411 | NA | Snapshot querying on COW tables. Read optimized
querying on MOR tables. Hudi tables can be **only** queried by [table
redirection](https://trino.io/docs/current/connector/hive.html#table-redirection).
|
-To query Hudi tables on Trino, please place the `hudi-trino-bundle` jar into
the Hive connector installation `<trino_install>/plugin/hive-hadoop2`.
+If you are using Trino version 411 or greater, and also using Hive connector
to query Hudi tables, please set the below config to support table redirection.
+```
+hive.hudi-catalog-name=hudi
+```
+It is recommended to use `hudi-trino-bundle` version 0.12.2 or later for
optimal query performance with Hive connector.
+
+### Hudi Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 398 | NA - can only use Hive connector to query Hudi tables |
Same as that of Hive connector version < 406. |
+| > = 398 | NA - no need to place bundle jars manually, as they are
compile-time dependency | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+
+To learn more about the usage of Hudi connector, please check out
+the [connector
documentation](https://trino.io/docs/current/connector/hudi.html). Both the
connectors are on par in
+terms of query support, i.e. 'Snapshot' queries for COW tables and 'Read
Optimized' queries for MOR
+tables. We have an active [PR](https://github.com/trinodb/trino/pull/16034)
under review to bring the performance of
+Hudi connector on par with Hive connector. Furthermore, we
+expect [MOR table snapshot query](https://github.com/trinodb/trino/pull/14786)
support will soon be added to the Hudi
+connector.
## Hive
diff --git a/website/versioned_docs/version-0.12.3/query_engine_setup.md
b/website/versioned_docs/version-0.12.3/query_engine_setup.md
index 5df8c7623b8..d12c3f426cd 100644
--- a/website/versioned_docs/version-0.12.3/query_engine_setup.md
+++ b/website/versioned_docs/version-0.12.3/query_engine_setup.md
@@ -71,20 +71,37 @@ Beginning query by connecting hive metastore with presto
client. The presto clie
## Trino
-Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using Hive connector or the native
-Hudi connector. If you're on Trino version **398** or higher, it is
recommended to use the Hudi connector. To learn more
-about the usage of Hudi connector, please check out the [connector
documentation](https://trino.io/docs/current/connector/hudi.html).
-Both the connectors are on par in terms of query support, i.e. 'Snapshot'
queries for Copy-On-Write tables and
-'Read Optimized' queries for Merge-On-Read tables.
+Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using [Hive](https://trino.io/docs/current/connector/hive.html)
connector or the native
+[Hudi](https://trino.io/docs/current/connector/hudi.html) connector (available
since version 398). However, since version 411, Hive connector redirects to
Hudi catalog for reading Hudi tables.
-:::note
-[Trino](https://trino.io/) (formerly PrestoSQL) was forked off of PrestoDB a
few years ago. Hudi supports 'Snapshot' queries for Copy-On-Write tables and
'Read Optimized' queries
-for Merge-On-Read tables. This is through the initial input format based
integration in PrestoDB (pre forking). This approach has
-known performance limitations with very large tables, which has been since
fixed on PrestoDB.
-We recommend using the new Hudi connector in Trino (released since Trino
version 398).
-:::
+### Hive Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+| > = 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. **Redirection to Hudi catalog also supported.** |
+| > = 411 | NA | Snapshot querying on COW tables. Read optimized
querying on MOR tables. Hudi tables can be **only** queried by [table
redirection](https://trino.io/docs/current/connector/hive.html#table-redirection).
|
-To query Hudi tables on Trino, please place the `hudi-trino-bundle` jar into
the Hive connector installation `<trino_install>/plugin/hive-hadoop2`.
+If you are using Trino version 411 or greater, and also using Hive connector
to query Hudi tables, please set the below config to support table redirection.
+```
+hive.hudi-catalog-name=hudi
+```
+It is recommended to use `hudi-trino-bundle` version 0.12.2 or later for
optimal query performance with Hive connector.
+
+### Hudi Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 398 | NA - can only use Hive connector to query Hudi tables |
Same as that of Hive connector version < 406. |
+| > = 398 | NA - no need to place bundle jars manually, as they are
compile-time dependency | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+
+To learn more about the usage of Hudi connector, please check out
+the [connector
documentation](https://trino.io/docs/current/connector/hudi.html). Both the
connectors are on par in
+terms of query support, i.e. 'Snapshot' queries for COW tables and 'Read
Optimized' queries for MOR
+tables. We have an active [PR](https://github.com/trinodb/trino/pull/16034)
under review to bring the performance of
+Hudi connector on par with Hive connector. Furthermore, we
+expect [MOR table snapshot query](https://github.com/trinodb/trino/pull/14786)
support will soon be added to the Hudi
+connector.
## Hive
diff --git a/website/versioned_docs/version-0.13.0/query_engine_setup.md
b/website/versioned_docs/version-0.13.0/query_engine_setup.md
index 5df8c7623b8..d12c3f426cd 100644
--- a/website/versioned_docs/version-0.13.0/query_engine_setup.md
+++ b/website/versioned_docs/version-0.13.0/query_engine_setup.md
@@ -71,20 +71,37 @@ Beginning query by connecting hive metastore with presto
client. The presto clie
## Trino
-Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using Hive connector or the native
-Hudi connector. If you're on Trino version **398** or higher, it is
recommended to use the Hudi connector. To learn more
-about the usage of Hudi connector, please check out the [connector
documentation](https://trino.io/docs/current/connector/hudi.html).
-Both the connectors are on par in terms of query support, i.e. 'Snapshot'
queries for Copy-On-Write tables and
-'Read Optimized' queries for Merge-On-Read tables.
+Just like PrestoDB, there are two ways to query Hudi tables using Trino i.e.
either using [Hive](https://trino.io/docs/current/connector/hive.html)
connector or the native
+[Hudi](https://trino.io/docs/current/connector/hudi.html) connector (available
since version 398). However, since version 411, Hive connector redirects to
Hudi catalog for reading Hudi tables.
-:::note
-[Trino](https://trino.io/) (formerly PrestoSQL) was forked off of PrestoDB a
few years ago. Hudi supports 'Snapshot' queries for Copy-On-Write tables and
'Read Optimized' queries
-for Merge-On-Read tables. This is through the initial input format based
integration in PrestoDB (pre forking). This approach has
-known performance limitations with very large tables, which has been since
fixed on PrestoDB.
-We recommend using the new Hudi connector in Trino (released since Trino
version 398).
-:::
+### Hive Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+| > = 406 | Requires the `hudi-trino-bundle` jar to be placed into
`<trino_install>/plugin/hive` | Snapshot querying on COW tables. Read optimized
querying on MOR tables. **Redirection to Hudi catalog also supported.** |
+| > = 411 | NA | Snapshot querying on COW tables. Read optimized
querying on MOR tables. Hudi tables can be **only** queried by [table
redirection](https://trino.io/docs/current/connector/hive.html#table-redirection).
|
-To query Hudi tables on Trino, please place the `hudi-trino-bundle` jar into
the Hive connector installation `<trino_install>/plugin/hive-hadoop2`.
+If you are using Trino version 411 or greater, and also using Hive connector
to query Hudi tables, please set the below config to support table redirection.
+```
+hive.hudi-catalog-name=hudi
+```
+It is recommended to use `hudi-trino-bundle` version 0.12.2 or later for
optimal query performance with Hive connector.
+
+### Hudi Connector
+
+| **Trino Version** | **Installation description** | **Query types supported**
|
+|-------------------|------------------------------|---------------------------|
+| < 398 | NA - can only use Hive connector to query Hudi tables |
Same as that of Hive connector version < 406. |
+| > = 398 | NA - no need to place bundle jars manually, as they are
compile-time dependency | Snapshot querying on COW tables. Read optimized
querying on MOR tables. |
+
+To learn more about the usage of Hudi connector, please check out
+the [connector
documentation](https://trino.io/docs/current/connector/hudi.html). Both the
connectors are on par in
+terms of query support, i.e. 'Snapshot' queries for COW tables and 'Read
Optimized' queries for MOR
+tables. We have an active [PR](https://github.com/trinodb/trino/pull/16034)
under review to bring the performance of
+Hudi connector on par with Hive connector. Furthermore, we
+expect [MOR table snapshot query](https://github.com/trinodb/trino/pull/14786)
support will soon be added to the Hudi
+connector.
## Hive