diqiu50 commented on code in PR #4935:
URL: https://github.com/apache/gravitino/pull/4935#discussion_r1772516992

##########
docs/trino-connector/trino-cascading-query.md:
##########
@@ -0,0 +1,155 @@
+---
+title: "Apache Gravitino connector Trino cascading query"
+slug: /trino-connector/trino-cascading-query
+keyword: gravitino connector trino
+license: "This software is licensed under the Apache License version 2."
+---
+
+# Background
+With `Apache Gravitino Trino connector` and `Gravitino Trino cascading connector`, you can implement cascading queries in Trino.
+These connectors allow you to treat other Trino clusters as data sources for the current Trino cluster,
+enabling queries across catalogs in different Trino clusters.
+
+This mechanism prioritizes executing queries in the Trino cluster located in the same region as the data,
+based on the data distribution in the catalogs. By doing so, it significantly reduces the amount of data
+transferred over the network, addressing the performance issues commonly found in traditional federated query engines
+where large volumes of data need to be transmitted across networks.
+
+# Deploying Trino
+
+## Deploying Trino
+
+To set up the Apache Gravitino Trino cascading query environment, you should first deploy two Trino environments,
+and then install the Apache Gravitino Trino connector plugin and Gravitino Trino cascading connector plugin into Trino.
+Please refer to the [Deploying Trino documentation](installation.md) and do the following steps:
+
+1. [Download](https://github.com/apache/gravitino/releases) the Gravitino Trino connector tarball and unpack it.
+   The tarball contains a single top-level directory `gravitino-trino-connector-<version>`; rename the directory to `gravitino`.
+2. [Download](https://github.com/datastrato/trino-cascading-connector/releases) the Gravitino Trino cascading connector tarball and unpack it.
+   The tarball contains a single top-level directory `gravitino-trino-cascading-connector-<version>`; rename the directory to `trino`.
+3. Copy the two connector directories to Trino's plugin directory.
+   Normally, the directory location is `Trino-server-<version>/plugin`, and the directory contains other catalogs used by Trino.
+
+Ensure that the `plugin` directory contains `gravitino` and
+`trino` subdirectories. Two Trino clusters need to be deployed on machines with hostnames `c1-trino` and `c2-trino`.

Review Comment:
   Yes

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
