adoroszlai commented on code in PR #7584: URL: https://github.com/apache/ozone/pull/7584#discussion_r1906465588
########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. + +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala + +Impala provides two approaches to interact with Ozone: + +* Managed Tables +* External Tables + +## Managed Tables + +If the Hive Warehouse Directory is located in Ozone, you can execute Impala queries without any changes, treating the Ozone file system like HDFS. 
For example: + +```sql + CREATE DATABASE d1; +``` + +```sql + CREATE TABLE t1 (x INT, s STRING); Review Comment: ```suggestion CREATE TABLE t1 (x INT, s STRING); ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + Review Comment: This looks odd in the _Application Integrations_ page. ```suggestion ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. 
You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. + +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala + +Impala provides two approaches to interact with Ozone: + +* Managed Tables +* External Tables + +## Managed Tables + +If the Hive Warehouse Directory is located in Ozone, you can execute Impala queries without any changes, treating the Ozone file system like HDFS. For example: + +```sql + CREATE DATABASE d1; +``` + +```sql + CREATE TABLE t1 (x INT, s STRING); +``` + +The data will be stored under the Hive Warehouse Directory path in Ozone. + +### Specifying a Custom Ozone Path + +You can create managed databases, tables, or partitions at a specific Ozone path using the `LOCATION` clause. 
Example: + +```sql + CREATE DATABASE d1 LOCATION 'ofs://ozone1/vol1/bucket1/d1.db'; +``` + +```sql + CREATE TABLE t1 LOCATION 'ofs://ozone1/vol1/bucket1/table1'; +``` + +## External Tables Review Comment: ```suggestion ### External Tables ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. 
+ +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala + +Impala provides two approaches to interact with Ozone: + +* Managed Tables +* External Tables + +## Managed Tables + +If the Hive Warehouse Directory is located in Ozone, you can execute Impala queries without any changes, treating the Ozone file system like HDFS. For example: + +```sql + CREATE DATABASE d1; Review Comment: The `CREATE EXTERNAL TABLE` example is not indented, so I think the others can be unindented as well. ```suggestion CREATE DATABASE d1; ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. 
+ +# Supported Replication Types Review Comment: ```suggestion ## Supported Replication Types ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. + +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala Review Comment: ```suggestion ## Querying Ozone Data with Impala ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. 
See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. + +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala + +Impala provides two approaches to interact with Ozone: + +* Managed Tables +* External Tables + +## Managed Tables + +If the Hive Warehouse Directory is located in Ozone, you can execute Impala queries without any changes, treating the Ozone file system like HDFS. For example: + +```sql + CREATE DATABASE d1; +``` + +```sql + CREATE TABLE t1 (x INT, s STRING); +``` + +The data will be stored under the Hive Warehouse Directory path in Ozone. + +### Specifying a Custom Ozone Path + +You can create managed databases, tables, or partitions at a specific Ozone path using the `LOCATION` clause. 
Example: + +```sql + CREATE DATABASE d1 LOCATION 'ofs://ozone1/vol1/bucket1/d1.db'; Review Comment: ```suggestion CREATE DATABASE d1 LOCATION 'ofs://ozone1/vol1/bucket1/d1.db'; ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols Review Comment: Impala is rendered as h1, so I think these sections should be h2. ```suggestion ## Supported Access Protocols ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. 
+ The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. + +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala + +Impala provides two approaches to interact with Ozone: + +* Managed Tables +* External Tables + +## Managed Tables + +If the Hive Warehouse Directory is located in Ozone, you can execute Impala queries without any changes, treating the Ozone file system like HDFS. For example: + +```sql + CREATE DATABASE d1; +``` + +```sql + CREATE TABLE t1 (x INT, s STRING); +``` + +The data will be stored under the Hive Warehouse Directory path in Ozone. + +### Specifying a Custom Ozone Path + +You can create managed databases, tables, or partitions at a specific Ozone path using the `LOCATION` clause. 
Example: + +```sql + CREATE DATABASE d1 LOCATION 'ofs://ozone1/vol1/bucket1/d1.db'; +``` + +```sql + CREATE TABLE t1 LOCATION 'ofs://ozone1/vol1/bucket1/table1'; Review Comment: ```suggestion CREATE TABLE t1 LOCATION 'ofs://ozone1/vol1/bucket1/table1'; ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. 
+ +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala + +Impala provides two approaches to interact with Ozone: + +* Managed Tables +* External Tables + +## Managed Tables + +If the Hive Warehouse Directory is located in Ozone, you can execute Impala queries without any changes, treating the Ozone file system like HDFS. For example: + +```sql + CREATE DATABASE d1; +``` + +```sql + CREATE TABLE t1 (x INT, s STRING); +``` + +The data will be stored under the Hive Warehouse Directory path in Ozone. + +### Specifying a Custom Ozone Path + +You can create managed databases, tables, or partitions at a specific Ozone path using the `LOCATION` clause. Example: + +```sql + CREATE DATABASE d1 LOCATION 'ofs://ozone1/vol1/bucket1/d1.db'; +``` + +```sql + CREATE TABLE t1 LOCATION 'ofs://ozone1/vol1/bucket1/table1'; +``` + +## External Tables + +You can create an external table in Impala to query Ozone data. For example: + +```sql +CREATE EXTERNAL TABLE external_table ( + id INT, + name STRING +) +LOCATION 'ofs://ozone1/vol1/bucket1/table1'; +``` + +* With external tables, the data is expected to be created and managed by another tool. +* Impala queries the data as-is. +* The metadata is stored under the external warehouse directory. +* Note: Dropping an external table in Impala does not delete the associated data. + + +# Using the S3A Protocol Review Comment: ```suggestion ## Using the S3A Protocol ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. 
+ The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. + +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala + +Impala provides two approaches to interact with Ozone: + +* Managed Tables +* External Tables + +## Managed Tables Review Comment: ```suggestion ### Managed Tables ``` ########## hadoop-hdds/docs/content/integration/Impala.md: ########## @@ -0,0 +1,103 @@ +--- +title: Impala +weight: 4 +menu: + main: + parent: "Application Integrations" +--- +<!--- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. 
You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + +# Overview + +Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. + +# Supported Access Protocols + +Impala supports the following protocols for accessing Ozone data: + +* ofs +* s3a + +Note: The o3fs protocol is **NOT** supported by Impala. + +# Supported Replication Types + +Impala is compatible with Ozone buckets configured with either: + +* RATIS (Replication) +* Erasure Coding + +# Querying Ozone Data with Impala + +Impala provides two approaches to interact with Ozone: + +* Managed Tables +* External Tables + +## Managed Tables + +If the Hive Warehouse Directory is located in Ozone, you can execute Impala queries without any changes, treating the Ozone file system like HDFS. For example: + +```sql + CREATE DATABASE d1; +``` + +```sql + CREATE TABLE t1 (x INT, s STRING); +``` + +The data will be stored under the Hive Warehouse Directory path in Ozone. + +### Specifying a Custom Ozone Path Review Comment: ```suggestion #### Specifying a Custom Ozone Path ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
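For reference, the heading suggestions in this review can be read together as one outline. The sketch below assumes every suggestion is applied as written, including the empty suggestion that drops the `# Overview` heading; it lists only the headings quoted in the hunks of this message, so anything in the file outside those hunks is not shown.

```markdown
## Supported Access Protocols
## Supported Replication Types
## Querying Ozone Data with Impala
### Managed Tables
#### Specifying a Custom Ozone Path
### External Tables
## Using the S3A Protocol
```

The page title (`title: Impala` in the front matter) is rendered as h1, which is why the top-level sections move to h2 and each nested section shifts down one level.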
