dengzhhu653 commented on code in PR #5629:
URL: https://github.com/apache/hive/pull/5629#discussion_r1948430664


##########
packaging/src/docker/README.md:
##########
@@ -210,3 +210,61 @@ docker compose down
 select count(distinct a) from hive_example;
 select sum(b) from hive_example;
 ```
+
+#### `sys` Schema and `information_schema` Schema
+
+`Hive Schema Tool` is located in the Docker Image at `/opt/hive/bin/schematool`.
+
+By default, system schemas such as `information_schema` for HiveServer2 are not created.
+To create system schemas for a HiveServer2 instance,
+users need to configure HiveServer2 to use a remote Hive Metastore Server and use a database other than embedded Derby for the Hive Metastore Server.
+
+Assuming `Maven` and `Docker CE` are installed, a possible use case is as follows.
+Create a `compose.yaml` file in the current directory,
+
+```yaml
+services:
+  some-postgres:
+    image: postgres:17.2-bookworm
+    environment:
+      POSTGRES_PASSWORD: "example"
+  metastore-standalone:
+    image: apache/hive:4.0.1
+    depends_on:
+      - some-postgres
+    environment:
+      SERVICE_NAME: metastore
+      DB_DRIVER: postgres
+      SERVICE_OPTS: >-
+        -Djavax.jdo.option.ConnectionDriverName=org.postgresql.Driver
+        -Djavax.jdo.option.ConnectionURL=jdbc:postgresql://some-postgres:5432/postgres
+        -Djavax.jdo.option.ConnectionUserName=postgres
+        -Djavax.jdo.option.ConnectionPassword=example
+    volumes:
+      - ~/.m2/repository/org/postgresql/postgresql/42.7.5/postgresql-42.7.5.jar:/opt/hive/lib/postgres.jar
+  hiveserver2-standalone:
+    image: apache/hive:4.0.1
+    depends_on:
+      - metastore-standalone
+    environment:
+      SERVICE_NAME: hiveserver2
+      IS_RESUME: true
+      SERVICE_OPTS: >-
+        -Djavax.jdo.option.ConnectionDriverName=org.postgresql.Driver
+        -Djavax.jdo.option.ConnectionURL=jdbc:postgresql://some-postgres:5432/postgres
+        -Djavax.jdo.option.ConnectionUserName=postgres
+        -Djavax.jdo.option.ConnectionPassword=example
+        -Dhive.metastore.uris=thrift://metastore-standalone:9083
+    volumes:
+      - ~/.m2/repository/org/postgresql/postgresql/42.7.5/postgresql-42.7.5.jar:/opt/hive/lib/postgres.jar
+```
+
+Then execute the shell command as follows to initialize the system schemas in HiveServer2.
+
+```shell
+mvn dependency:get -Dartifact=org.postgresql:postgresql:42.7.5
+docker compose up -d
+docker compose exec hiveserver2-standalone /bin/bash
+/opt/hive/bin/schematool -initSchema -dbType hive -metaDbType postgres -url jdbc:hive2://localhost:10000/default
+exit
+```


Review Comment:
   No, the `mvn` command is only an example of how to mount an external JDBC driver from Maven into the Docker container. Any other approach works for this purpose, as long as the HMS or HS2 can access the driver.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
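To illustrate the point in the review comment that `mvn` is only one way to make the driver available, here is a minimal sketch of an alternative that is not part of the PR. It assumes the jar is published under the standard Maven Central layout at `repo1.maven.org` and that `curl` is installed on the host. `DRIVER_DIR` and `FETCH` are hypothetical variables introduced for this example only. The default target is the same host path that the `compose.yaml` in the diff mounts, so the compose file would not need to change.

```shell
#!/bin/sh
# Sketch only: an alternative to `mvn dependency:get` for obtaining the driver.
set -e

# Build the jar file name for a given PostgreSQL JDBC driver version.
driver_jar() {
  printf 'postgresql-%s.jar\n' "$1"
}

# Build the download URL, assuming the standard Maven Central repository layout.
driver_url() {
  printf 'https://repo1.maven.org/maven2/org/postgresql/postgresql/%s/%s\n' "$1" "$(driver_jar "$1")"
}

VERSION="42.7.5"  # same driver version as in the README example
# DRIVER_DIR is a hypothetical override; the default is the host path
# that the compose.yaml in the diff mounts into both containers.
DEST_DIR="${DRIVER_DIR:-$HOME/.m2/repository/org/postgresql/postgresql/$VERSION}"

echo "source: $(driver_url "$VERSION")"
echo "target: $DEST_DIR/$(driver_jar "$VERSION")"

# FETCH is a hypothetical switch: download only when FETCH=1, so the
# script can be dry-run without network access.
if [ "${FETCH:-0}" = "1" ]; then
  mkdir -p "$DEST_DIR"
  curl -fsSL -o "$DEST_DIR/$(driver_jar "$VERSION")" "$(driver_url "$VERSION")"
fi
```

Running the script with `FETCH=1` would stand in for the `mvn dependency:get` step; the remaining `docker compose` and `schematool` commands stay as shown in the diff.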