Github user kiszk commented on a diff in the pull request:
https://github.com/apache/spark/pull/22746#discussion_r226241683
--- Diff: docs/sql-distributed-sql-engine.md ---
@@ -0,0 +1,85 @@
+---
+layout: global
+title: Distributed SQL Engine
+displayTitle: Distributed SQL Engine
+---
+
+* Table of contents
+{:toc}
+
+Spark SQL can also act as a distributed query engine using its JDBC/ODBC or command-line interface.
+In this mode, end-users or applications can interact with Spark SQL directly to run SQL queries,
+without the need to write any code.
+
+## Running the Thrift JDBC/ODBC server
+
+The Thrift JDBC/ODBC server implemented here corresponds to the [`HiveServer2`](https://cwiki.apache.org/confluence/display/Hive/Setting+Up+HiveServer2)
+in Hive 1.2.1. You can test the JDBC server with the beeline script that comes with either Spark or Hive 1.2.1.
+
+To start the JDBC/ODBC server, run the following in the Spark directory:
+
+ ./sbin/start-thriftserver.sh
+
+This script accepts all `bin/spark-submit` command line options, plus a `--hiveconf` option to
+specify Hive properties. You may run `./sbin/start-thriftserver.sh --help` for a complete list of
+all available options. By default, the server listens on localhost:10000. You may override this
+behaviour via either environment variables, e.g.:
+
+{% highlight bash %}
+export HIVE_SERVER2_THRIFT_PORT=<listening-port>
+export HIVE_SERVER2_THRIFT_BIND_HOST=<listening-host>
+./sbin/start-thriftserver.sh \
+ --master <master-uri> \
+ ...
+{% endhighlight %}
+
+or system properties:
+
+{% highlight bash %}
+./sbin/start-thriftserver.sh \
+ --hiveconf hive.server2.thrift.port=<listening-port> \
+ --hiveconf hive.server2.thrift.bind.host=<listening-host> \
+  --master <master-uri> \
+ ...
+{% endhighlight %}
+
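+For example, a concrete invocation might look like the following (the port, bind address and
+master URL are illustrative values only, not defaults mandated by Spark):
+
+{% highlight bash %}
+./sbin/start-thriftserver.sh \
+  --hiveconf hive.server2.thrift.port=10010 \
+  --hiveconf hive.server2.thrift.bind.host=0.0.0.0 \
+  --master local[*]
+{% endhighlight %}
+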
+Now you can use beeline to test the Thrift JDBC/ODBC server:
+
+ ./bin/beeline
+
+Connect to the JDBC/ODBC server in beeline with:
+
+ beeline> !connect jdbc:hive2://localhost:10000
+
+Beeline will ask you for a username and password. In non-secure mode, simply enter the username on
+your machine and a blank password. For secure mode, please follow the instructions given in the
+[beeline documentation](https://cwiki.apache.org/confluence/display/Hive/HiveServer2+Clients).
+
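+The connection URL and username can also be supplied directly on the beeline command line, so a
+non-interactive connection in non-secure mode might look like this (replace `<username>` with the
+username on your machine):
+
+{% highlight bash %}
+./bin/beeline -u jdbc:hive2://localhost:10000 -n <username>
+{% endhighlight %}
+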
+Configuration of Hive is done by placing your `hive-site.xml`, `core-site.xml` and `hdfs-site.xml` files in `conf/`.
+
+You may also use the beeline script that comes with Hive.
+
+The Thrift JDBC/ODBC server also supports sending Thrift RPC messages over HTTP transport.
+Use the following settings to enable HTTP mode, either as system properties or in the `hive-site.xml` file in `conf/`:
+
+ hive.server2.transport.mode - Set this to value: http
+    hive.server2.thrift.http.port - HTTP port number to listen on; default is 10001
+ hive.server2.http.endpoint - HTTP endpoint; default is cliservice
+
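+For example, HTTP mode can be enabled when starting the server by passing the same properties as
+`--hiveconf` options (the port shown is the default noted above):
+
+{% highlight bash %}
+./sbin/start-thriftserver.sh \
+  --hiveconf hive.server2.transport.mode=http \
+  --hiveconf hive.server2.thrift.http.port=10001
+{% endhighlight %}
+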
+To test, use beeline to connect to the JDBC/ODBC server in HTTP mode with:
+
+    beeline> !connect jdbc:hive2://<host>:<port>/<database>?hive.server2.transport.mode=http;hive.server2.thrift.http.path=<http_endpoint>
+
+
+## Running the Spark SQL CLI
+
+The Spark SQL CLI is a convenient tool to run the Hive metastore service in local mode and execute
+queries input from the command line. Note that the Spark SQL CLI cannot talk to the Thrift JDBC server.
+
+To start the Spark SQL CLI, run the following in the Spark directory:
+
+ ./bin/spark-sql
+
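+A single statement can also be passed on the command line with the `-e` option; the query below is
+only an illustration:
+
+{% highlight bash %}
+./bin/spark-sql -e "SHOW TABLES"
+{% endhighlight %}
+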
+Configuration of Hive is done by placing your `hive-site.xml`, `core-site.xml` and `hdfs-site.xml` files in `conf/`.
+You may run `./bin/spark-sql --help` for a complete list of all available
+options.
--- End diff ---
super nit: this line can be concatenated with the previous line.
---