-
Notifications
You must be signed in to change notification settings - Fork 29.3k
[SPARK-37558][DOC] Improve spark sql cli document #34821
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 1 commit
01848dc
7a9ecbf
a0218f8
1c59308
cea5be4
8eb04bf
36ce29f
21fb349
0c77217
e274700
6d6b0cf
a06b166
8d2fa9a
4a7ca98
45ec6a0
403d966
d9f150c
453f260
d4f6778
29f3b45
2b6f130
147d8b0
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,169 @@ | ||
| --- | ||
| layout: global | ||
| title: Spark SQL CLI | ||
| displayTitle: Spark SQL CLI | ||
| license: | | ||
| Licensed to the Apache Software Foundation (ASF) under one or more | ||
| contributor license agreements. See the NOTICE file distributed with | ||
| this work for additional information regarding copyright ownership. | ||
| The ASF licenses this file to You under the Apache License, Version 2.0 | ||
| (the "License"); you may not use this file except in compliance with | ||
| the License. You may obtain a copy of the License at | ||
|
|
||
| http://www.apache.org/licenses/LICENSE-2.0 | ||
|
|
||
| Unless required by applicable law or agreed to in writing, software | ||
| distributed under the License is distributed on an "AS IS" BASIS, | ||
| WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
| See the License for the specific language governing permissions and | ||
| limitations under the License. | ||
| --- | ||
|
|
||
| * Table of contents | ||
| {:toc} | ||
|
|
||
|
|
||
| The Spark SQL CLI is a convenient tool to run the Hive metastore service in local mode and execute | ||
|
AngersZhuuuu marked this conversation as resolved.
Outdated
|
||
| queries input from the command line. Note that the Spark SQL CLI cannot talk to the Thrift JDBC server. | ||
|
|
||
| To start the Spark SQL CLI, run the following in the Spark directory: | ||
|
|
||
| ./bin/spark-sql | ||
|
|
||
| Configuration of Hive is done by placing your `hive-site.xml`, `core-site.xml` and `hdfs-site.xml` files in `conf/`. | ||
|
AngersZhuuuu marked this conversation as resolved.
|
||
|
|
||
| ## Spark SQL Command Line Options | ||
|
|
||
| You may run `./bin/spark-sql --help` for a complete list of all available options. | ||
|
|
||
| CLI options: | ||
| -d,--define <key=value> Variable substitution to apply to Hive | ||
| commands. e.g. -d A=B or --define A=B | ||
| --database <databasename> Specify the database to use | ||
| -e <quoted-query-string> SQL from command line | ||
| -f <filename> SQL from files | ||
| -H,--help Print help information | ||
| --hiveconf <property=value> Use value for given property | ||
| --hivevar <key=value> Variable substitution to apply to Hive | ||
|
cloud-fan marked this conversation as resolved.
|
||
| commands. e.g. --hivevar A=B | ||
| -i <filename> Initialization SQL file | ||
| -S,--silent Silent mode in interactive shell | ||
| -v,--verbose Verbose mode (echo executed SQL to the | ||
| console) | ||
|
|
||
| ## The hiverc File | ||
|
|
||
| The Spark SQL CLI when invoked without the `-i` option will attempt to load `$HIVE_HOME/bin/.hiverc` and `$HOME/.hiverc` as initialization files. | ||
|
AngersZhuuuu marked this conversation as resolved.
Outdated
|
||
|
|
||
| ## Spark SQL CLI Interactive Shell Commands | ||
|
|
||
| When `$SPARK__HOME/bin/spark-sql` is run without either the `-e` or `-f` option, it enters interactive shell mode. | ||
| Use `;` (semicolon) to terminate commands, but user can escape `;` by `\\;`. Comments in scripts can be specified using the `--` prefix. | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. What if
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think we should explain more about
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. How about current? |
||
|
|
||
| <table class="table"> | ||
| <tr><th>Command</th><th>Description</th></tr> | ||
| <tr> | ||
| <td><code>quit</code> <code>exit</code></td> | ||
| <td>Use <code>quit</code> or <code>exit</code> to leave the interactive shell.</td> | ||
| </tr> | ||
| <tr> | ||
| <td><code>!<command></code></td> | ||
| <td>Executes a shell command from the Spark SQL CLI shell.</td> | ||
| </tr> | ||
| <tr> | ||
| <td><code>dfs <dfs command></code></td> | ||
| <td>Executes a dfs command from the Hive shell.</td> | ||
|
AngersZhuuuu marked this conversation as resolved.
Outdated
|
||
| </tr> | ||
| <tr> | ||
| <td><code><query string></code></td> | ||
| <td>Executes a Spark SQL query and prints results to standard output.</td> | ||
| </tr> | ||
| <tr> | ||
| <td><code>source <filepath></code></td> | ||
| <td>Executes a script file inside the CLI.</td> | ||
| </tr> | ||
| </table> | ||
|
|
||
| ## Supported comment type | ||
|
AngersZhuuuu marked this conversation as resolved.
Outdated
|
||
|
|
||
| <table class="table"> | ||
| <tr><th>Comment</th><th>Example</th></tr> | ||
| <tr> | ||
| <td>simple comment</td> | ||
| <td> | ||
| <code> | ||
| -- This is a simple comment. | ||
| <br> | ||
| SELECT 1; | ||
| </code> | ||
| </td> | ||
| </tr> | ||
| <tr> | ||
| <td>bracketed comment</td> | ||
| <td> | ||
| <code> | ||
| /* This is a bracketed comment. */ | ||
| <br> | ||
| SELECT 1; | ||
| </code> | ||
| </td> | ||
| </tr> | ||
| <tr> | ||
| <td>nested bracketed comment</td> | ||
| <td> | ||
| <code> | ||
| /* This is a /* nested bracketed comment*/ .*/ | ||
| <br> | ||
| SELECT 1; | ||
| </code> | ||
| </td> | ||
| </tr> | ||
| </table> | ||
|
|
||
| ## Examples | ||
|
|
||
| See Variable Substitution for examples of using the hiveconf option. | ||
|
AngersZhuuuu marked this conversation as resolved.
Outdated
|
||
|
|
||
|
|
||
| Example of running a query from the command line | ||
|
AngersZhuuuu marked this conversation as resolved.
Outdated
|
||
|
|
||
| ./bin/spark-sql -e 'SELECT COL FROM TBL' | ||
|
|
||
| Example of setting Hive configuration variables | ||
|
|
||
| ./bin/spark-sql -e 'SELECT COL FROM TBL' --hiveconf hive.exec.scratchdir=/home/my/hive_scratch --hiveconf mapred.reduce.tasks=32 | ||
|
|
||
| Example of dumping data out from a query into a file using silent mode | ||
|
|
||
| ./bin/spark-sql -S -e 'SELECT COL FROM TBL' > result.txt | ||
|
|
||
| Example of running a script non-interactively from local disk | ||
|
|
||
| ./bin/spark-sql -f /path/to/spark-sql-script.sql | ||
|
cloud-fan marked this conversation as resolved.
|
||
|
|
||
| Example of running a script non-interactively from a Hadoop supported filesystem | ||
|
|
||
| ./bin/spark-sql -f hdfs://<namenode>:<port>/spark-sql-script.sql | ||
| ./bin/spark-sql -f s3://mys3bucket/spark-sql-script.sql | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. does this apply to the init file as well?
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Yes, they call same method
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. then shall we have a individual section about path interpretation?
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Yea, necessary, it's not same as normal usage.
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. How about current. |
||
|
|
||
| Example of running an initialization script before entering interactive mode | ||
|
|
||
| ./bin/spark-sql -i /path/to/spark-sql-init.sql | ||
|
|
||
| Example of entering interactive mode | ||
|
|
||
| ./bin/spark-sql | ||
| spark-sql> SELECT 1; | ||
| 1 | ||
| spark-sql> -- This is a simple comment. | ||
| spark-sql> SELECT 1; | ||
| 1 | ||
|
|
||
| Example of entering interactive mode with escape `;` in comment | ||
|
|
||
| ./bin/spark-sql | ||
| spark-sql>/* This is a comment contains \\; | ||
| > It won't be terminaled by \\; */ | ||
| > SELECT 1; | ||
| 1 | ||
|
|
||
Uh oh!
There was an error while loading. Please reload this page.