This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/arrow-datafusion.git
The following commit(s) were added to refs/heads/asf-site by this push:
new 34b686493b Publish built docs triggered by
89ee9b0c9b27324a3662e5b50b56902eef7d7749
34b686493b is described below
commit 34b686493bef2207d04f0f767ffc078793bfee6b
Author: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
AuthorDate: Wed Feb 21 01:10:06 2024 +0000
Publish built docs triggered by 89ee9b0c9b27324a3662e5b50b56902eef7d7749
---
_sources/user-guide/sql/dml.md.txt | 12 ++++++++++++
_sources/user-guide/sql/write_options.md.txt | 8 +++++---
searchindex.js | 2 +-
user-guide/sql/dml.html | 10 ++++++++++
user-guide/sql/write_options.html | 5 +++++
5 files changed, 33 insertions(+), 4 deletions(-)
diff --git a/_sources/user-guide/sql/dml.md.txt
b/_sources/user-guide/sql/dml.md.txt
index 79b1d6625e..405e77a21b 100644
--- a/_sources/user-guide/sql/dml.md.txt
+++ b/_sources/user-guide/sql/dml.md.txt
@@ -57,6 +57,18 @@ files in the `dir_name` directory:
+-------+
```
+Copy the contents of `source_table` to multiple directories
+of hive-style partitioned parquet files:
+
+```sql
+> COPY source_table TO 'dir_name' (FORMAT parquet, partition_by 'column1,
column2');
++-------+
+| count |
++-------+
+| 2 |
++-------+
+```
+
Run the query `SELECT * from source ORDER BY time` and write the
results (maintaining the order) to a parquet file named
`output.parquet` with a maximum parquet row group size of 10MB:
diff --git a/_sources/user-guide/sql/write_options.md.txt
b/_sources/user-guide/sql/write_options.md.txt
index 09d51903f4..ac0a41a97f 100644
--- a/_sources/user-guide/sql/write_options.md.txt
+++ b/_sources/user-guide/sql/write_options.md.txt
@@ -56,6 +56,7 @@ TO 'test/table_with_options'
(format parquet,
compression snappy,
'compression::col1' 'zstd(5)',
+partition_by 'column3, column4'
)
```
@@ -67,9 +68,10 @@ In this example, we write the entirety of `source_table` out
to a folder of parq
The following special options are specific to the `COPY` command.
-| Option | Description
| Default Value |
-| ------ |
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| ------------- |
-| FORMAT | Specifies the file format COPY query will write out. If there're
more than one output file or the format cannot be inferred from the file
extension, then FORMAT must be specified. | N/A |
+| Option | Description
| Default Value |
+| ------------ |
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| ------------- |
+| FORMAT | Specifies the file format COPY query will write out. If
there're more than one output file or the format cannot be inferred from the
file extension, then FORMAT must be specified. | N/A |
+| PARTITION_BY | Specifies the columns that the output files should be
partitioned by into separate hive-style directories. Value should be a comma
separated string literal, e.g. 'col1,col2' | N/A |
### JSON Format Specific Options
diff --git a/searchindex.js b/searchindex.js
index 4658c9502f..85a92acd1a 100644
--- a/searchindex.js
+++ b/searchindex.js
@@ -1 +1 @@
-Search.setIndex({"docnames": ["contributor-guide/architecture",
"contributor-guide/communication", "contributor-guide/index",
"contributor-guide/quarterly_roadmap", "contributor-guide/roadmap",
"contributor-guide/specification/index",
"contributor-guide/specification/invariants",
"contributor-guide/specification/output-field-name-semantic", "index",
"library-user-guide/adding-udfs", "library-user-guide/building-logical-plans",
"library-user-guide/catalogs", "library-user-guide/custom-tab [...]
\ No newline at end of file
+Search.setIndex({"docnames": ["contributor-guide/architecture",
"contributor-guide/communication", "contributor-guide/index",
"contributor-guide/quarterly_roadmap", "contributor-guide/roadmap",
"contributor-guide/specification/index",
"contributor-guide/specification/invariants",
"contributor-guide/specification/output-field-name-semantic", "index",
"library-user-guide/adding-udfs", "library-user-guide/building-logical-plans",
"library-user-guide/catalogs", "library-user-guide/custom-tab [...]
\ No newline at end of file
diff --git a/user-guide/sql/dml.html b/user-guide/sql/dml.html
index fe6c0cb8be..e031144695 100644
--- a/user-guide/sql/dml.html
+++ b/user-guide/sql/dml.html
@@ -440,6 +440,16 @@ files in the <code class="docutils literal
notranslate"><span class="pre">dir_na
<span class="o">+</span><span class="c1">-------+</span>
</pre></div>
</div>
+<p>Copy the contents of <code class="docutils literal notranslate"><span
class="pre">source_table</span></code> to multiple directories
+of hive-style partitioned parquet files:</p>
+<div class="highlight-sql notranslate"><div
class="highlight"><pre><span></span><span class="o">></span><span class="w">
</span><span class="k">COPY</span><span class="w"> </span><span
class="n">source_table</span><span class="w"> </span><span
class="k">TO</span><span class="w"> </span><span
class="s1">'dir_name'</span><span class="w"> </span><span
class="p">(</span><span class="n">FORMAT</span><span class="w"> </span><span
class="n">parquet</span><span class="p">,</span><span [...]
+<span class="o">+</span><span class="c1">-------+</span>
+<span class="o">|</span><span class="w"> </span><span
class="k">count</span><span class="w"> </span><span class="o">|</span>
+<span class="o">+</span><span class="c1">-------+</span>
+<span class="o">|</span><span class="w"> </span><span class="mi">2</span><span
class="w"> </span><span class="o">|</span>
+<span class="o">+</span><span class="c1">-------+</span>
+</pre></div>
+</div>
<p>Run the query <code class="docutils literal notranslate"><span
class="pre">SELECT</span> <span class="pre">*</span> <span
class="pre">from</span> <span class="pre">source</span> <span
class="pre">ORDER</span> <span class="pre">BY</span> <span
class="pre">time</span></code> and write the
results (maintaining the order) to a parquet file named
<code class="docutils literal notranslate"><span
class="pre">output.parquet</span></code> with a maximum parquet row group size
of 10MB:</p>
diff --git a/user-guide/sql/write_options.html
b/user-guide/sql/write_options.html
index e99ac04599..31ba9d451b 100644
--- a/user-guide/sql/write_options.html
+++ b/user-guide/sql/write_options.html
@@ -462,6 +462,7 @@
<span class="p">(</span><span class="n">format</span><span class="w">
</span><span class="n">parquet</span><span class="p">,</span>
<span class="n">compression</span><span class="w"> </span><span
class="n">snappy</span><span class="p">,</span>
<span class="s1">'compression::col1'</span><span class="w">
</span><span class="s1">'zstd(5)'</span><span class="p">,</span>
+<span class="n">partition_by</span><span class="w"> </span><span
class="s1">'column3, column4'</span>
<span class="p">)</span>
</pre></div>
</div>
@@ -484,6 +485,10 @@
<td><p>Specifies the file format COPY query will write out. If there’re more
than one output file or the format cannot be inferred from the file extension,
then FORMAT must be specified.</p></td>
<td><p>N/A</p></td>
</tr>
+<tr class="row-odd"><td><p>PARTITION_BY</p></td>
+<td><p>Specifies the columns that the output files should be partitioned by
into separate hive-style directories. Value should be a comma separated string
literal, e.g. ‘col1,col2’</p></td>
+<td><p>N/A</p></td>
+</tr>
</tbody>
</table>
</section>