This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/arrow-datafusion.git


The following commit(s) were added to refs/heads/asf-site by this push:
     new 34b686493b Publish built docs triggered by 89ee9b0c9b27324a3662e5b50b56902eef7d7749
34b686493b is described below

commit 34b686493bef2207d04f0f767ffc078793bfee6b
Author: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
AuthorDate: Wed Feb 21 01:10:06 2024 +0000

    Publish built docs triggered by 89ee9b0c9b27324a3662e5b50b56902eef7d7749
---
 _sources/user-guide/sql/dml.md.txt           | 12 ++++++++++++
 _sources/user-guide/sql/write_options.md.txt |  8 +++++---
 searchindex.js                               |  2 +-
 user-guide/sql/dml.html                      | 10 ++++++++++
 user-guide/sql/write_options.html            |  5 +++++
 5 files changed, 33 insertions(+), 4 deletions(-)

diff --git a/_sources/user-guide/sql/dml.md.txt b/_sources/user-guide/sql/dml.md.txt
index 79b1d6625e..405e77a21b 100644
--- a/_sources/user-guide/sql/dml.md.txt
+++ b/_sources/user-guide/sql/dml.md.txt
@@ -57,6 +57,18 @@ files in the `dir_name` directory:
 +-------+
 ```
 
+Copy the contents of `source_table` to multiple directories
+of hive-style partitioned parquet files:
+
+```sql
+> COPY source_table TO 'dir_name' (FORMAT parquet, partition_by 'column1, column2');
++-------+
+| count |
++-------+
+| 2     |
++-------+
+```
+
 Run the query `SELECT * from source ORDER BY time` and write the
 results (maintaining the order) to a parquet file named
 `output.parquet` with a maximum parquet row group size of 10MB:
diff --git a/_sources/user-guide/sql/write_options.md.txt b/_sources/user-guide/sql/write_options.md.txt
index 09d51903f4..ac0a41a97f 100644
--- a/_sources/user-guide/sql/write_options.md.txt
+++ b/_sources/user-guide/sql/write_options.md.txt
@@ -56,6 +56,7 @@ TO 'test/table_with_options'
 (format parquet,
 compression snappy,
 'compression::col1' 'zstd(5)',
+partition_by 'column3, column4'
 )
 ```
 
@@ -67,9 +68,10 @@ In this example, we write the entirety of `source_table` out to a folder of parq
 
 The following special options are specific to the `COPY` command.
 
-| Option | Description                                                                                                                                                                         | Default Value |
-| ------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------- |
-| FORMAT | Specifies the file format COPY query will write out. If there're more than one output file or the format cannot be inferred from the file extension, then FORMAT must be specified. | N/A           |
+| Option       | Description                                                                                                                                                                         | Default Value |
+| ------------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------- |
+| FORMAT       | Specifies the file format COPY query will write out. If there're more than one output file or the format cannot be inferred from the file extension, then FORMAT must be specified. | N/A           |
+| PARTITION_BY | Specifies the columns that the output files should be partitioned by into separate hive-style directories. Value should be a comma separated string literal, e.g. 'col1,col2'       | N/A           |
 
 ### JSON Format Specific Options
 
diff --git a/searchindex.js b/searchindex.js
index 4658c9502f..85a92acd1a 100644
--- a/searchindex.js
+++ b/searchindex.js
@@ -1 +1 @@
-Search.setIndex({"docnames": ["contributor-guide/architecture", 
"contributor-guide/communication", "contributor-guide/index", 
"contributor-guide/quarterly_roadmap", "contributor-guide/roadmap", 
"contributor-guide/specification/index", 
"contributor-guide/specification/invariants", 
"contributor-guide/specification/output-field-name-semantic", "index", 
"library-user-guide/adding-udfs", "library-user-guide/building-logical-plans", 
"library-user-guide/catalogs", "library-user-guide/custom-tab [...]
\ No newline at end of file
+Search.setIndex({"docnames": ["contributor-guide/architecture", 
"contributor-guide/communication", "contributor-guide/index", 
"contributor-guide/quarterly_roadmap", "contributor-guide/roadmap", 
"contributor-guide/specification/index", 
"contributor-guide/specification/invariants", 
"contributor-guide/specification/output-field-name-semantic", "index", 
"library-user-guide/adding-udfs", "library-user-guide/building-logical-plans", 
"library-user-guide/catalogs", "library-user-guide/custom-tab [...]
\ No newline at end of file
diff --git a/user-guide/sql/dml.html b/user-guide/sql/dml.html
index fe6c0cb8be..e031144695 100644
--- a/user-guide/sql/dml.html
+++ b/user-guide/sql/dml.html
@@ -440,6 +440,16 @@ files in the <code class="docutils literal notranslate"><span class="pre">dir_na
 <span class="o">+</span><span class="c1">-------+</span>
 </pre></div>
 </div>
+<p>Copy the contents of <code class="docutils literal notranslate"><span class="pre">source_table</span></code> to multiple directories
+of hive-style partitioned parquet files:</p>
+<div class="highlight-sql notranslate"><div class="highlight"><pre><span></span><span class="o">&gt;</span><span class="w"> </span><span class="k">COPY</span><span class="w"> </span><span class="n">source_table</span><span class="w"> </span><span class="k">TO</span><span class="w"> </span><span class="s1">&#39;dir_name&#39;</span><span class="w"> </span><span class="p">(</span><span class="n">FORMAT</span><span class="w"> </span><span class="n">parquet</span><span class="p">,</span><span [...]
+<span class="o">+</span><span class="c1">-------+</span>
+<span class="o">|</span><span class="w"> </span><span class="k">count</span><span class="w"> </span><span class="o">|</span>
+<span class="o">+</span><span class="c1">-------+</span>
+<span class="o">|</span><span class="w"> </span><span class="mi">2</span><span class="w">     </span><span class="o">|</span>
+<span class="o">+</span><span class="c1">-------+</span>
+</pre></div>
+</div>
 <p>Run the query <code class="docutils literal notranslate"><span class="pre">SELECT</span> <span class="pre">*</span> <span class="pre">from</span> <span class="pre">source</span> <span class="pre">ORDER</span> <span class="pre">BY</span> <span class="pre">time</span></code> and write the
 results (maintaining the order) to a parquet file named
 <code class="docutils literal notranslate"><span class="pre">output.parquet</span></code> with a maximum parquet row group size of 10MB:</p>
diff --git a/user-guide/sql/write_options.html b/user-guide/sql/write_options.html
index e99ac04599..31ba9d451b 100644
--- a/user-guide/sql/write_options.html
+++ b/user-guide/sql/write_options.html
@@ -462,6 +462,7 @@
 <span class="p">(</span><span class="n">format</span><span class="w"> </span><span class="n">parquet</span><span class="p">,</span>
 <span class="n">compression</span><span class="w"> </span><span class="n">snappy</span><span class="p">,</span>
 <span class="s1">&#39;compression::col1&#39;</span><span class="w"> </span><span class="s1">&#39;zstd(5)&#39;</span><span class="p">,</span>
+<span class="n">partition_by</span><span class="w"> </span><span class="s1">&#39;column3, column4&#39;</span>
 <span class="p">)</span>
 </pre></div>
 </div>
@@ -484,6 +485,10 @@
 <td><p>Specifies the file format COPY query will write out. If there’re more than one output file or the format cannot be inferred from the file extension, then FORMAT must be specified.</p></td>
 <td><p>N/A</p></td>
 </tr>
+<tr class="row-odd"><td><p>PARTITION_BY</p></td>
+<td><p>Specifies the columns that the output files should be partitioned by into separate hive-style directories. Value should be a comma separated string literal, e.g. ‘col1,col2’</p></td>
+<td><p>N/A</p></td>
+</tr>
 </tbody>
 </table>
 </section>
