This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/parquet-site.git
The following commit(s) were added to refs/heads/asf-site by this push:
new f699052 deploy: 4557062b324902b0543855cda7746621abe3fcd5
f699052 is described below
commit f6990524087cc278390482369ba3bfd2590230de
Author: wgtmac <[email protected]>
AuthorDate: Wed Feb 5 10:02:20 2025 +0000
deploy: 4557062b324902b0543855cda7746621abe3fcd5
---
output/docs/_print/index.html | 2 +-
output/docs/file-format/_print/index.html | 2 +-
.../file-format/implementationstatus/index.html | 18 +++---
output/docs/file-format/index.xml | 73 ++++++++++++++++++++--
output/docs/index.xml | 73 ++++++++++++++++++++--
output/sitemap.xml | 2 +-
6 files changed, 150 insertions(+), 20 deletions(-)
diff --git a/output/docs/_print/index.html b/output/docs/_print/index.html
index 12a68eb..360960a 100644
--- a/output/docs/_print/index.html
+++ b/output/docs/_print/index.html
@@ -814,7 +814,7 @@ indices, and page offsets to scan in each column. The
reader can then
initialize a scanner for each column and fast forward them to the start row of
the scan.</p><p>The <code>min_values</code> and <code>max_values</code> are
calculated based on the <code>column_orders</code>
field in the <code>FileMetaData</code> struct of the footer.</p></div><div
class=td-content style=page-break-before:always><h1
id=pg-e0ad5830788d45de8b55e0c2b119349a>3.10 - Implementation status</h1><p>This
page summarizes the features supported by different Parquet
-implementations.</p><p><em>Note</em>: This is a work in progress and we would
welcome help expanding its scope.</p><h3 id=legend>Legend</h3><p>The value in
each box means:</p><ul><li>✅: supported</li><li>❌: not
supported</li><li>(blank) no
data</li></ul><p>Implementations:</p><ul><li><code>C++</code>: <a
href=https://github.com/apache/arrow/tree/main/cpp/src/parquet>parquet-cpp</a></li><li><code>Java</code>:
<a
href=https://github.com/apache/parquet-java>parquet-java</a></li><li><code>Go
[...]
+implementations.</p><p><em>Note</em>: This is a work in progress and we would
welcome help expanding its scope.</p><h3 id=legend>Legend</h3><p>The value in
each box means:</p><ul><li>✅: supported</li><li>❌: not supported</li><li>(R/W):
partial reader/writer only support</li><li>(blank) no
data</li></ul><p>Implementations:</p><ul><li><code>C++</code>: <a
href=https://github.com/apache/arrow/tree/main/cpp/src/parquet>parquet-cpp</a></li><li><code>Java</code>:
<a href=https://github.com/apa [...]
Java resources can be build using <code>mvn package</code>. The current stable
version should always be available from Maven Central.</p><p>C++ thrift
resources can be generated via make.</p><p>Thrift can be also code-genned into
any other thrift-supported language.</p></div><div class=td-content
style=page-break-before:always><h1 id=pg-47cac26307c77b16f1b9e75c1e46efec>4.3 -
Contributing to Parquet-Java</h1><div class=lead>How to contribute to
Parquet-Java</div><h2 id=pull-requests>Pull [...]
</span></span></span><span style=display:flex><span><span
style=color:#8f5902;font-style:italic> * @param c the current class
</span></span></span><span style=display:flex><span><span
style=color:#8f5902;font-style:italic> * @return the corresponding logger
diff --git a/output/docs/file-format/_print/index.html
b/output/docs/file-format/_print/index.html
index 60c08f9..4751301 100644
--- a/output/docs/file-format/_print/index.html
+++ b/output/docs/file-format/_print/index.html
@@ -807,6 +807,6 @@ indices, and page offsets to scan in each column. The
reader can then
initialize a scanner for each column and fast forward them to the start row of
the scan.</p><p>The <code>min_values</code> and <code>max_values</code> are
calculated based on the <code>column_orders</code>
field in the <code>FileMetaData</code> struct of the footer.</p></div><div
class=td-content style=page-break-before:always><h1
id=pg-e0ad5830788d45de8b55e0c2b119349a>10 - Implementation status</h1><p>This
page summarizes the features supported by different Parquet
-implementations.</p><p><em>Note</em>: This is a work in progress and we would
welcome help expanding its scope.</p><h3 id=legend>Legend</h3><p>The value in
each box means:</p><ul><li>✅: supported</li><li>❌: not
supported</li><li>(blank) no
data</li></ul><p>Implementations:</p><ul><li><code>C++</code>: <a
href=https://github.com/apache/arrow/tree/main/cpp/src/parquet>parquet-cpp</a></li><li><code>Java</code>:
<a
href=https://github.com/apache/parquet-java>parquet-java</a></li><li><code>Go
[...]
+implementations.</p><p><em>Note</em>: This is a work in progress and we would
welcome help expanding its scope.</p><h3 id=legend>Legend</h3><p>The value in
each box means:</p><ul><li>✅: supported</li><li>❌: not supported</li><li>(R/W):
partial reader/writer only support</li><li>(blank) no
data</li></ul><p>Implementations:</p><ul><li><code>C++</code>: <a
href=https://github.com/apache/arrow/tree/main/cpp/src/parquet>parquet-cpp</a></li><li><code>Java</code>:
<a href=https://github.com/apa [...]
2025
<span class=td-footer__authors>Apache Parquet</span></span><span
class=td-footer__all_rights_reserved>All Rights Reserved</span><span
class=ms-2><a href=https://policies.google.com/privacy target=_blank
rel=noopener>Privacy Policy</a></span></div></div></div></footer></div><script
src=/js/main.min.26b35480299b932e285af8358c943de97509b95a0086d091584e7cb9b00c5c7b.js
integrity="sha256-JrNUgCmbky4oWvg1jJQ96XUJuVoAhtCRWE58ubAMXHs="
crossorigin=anonymous></script><script defer src=/js/click-to [...]
\ No newline at end of file
diff --git a/output/docs/file-format/implementationstatus/index.html
b/output/docs/file-format/implementationstatus/index.html
index 9e358a5..4d4a4b9 100644
--- a/output/docs/file-format/implementationstatus/index.html
+++ b/output/docs/file-format/implementationstatus/index.html
@@ -2,24 +2,24 @@
<meta name=description content="This page summarizes the features supported by
different Parquet implementations.
Note: This is a work in progress and we would welcome help expanding its scope.
Legend The value in each box means:
-✅: supported ❌: not supported (blank) no data Implementations:
-C++: parquet-cpp Java: parquet-java Go: parquet-go Rust: parquet-rs Physical
types Data type C++ Java Go Rust BOOLEAN INT32 INT64 INT96 (1) FLOAT DOUBLE
BYTE_ARRAY FIXED_LEN_BYTE_ARRAY (1) This type is deprecated, but as of 2024
it&rsquo;s common in currently produced parquet files Logical types Data
type C++ Java Go Rust STRING ENUM UUID 8, 16, 32, 64 bit signed and unsigned
INT DECIMAL (INT32) DECIMAL (INT64) DECIMAL (BYTE_ARRAY) DECIMAL
(FIXED_LEN_BYTE_ARRAY) DATE TIME (INT32) TIM [...]
+✅: supported ❌: not supported (R/W): partial reader/writer only support
(blank) no data Implementations:
+C++: parquet-cpp Java: parquet-java Go: parquet-go Rust: parquet-rs cuDF: cudf
Physical types Data type C++ Java Go Rust cuDF BOOLEAN ✅ INT32 ✅ INT64 ✅ INT96
(1) ✅ FLOAT ✅ DOUBLE ✅ BYTE_ARRAY ✅ FIXED_LEN_BYTE_ARRAY ✅ (1) This type is
deprecated, but as of 2024 it&rsquo;s common in currently produced parquet
files Logical types Data type C++ Java Go Rust cuDF STRING ✅ ENUM ❌ UUID ❌ 8,
16, 32, 64 bit signed and unsigned INT ✅ DECIMAL (INT32) ✅ DECIMAL (INT64) ✅
DECIMAL (BYTE_ARRAY) ✅ D [...]
Note: This is a work in progress and we would welcome help expanding its scope.
Legend The value in each box means:
-✅: supported ❌: not supported (blank) no data Implementations:
-C++: parquet-cpp Java: parquet-java Go: parquet-go Rust: parquet-rs Physical
types Data type C++ Java Go Rust BOOLEAN INT32 INT64 INT96 (1) FLOAT DOUBLE
BYTE_ARRAY FIXED_LEN_BYTE_ARRAY (1) This type is deprecated, but as of 2024
it’s common in currently produced parquet files Logical types Data type
C++ Java Go Rust STRING ENUM UUID 8, 16, 32, 64 bit signed and unsigned INT
DECIMAL (INT32) DECIMAL (INT64) DECIMAL (BYTE_ARRAY) DECIMAL
(FIXED_LEN_BYTE_ARRAY) DATE TIME (INT32) TIME (I [...]
+✅: supported ❌: not supported (R/W): partial reader/writer only support
(blank) no data Implementations:
+C++: parquet-cpp Java: parquet-java Go: parquet-go Rust: parquet-rs cuDF: cudf
Physical types Data type C++ Java Go Rust cuDF BOOLEAN ✅ INT32 ✅ INT64 ✅ INT96
(1) ✅ FLOAT ✅ DOUBLE ✅ BYTE_ARRAY ✅ FIXED_LEN_BYTE_ARRAY ✅ (1) This type is
deprecated, but as of 2024 it’s common in currently produced parquet
files Logical types Data type C++ Java Go Rust cuDF STRING ✅ ENUM ❌ UUID ❌ 8,
16, 32, 64 bit signed and unsigned INT ✅ DECIMAL (INT32) ✅ DECIMAL (INT64) ✅
DECIMAL (BYTE_ARRAY) ✅ DECIM [...]
Note: This is a work in progress and we would welcome help expanding its scope.
Legend The value in each box means:
-✅: supported ❌: not supported (blank) no data Implementations:
-C++: parquet-cpp Java: parquet-java Go: parquet-go Rust: parquet-rs Physical
types Data type C++ Java Go Rust BOOLEAN INT32 INT64 INT96 (1) FLOAT DOUBLE
BYTE_ARRAY FIXED_LEN_BYTE_ARRAY (1) This type is deprecated, but as of 2024
it’s common in currently produced parquet files Logical types Data type
C++ Java Go Rust STRING ENUM UUID 8, 16, 32, 64 bit signed and unsigned INT
DECIMAL (INT32) DECIMAL (INT64) DECIMAL (BYTE_ARRAY) DECIMAL
(FIXED_LEN_BYTE_ARRAY) DATE TIME (INT32) TIME (I [...]
+✅: supported ❌: not supported (R/W): partial reader/writer only support
(blank) no data Implementations:
+C++: parquet-cpp Java: parquet-java Go: parquet-go Rust: parquet-rs cuDF: cudf
Physical types Data type C++ Java Go Rust cuDF BOOLEAN ✅ INT32 ✅ INT64 ✅ INT96
(1) ✅ FLOAT ✅ DOUBLE ✅ BYTE_ARRAY ✅ FIXED_LEN_BYTE_ARRAY ✅ (1) This type is
deprecated, but as of 2024 it’s common in currently produced parquet
files Logical types Data type C++ Java Go Rust cuDF STRING ✅ ENUM ❌ UUID ❌ 8,
16, 32, 64 bit signed and unsigned INT ✅ DECIMAL (INT32) ✅ DECIMAL (INT64) ✅
DECIMAL (BYTE_ARRAY) ✅ DECIM [...]
Note: This is a work in progress and we would welcome help expanding its scope.
Legend The value in each box means:
-✅: supported ❌: not supported (blank) no data Implementations:
-C++: parquet-cpp Java: parquet-java Go: parquet-go Rust: parquet-rs Physical
types Data type C++ Java Go Rust BOOLEAN INT32 INT64 INT96 (1) FLOAT DOUBLE
BYTE_ARRAY FIXED_LEN_BYTE_ARRAY (1) This type is deprecated, but as of 2024
it’s common in currently produced parquet files Logical types Data type
C++ Java Go Rust STRING ENUM UUID 8, 16, 32, 64 bit signed and unsigned INT
DECIMAL (INT32) DECIMAL (INT64) DECIMAL (BYTE_ARRAY) DECIMAL
(FIXED_LEN_BYTE_ARRAY) DATE TIME (INT32) TIME (I [...]
+✅: supported ❌: not supported (R/W): partial reader/writer only support
(blank) no data Implementations:
+C++: parquet-cpp Java: parquet-java Go: parquet-go Rust: parquet-rs cuDF: cudf
Physical types Data type C++ Java Go Rust cuDF BOOLEAN ✅ INT32 ✅ INT64 ✅ INT96
(1) ✅ FLOAT ✅ DOUBLE ✅ BYTE_ARRAY ✅ FIXED_LEN_BYTE_ARRAY ✅ (1) This type is
deprecated, but as of 2024 it’s common in currently produced parquet
files Logical types Data type C++ Java Go Rust cuDF STRING ✅ ENUM ❌ UUID ❌ 8,
16, 32, 64 bit signed and unsigned INT ✅ DECIMAL (INT32) ✅ DECIMAL (INT64) ✅
DECIMAL (BYTE_ARRAY) ✅ DECIM [...]
<a
href=https://github.com/apache/parquet-site/edit/production/content/en/docs/File%20Format/implementationstatus.md
class="td-page-meta--edit td-page-meta__edit" target=_blank rel=noopener><i
class="fa-solid fa-pen-to-square fa-fw"></i> Edit this page</a>
<a
href="https://github.com/apache/parquet-site/new/production/content/en/docs/File%20Format?filename=change-me.md&value=---%0Atitle%3A+%22Long+Page+Title%22%0AlinkTitle%3A+%22Short+Nav+Title%22%0Aweight%3A+100%0Adescription%3A+%3E-%0A+++++Page+description+for+heading+and+indexes.%0A---%0A%0A%23%23+Heading%0A%0AEdit+this+template+to+create+your+new+page.%0A%0A%2A+Give+it+a+good+name%2C+ending+in+%60.md%60+-+e.g.+%60getting-started.md%60%0A%2A+Edit+the+%22front+matter%22+section+at+th
[...]
<a
href="https://github.com/apache/parquet-site/issues/new?title=Implementation%20status"
class="td-page-meta--issue td-page-meta__issue" target=_blank rel=noopener><i
class="fa-solid fa-list-check fa-fw"></i> Create documentation issue</a>
<a id=print href=/docs/file-format/_print/><i class="fa-solid fa-print
fa-fw"></i> Print entire section</a></div><div class=td-toc><nav
id=TableOfContents><ul><li><ul><li><a href=#legend>Legend</a></li><li><a
href=#physical-types>Physical types</a></li><li><a href=#logical-types>Logical
types</a></li><li><a href=#encodings>Encodings</a></li><li><a
href=#compressions>Compressions</a></li><li><a
href=#other-format-level-features>Other format level features</a></li><li><a
href=#high-level-d [...]
-implementations.</p><p><em>Note</em>: This is a work in progress and we would
welcome help expanding its scope.</p><h3 id=legend>Legend</h3><p>The value in
each box means:</p><ul><li>✅: supported</li><li>❌: not
supported</li><li>(blank) no
data</li></ul><p>Implementations:</p><ul><li><code>C++</code>: <a
href=https://github.com/apache/arrow/tree/main/cpp/src/parquet>parquet-cpp</a></li><li><code>Java</code>:
<a
href=https://github.com/apache/parquet-java>parquet-java</a></li><li><code>Go
[...]
+implementations.</p><p><em>Note</em>: This is a work in progress and we would
welcome help expanding its scope.</p><h3 id=legend>Legend</h3><p>The value in
each box means:</p><ul><li>✅: supported</li><li>❌: not supported</li><li>(R/W):
partial reader/writer only support</li><li>(blank) no
data</li></ul><p>Implementations:</p><ul><li><code>C++</code>: <a
href=https://github.com/apache/arrow/tree/main/cpp/src/parquet>parquet-cpp</a></li><li><code>Java</code>:
<a href=https://github.com/apa [...]
2025
<span class=td-footer__authors>Apache Parquet</span></span><span
class=td-footer__all_rights_reserved>All Rights Reserved</span><span
class=ms-2><a href=https://policies.google.com/privacy target=_blank
rel=noopener>Privacy Policy</a></span></div></div></div></footer></div><script
src=/js/main.min.26b35480299b932e285af8358c943de97509b95a0086d091584e7cb9b00c5c7b.js
integrity="sha256-JrNUgCmbky4oWvg1jJQ96XUJuVoAhtCRWE58ubAMXHs="
crossorigin=anonymous></script><script defer src=/js/click-to [...]
\ No newline at end of file
diff --git a/output/docs/file-format/index.xml
b/output/docs/file-format/index.xml
index 31adf44..93b6ebf 100644
--- a/output/docs/file-format/index.xml
+++ b/output/docs/file-format/index.xml
@@ -451,6 +451,7 @@ implementations.</p>
<ul>
<li>✅: supported</li>
<li>❌: not supported</li>
+<li>(R/W): partial reader/writer only support</li>
<li>(blank) no data</li>
</ul>
<p>Implementations:</p>
@@ -459,6 +460,7 @@ implementations.</p>
<li><code>Java</code>: <a
href="https://github.com/apache/parquet-java">parquet-java</a></li>
<li><code>Go</code>: <a
href="https://github.com/apache/arrow-go/tree/main/parquet">parquet-go</a></li>
<li><code>Rust</code>: <a
href="https://github.com/apache/arrow-rs/blob/main/parquet/README.md">parquet-rs</a></li>
+<li><code>cuDF</code>: <a
href="https://github.com/rapidsai/cudf">cudf</a></li>
</ul>
<h3 id="physical-types">Physical types</h3>
<table>
@@ -469,6 +471,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -478,6 +481,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>INT32</td>
@@ -485,6 +489,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>INT64</td>
@@ -492,6 +497,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>INT96 (1)</td>
@@ -499,6 +505,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>FLOAT</td>
@@ -506,6 +513,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DOUBLE</td>
@@ -513,6 +521,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>BYTE_ARRAY</td>
@@ -520,6 +529,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>FIXED_LEN_BYTE_ARRAY</td>
@@ -527,6 +537,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -542,6 +553,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -551,6 +563,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>ENUM</td>
@@ -558,6 +571,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>UUID</td>
@@ -565,6 +579,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>8, 16, 32, 64 bit signed and unsigned INT</td>
@@ -572,6 +587,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DECIMAL (INT32)</td>
@@ -579,6 +595,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DECIMAL (INT64)</td>
@@ -586,6 +603,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DECIMAL (BYTE_ARRAY)</td>
@@ -593,6 +611,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DECIMAL (FIXED_LEN_BYTE_ARRAY)</td>
@@ -600,6 +619,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DATE</td>
@@ -607,6 +627,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>TIME (INT32)</td>
@@ -614,6 +635,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>TIME (INT64)</td>
@@ -621,6 +643,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>TIMESTAMP (INT64)</td>
@@ -628,6 +651,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>INTERVAL</td>
@@ -635,6 +659,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>JSON</td>
@@ -642,6 +667,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>BSON</td>
@@ -649,6 +675,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>LIST</td>
@@ -656,6 +683,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>MAP</td>
@@ -663,6 +691,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>UNKNOWN (always null)</td>
@@ -670,6 +699,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>FLOAT16</td>
@@ -677,6 +707,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -689,6 +720,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -698,6 +730,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>PLAIN_DICTIONARY</td>
@@ -705,6 +738,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>RLE_DICTIONARY</td>
@@ -712,6 +746,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>RLE</td>
@@ -719,6 +754,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>BIT_PACKED (deprecated)</td>
@@ -726,6 +762,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>DELTA_BINARY_PACKED</td>
@@ -733,6 +770,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DELTA_LENGTH_BYTE_ARRAY</td>
@@ -740,6 +778,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DELTA_BYTE_ARRAY</td>
@@ -747,6 +786,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>BYTE_STREAM_SPLIT</td>
@@ -754,6 +794,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -766,6 +807,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -775,6 +817,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>BROTLI</td>
@@ -782,6 +825,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>GZIP</td>
@@ -789,6 +833,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>LZ4 (deprecated)</td>
@@ -796,6 +841,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>LZ4_RAW</td>
@@ -803,6 +849,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>LZO</td>
@@ -810,6 +857,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>SNAPPY</td>
@@ -817,6 +865,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>ZSTD</td>
@@ -824,6 +873,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -836,15 +886,17 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
<tr>
-<td>xxxHash-based bloom filters</td>
+<td>xxHash-based bloom filters</td>
<td></td>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>Bloom filter length (1)</td>
@@ -852,6 +904,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>Statistics min_value, max_value</td>
@@ -859,6 +912,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>Page index</td>
@@ -866,6 +920,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>Page CRC32 checksum</td>
@@ -873,6 +928,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>Modular encryption</td>
@@ -880,6 +936,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>Size statistics (2)</td>
@@ -887,6 +944,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -907,6 +965,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -916,6 +975,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(W)</td>
</tr>
<tr>
<td>Row group &ldquo;Sorting column&rdquo; metadata (2)</td>
@@ -923,6 +983,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(W)</td>
</tr>
<tr>
<td>Row group pruning using statistics</td>
@@ -930,27 +991,31 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
-<td>Reading select columns only</td>
+<td>Row group pruning using bloom filter</td>
<td></td>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
-<td>Page pruning using statistics</td>
+<td>Reading select columns only</td>
<td></td>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
-<td>Page pruning using bloom filter</td>
+<td>Page pruning using statistics</td>
<td></td>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
</tbody>
</table>
diff --git a/output/docs/index.xml b/output/docs/index.xml
index 07136c9..2b9b45d 100644
--- a/output/docs/index.xml
+++ b/output/docs/index.xml
@@ -1497,6 +1497,7 @@ implementations.</p>
<ul>
<li>✅: supported</li>
<li>❌: not supported</li>
+<li>(R/W): partial reader/writer only support</li>
<li>(blank) no data</li>
</ul>
<p>Implementations:</p>
@@ -1505,6 +1506,7 @@ implementations.</p>
<li><code>Java</code>: <a
href="https://github.com/apache/parquet-java">parquet-java</a></li>
<li><code>Go</code>: <a
href="https://github.com/apache/arrow-go/tree/main/parquet">parquet-go</a></li>
<li><code>Rust</code>: <a
href="https://github.com/apache/arrow-rs/blob/main/parquet/README.md">parquet-rs</a></li>
+<li><code>cuDF</code>: <a
href="https://github.com/rapidsai/cudf">cudf</a></li>
</ul>
<h3 id="physical-types">Physical types</h3>
<table>
@@ -1515,6 +1517,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -1524,6 +1527,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>INT32</td>
@@ -1531,6 +1535,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>INT64</td>
@@ -1538,6 +1543,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>INT96 (1)</td>
@@ -1545,6 +1551,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>FLOAT</td>
@@ -1552,6 +1559,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DOUBLE</td>
@@ -1559,6 +1567,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>BYTE_ARRAY</td>
@@ -1566,6 +1575,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>FIXED_LEN_BYTE_ARRAY</td>
@@ -1573,6 +1583,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -1588,6 +1599,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -1597,6 +1609,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>ENUM</td>
@@ -1604,6 +1617,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>UUID</td>
@@ -1611,6 +1625,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>8, 16, 32, 64 bit signed and unsigned INT</td>
@@ -1618,6 +1633,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DECIMAL (INT32)</td>
@@ -1625,6 +1641,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DECIMAL (INT64)</td>
@@ -1632,6 +1649,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DECIMAL (BYTE_ARRAY)</td>
@@ -1639,6 +1657,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DECIMAL (FIXED_LEN_BYTE_ARRAY)</td>
@@ -1646,6 +1665,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DATE</td>
@@ -1653,6 +1673,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>TIME (INT32)</td>
@@ -1660,6 +1681,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>TIME (INT64)</td>
@@ -1667,6 +1689,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>TIMESTAMP (INT64)</td>
@@ -1674,6 +1697,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>INTERVAL</td>
@@ -1681,6 +1705,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>JSON</td>
@@ -1688,6 +1713,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>BSON</td>
@@ -1695,6 +1721,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>LIST</td>
@@ -1702,6 +1729,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>MAP</td>
@@ -1709,6 +1737,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>UNKNOWN (always null)</td>
@@ -1716,6 +1745,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>FLOAT16</td>
@@ -1723,6 +1753,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -1735,6 +1766,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -1744,6 +1776,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>PLAIN_DICTIONARY</td>
@@ -1751,6 +1784,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>RLE_DICTIONARY</td>
@@ -1758,6 +1792,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>RLE</td>
@@ -1765,6 +1800,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>BIT_PACKED (deprecated)</td>
@@ -1772,6 +1808,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>DELTA_BINARY_PACKED</td>
@@ -1779,6 +1816,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DELTA_LENGTH_BYTE_ARRAY</td>
@@ -1786,6 +1824,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>DELTA_BYTE_ARRAY</td>
@@ -1793,6 +1832,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>BYTE_STREAM_SPLIT</td>
@@ -1800,6 +1840,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -1812,6 +1853,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -1821,6 +1863,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>BROTLI</td>
@@ -1828,6 +1871,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>GZIP</td>
@@ -1835,6 +1879,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>LZ4 (deprecated)</td>
@@ -1842,6 +1887,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>LZ4_RAW</td>
@@ -1849,6 +1895,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>LZO</td>
@@ -1856,6 +1903,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>SNAPPY</td>
@@ -1863,6 +1911,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>ZSTD</td>
@@ -1870,6 +1919,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -1882,15 +1932,17 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
<tr>
-<td>xxxHash-based bloom filters</td>
+<td>xxHash-based bloom filters</td>
<td></td>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>Bloom filter length (1)</td>
@@ -1898,6 +1950,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(R)</td>
</tr>
<tr>
<td>Statistics min_value, max_value</td>
@@ -1905,6 +1958,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>Page index</td>
@@ -1912,6 +1966,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
<td>Page CRC32 checksum</td>
@@ -1919,6 +1974,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>Modular encryption</td>
@@ -1926,6 +1982,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
<tr>
<td>Size statistics (2)</td>
@@ -1933,6 +1990,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
</tbody>
</table>
@@ -1953,6 +2011,7 @@ implementations.</p>
<th>Java</th>
<th>Go</th>
<th>Rust</th>
+<th>cuDF</th>
</tr>
</thead>
<tbody>
@@ -1962,6 +2021,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(W)</td>
</tr>
<tr>
<td>Row group &ldquo;Sorting column&rdquo; metadata (2)</td>
@@ -1969,6 +2029,7 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>(W)</td>
</tr>
<tr>
<td>Row group pruning using statistics</td>
@@ -1976,27 +2037,31 @@ implementations.</p>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
-<td>Reading select columns only</td>
+<td>Row group pruning using bloom filter</td>
<td></td>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
-<td>Page pruning using statistics</td>
+<td>Reading select columns only</td>
<td></td>
<td></td>
<td></td>
<td></td>
+<td>✅</td>
</tr>
<tr>
-<td>Page pruning using bloom filter</td>
+<td>Page pruning using statistics</td>
<td></td>
<td></td>
<td></td>
<td></td>
+<td>❌</td>
</tr>
</tbody>
</table>
diff --git a/output/sitemap.xml b/output/sitemap.xml
index f04f8de..6eb6b16 100644
--- a/output/sitemap.xml
+++ b/output/sitemap.xml
@@ -1 +1 @@
-<?xml version="1.0" encoding="utf-8" standalone="yes"?><urlset
xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
xmlns:xhtml="http://www.w3.org/1999/xhtml"><url><loc>/docs/file-format/data-pages/compression/</loc><lastmod>2024-03-11T22:11:10+01:00</lastmod></url><url><loc>/docs/file-format/data-pages/encodings/</loc><lastmod>2024-03-11T22:11:10+01:00</lastmod></url><url><loc>/docs/file-format/data-pages/encryption/</loc><lastmod>2024-03-11T22:11:10+01:00</lastmod></url><url><loc>/docs/
[...]
\ No newline at end of file
+<?xml version="1.0" encoding="utf-8" standalone="yes"?><urlset
xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
xmlns:xhtml="http://www.w3.org/1999/xhtml"><url><loc>/docs/file-format/data-pages/compression/</loc><lastmod>2024-03-11T22:11:10+01:00</lastmod></url><url><loc>/docs/file-format/data-pages/encodings/</loc><lastmod>2024-03-11T22:11:10+01:00</lastmod></url><url><loc>/docs/file-format/data-pages/encryption/</loc><lastmod>2024-03-11T22:11:10+01:00</lastmod></url><url><loc>/docs/
[...]
\ No newline at end of file