This is an automated email from the ASF dual-hosted git repository.
dongjoon pushed a commit to branch branch-1.8
in repository https://gitbox.apache.org/repos/asf/orc.git
The following commit(s) were added to refs/heads/branch-1.8 by this push:
new a35383ee0 ORC-1404: Bump `parquet` to 1.13.0
a35383ee0 is described below
commit a35383ee0f79adf3cbf10719e4c16a4357ce4267
Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
AuthorDate: Mon Apr 10 09:55:32 2023 -0700
ORC-1404: Bump `parquet` to 1.13.0
Bumps `parquet.version` from 1.12.3 to 1.13.0.
Updates `parquet-hadoop` from 1.12.3 to 1.13.0
<details>
<summary>Changelog</summary>
<p><em>Sourced from <a
href="https://github.com/apache/parquet-mr/blob/master/CHANGES.md">parquet-hadoop's
changelog</a>.</em></p>
<blockquote>
<h3>Version 1.13.0</h3>
<p>Release Notes - Parquet - Version 1.13.0</p>
<h4>New Feature</h4>
<ul>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-1020">PARQUET-1020</a> -
Add support for Dynamic Messages in parquet-protobuf</li>
</ul>
<h4>Task</h4>
<ul>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2230">PARQUET-2230</a> -
Add a new rewrite command powered by ParquetRewriter</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2228">PARQUET-2228</a> -
ParquetRewriter supports more than one input file</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2229">PARQUET-2229</a> -
ParquetRewriter supports masking and encrypting the same column</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2227">PARQUET-2227</a> -
Refactor different file rewriters to use single implementation</li>
</ul>
<h4>Improvement</h4>
<ul>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2258">PARQUET-2258</a> -
Storing toString fields in FilterPredicate instances can lead to memory
pressure</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2252">PARQUET-2252</a> -
Make some methods public to allow external projects to implement page
skipping</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2159">PARQUET-2159</a> -
Vectorized BytePacker decoder using Java VectorAPI</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2246">PARQUET-2246</a> -
Add short circuit logic to column index filter</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2226">PARQUET-2226</a> -
Support merge Bloom Filters</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2224">PARQUET-2224</a> -
Publish SBOM artifacts</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2208">PARQUET-2208</a> -
Add details to nested column encryption config doc and exception text</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2195">PARQUET-2195</a> -
Add scan command to parquet-cli</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2196">PARQUET-2196</a> -
Support LZ4_RAW codec</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2176">PARQUET-2176</a> -
Column index/statistics truncation in ParquetWriter</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2197">PARQUET-2197</a> -
Document uniform encryption</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2191">PARQUET-2191</a> -
Upgrade Scala to 2.12.17</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2169">PARQUET-2169</a> -
Upgrade Avro to version 1.11.1</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2155">PARQUET-2155</a> -
Upgrade protobuf version to 3.17.3</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2158">PARQUET-2158</a> -
Upgrade Hadoop dependency to version 3.2.0</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2138">PARQUET-2138</a> -
Add ShowBloomFilterCommand to parquet-cli</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2157">PARQUET-2157</a> -
Add BloomFilter fpp config</li>
</ul>
<h4>Bug</h4>
<ul>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2202">PARQUET-2202</a> -
Redundant String allocation on the hot path in
CapacityByteArrayOutputStream.setByte</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2164">PARQUET-2164</a> -
CapacityByteArrayOutputStream overflow while writing causes negative row group
sizes to be written</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2103">PARQUET-2103</a> -
Fix crypto exception in print toPrettyJSON</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2251">PARQUET-2251</a> -
Avoid generating Bloomfilter when all pages of a column are encoded by
dictionary</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2243">PARQUET-2243</a> -
Support zstd-jni in DirectCodecFactory</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2247">PARQUET-2247</a> -
Fail-fast if CapacityByteArrayOutputStream write overflow</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2241">PARQUET-2241</a> -
Fix ByteStreamSplitValuesReader with nulls</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2244">PARQUET-2244</a> -
Fix notIn for columns with null values</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2173">PARQUET-2173</a> -
Fix parquet build against hadoop 3.3.3+</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2219">PARQUET-2219</a> -
ParquetFileReader skips empty row group</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2198">PARQUET-2198</a> -
Updating jackson data bind version to fix CVEs</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-2177">PARQUET-2177</a> -
Fix parquet-cli not to fail showing descriptions</li>
<li><a
href="https://issues.apache.org/jira/browse/PARQUET-1711">PARQUET-1711</a> -
Support recursive proto schemas by limiting recursion depth</li>
</ul>
</blockquote>
<p>... (truncated)</p>
</details>
<details>
<summary>Commits</summary>
<ul>
<li><a
href="https://github.com/apache/parquet-mr/commit/2e369ed173f66f057c296e63c1bc31d77f294f41"><code>2e369ed</code></a>
[maven-release-plugin] prepare release apache-parquet-1.13.0-rc0</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/a0955619dc32f9c5218793c008e87d1a1dc7407f"><code>a095561</code></a>
Update CHANGES.md for 1.13.0 release</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/8cbbaefd83eec126b6bb2388f038c7b6ca6cf435"><code>8cbbaef</code></a>
PARQUET-2262: Fix local build failure due to missing surefire.argLine (<a
href="https://redirect.github.com/apache/parquet-mr/issues/1045">#1045</a>)</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/1235003e742e6a76bf6cb8f7ed33e942fa12d0d5"><code>1235003</code></a>
PARQUET-2258: Storing toString fields in FilterPredicate instances can lead
t...</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/d38044f5395494e1543581a4b763f624305d3022"><code>d38044f</code></a>
PARQUET-2202: Review usage and implementation of Preconditions.checkargument
...</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/72442b5c5cbf4bf20258dd22432a5c39081193fc"><code>72442b5</code></a>
PARQUET-2164: Check size of buffered data to prevent page data from
overflowi...</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/cc145b3f4d82c7d4c67d45907163990c4bf084d6"><code>cc145b3</code></a>
PARQUET-2252: Make row range methods public (<a
href="https://redirect.github.com/apache/parquet-mr/issues/1038">#1038</a>)</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/d730fa7466f9fa15e1e4a4254f22af9c6783acdc"><code>d730fa7</code></a>
PARQUET-2159: Vectorized BytePacker decoder using Java VectorAPI (<a
href="https://redirect.github.com/apache/parquet-mr/issues/1011">#1011</a>)</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/43ce92862d0a1371f55a7ee663de5b08703d502a"><code>43ce928</code></a>
PARQUET-2230: [CLI] Deprecate commands replaced by rewrite</li>
<li><a
href="https://github.com/apache/parquet-mr/commit/40bffcfddb929a8e8470a176e078c0782808534c"><code>40bffcf</code></a>
Add Gang Wu as committer (<a
href="https://redirect.github.com/apache/parquet-mr/issues/1037">#1037</a>)</li>
<li>Additional commits viewable in <a
href="https://github.com/apache/parquet-mr/compare/apache-parquet-1.12.3...apache-parquet-1.13.0">compare
view</a></li>
</ul>
</details>
<br />
Updates `parquet-avro` from 1.12.3 to 1.13.0
<br />
Closes #1462 from dependabot[bot]/dependabot/maven/java/parquet.version-1.13.0.
Authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Signed-off-by: Dongjoon Hyun <[email protected]>
(cherry picked from commit 8e113db5b6720a810dfbb8ed8db251254d7e0990)
Signed-off-by: Dongjoon Hyun <[email protected]>
---
java/bench/pom.xml | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/java/bench/pom.xml b/java/bench/pom.xml
index 5a3c74fbc..f825c7fee 100644
--- a/java/bench/pom.xml
+++ b/java/bench/pom.xml
@@ -39,7 +39,7 @@
  <jmh.version>1.20</jmh.version>
  <junit.version>5.8.2</junit.version>
  <orc.version>${project.version}</orc.version>
- <parquet.version>1.12.3</parquet.version>
+ <parquet.version>1.13.0</parquet.version>
  <spark.version>3.3.2</spark.version>
  </properties>