dependabot[bot] opened a new pull request, #3750: URL: https://github.com/apache/arrow-adbc/pull/3750
Bumps the arrow-datafusion group with 6 updates in the /rust directory: | Package | From | To | | --- | --- | --- | | [arrow-array](https://github.com/apache/arrow-rs) | `56.2.0` | `57.0.0` | | [arrow-buffer](https://github.com/apache/arrow-rs) | `56.2.0` | `57.0.0` | | [arrow-schema](https://github.com/apache/arrow-rs) | `56.2.0` | `57.0.0` | | [arrow-select](https://github.com/apache/arrow-rs) | `56.2.0` | `57.0.0` | | [datafusion](https://github.com/apache/datafusion) | `50.3.0` | `51.0.0` | | [datafusion-substrait](https://github.com/apache/datafusion) | `50.3.0` | `51.0.0` | Updates `arrow-array` from 56.2.0 to 57.0.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/releases">arrow-array's releases</a>.</em></p> <blockquote> <h2>arrow 57.0.0</h2> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/57.0.0">57.0.0</a> (2025-10-19)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/56.2.0...57.0.0">Full Changelog</a></p> <p><strong>Breaking changes:</strong></p> <ul> <li>Use <code>Arc<FileEncryptionProperties></code> everywhere to be be consistent with <code>FileDecryptionProperties</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8626">#8626</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>feat: Improve DataType display for <code>RunEndEncoded</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8596">#8596</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Add <code>ArrowError::AvroError</code>, remaining types and roundtrip tests to <code>arrow-avro</code>, <a href="https://redirect.github.com/apache/arrow-rs/pull/8595">#8595</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/jecsand838">jecsand838</a>)</li> <li>[thrift-remodel] Refactor Thrift encryption and store encodings as bitmask <a href="https://redirect.github.com/apache/arrow-rs/pull/8587">#8587</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>feat: Enhance <code>Map</code> display formatting in DataType <a href="https://redirect.github.com/apache/arrow-rs/pull/8570">#8570</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>feat: Enhance DataType display formatting for <code>ListView</code> and <code>LargeListView</code> variants <a href="https://redirect.github.com/apache/arrow-rs/pull/8569">#8569</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Use custom thrift parser for parquet metadata (phase 1 of Thrift remodel) <a href="https://redirect.github.com/apache/arrow-rs/pull/8530">#8530</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>refactor: improve display formatting for Union <a href="https://redirect.github.com/apache/arrow-rs/pull/8529">#8529</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Use <code>Arc<FileDecryptionProperties></code> to reduce size of ParquetMetadata and avoid copying when <code>encryption</code> is enabled <a href="https://redirect.github.com/apache/arrow-rs/pull/8470">#8470</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>Fix for column name based projection mask creation <a href="https://redirect.github.com/apache/arrow-rs/pull/8447">#8447</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>Improve Display formatting of DataType::Timestamp <a href="https://redirect.github.com/apache/arrow-rs/pull/8425">#8425</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Use more compact Debug formatting of Field <a href="https://redirect.github.com/apache/arrow-rs/pull/8424">#8424</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Reuse zstd compression context when writing IPC <a href="https://redirect.github.com/apache/arrow-rs/pull/8405">#8405</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow-flight">arrow-flight</a>] (<a href="https://github.com/albertlockett">albertlockett</a>)</li> <li>[Decimal] Add scale argument to validation functions to ensure accurate error logging <a href="https://redirect.github.com/apache/arrow-rs/pull/8396">#8396</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Quote <code>DataType::Struct</code> field names in <code>Display</code> formatting <a href="https://redirect.github.com/apache/arrow-rs/pull/8291">#8291</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Improve <code>Display</code> for <code>DataType</code> and <code>Field</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8290">#8290</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Bump pyo3 to 0.26.0 <a href="https://redirect.github.com/apache/arrow-rs/pull/8286">#8286</a> (<a href="https://github.com/mbrobbel">mbrobbel</a>)</li> </ul> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Added Avro support (new <code>arrow-avro</code> crate) <a href="https://redirect.github.com/apache/arrow-rs/issues/4886">#4886</a></li> <li>parquet-rewrite: supports compression level and write batch size <a href="https://redirect.github.com/apache/arrow-rs/issues/8639">#8639</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/blob/main/CHANGELOG.md">arrow-array's changelog</a>.</em></p> <blockquote> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/57.1.0">57.1.0</a> (2025-11-20)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/57.0.0...57.1.0">Full Changelog</a></p> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Eliminate bound checks in filter kernels <a href="https://redirect.github.com/apache/arrow-rs/issues/8865">#8865</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Respect page index policy option for ParquetObjectReader when it's not skip <a href="https://redirect.github.com/apache/arrow-rs/issues/8856">#8856</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Speed up collect_bool and remove <code>unsafe</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/8848">#8848</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Error reading parquet FileMetaData with empty lists encoded as element-type=0 <a href="https://redirect.github.com/apache/arrow-rs/issues/8826">#8826</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>ValueStatistics methods can't be used from generic context in external crate <a href="https://redirect.github.com/apache/arrow-rs/issues/8823">#8823</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Custom Pretty-Printing Implementation for Column when Formatting Record Batches <a href="https://redirect.github.com/apache/arrow-rs/issues/8821">#8821</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Parquet-concat: supports bloom filter and page index <a href="https://redirect.github.com/apache/arrow-rs/issues/8804">#8804</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>[Parquet] virtual row group number support <a href="https://redirect.github.com/apache/arrow-rs/issues/8800">#8800</a></li> <li>[Variant] Enforce shredded-type validation in <code>shred_variant</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/8795">#8795</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Simplify decision logic to call <code>FilterBuilder::optimize</code> or not <a href="https://redirect.github.com/apache/arrow-rs/issues/8781">#8781</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Add variant to arrow for DataType::{Binary, LargeBinary, BinaryView} <a href="https://redirect.github.com/apache/arrow-rs/issues/8767">#8767</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Provide algorithm that allows zipping arrays whose values are not prealigned <a href="https://redirect.github.com/apache/arrow-rs/issues/8752">#8752</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Parquet] ParquetMetadataReader decodes too much metadata under point-get scenerio <a href="https://redirect.github.com/apache/arrow-rs/issues/8751">#8751</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li><code>arrow-json</code> supports encoding binary arrays, but not decoding <a href="https://redirect.github.com/apache/arrow-rs/issues/8736">#8736</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Allow <code>FilterPredicate</code> instances to be reused for RecordBatches <a href="https://redirect.github.com/apache/arrow-rs/issues/8692">#8692</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>ArrowJsonBatch::from_batch is incomplete <a href="https://redirect.github.com/apache/arrow-rs/issues/8684">#8684</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>parquet-layout: More info about layout including footer size, page index, bloom filter? <a href="https://redirect.github.com/apache/arrow-rs/issues/8682">#8682</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Rewrite <code>ParquetRecordBatchStream</code> (async API) in terms of the PushDecoder <a href="https://redirect.github.com/apache/arrow-rs/issues/8677">#8677</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>[JSON] Add encoding for binary view <a href="https://redirect.github.com/apache/arrow-rs/issues/8674">#8674</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Refactor arrow-cast decimal casting to unify the rescale logic used in Parquet variant casts <a href="https://redirect.github.com/apache/arrow-rs/issues/8670">#8670</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Support Uuid/<code>FixedSizeBinary(16)</code> shredding <a href="https://redirect.github.com/apache/arrow-rs/issues/8665">#8665</a></li> <li>[Parquet]There should be an encoding counter to know how many encodings the repo supports in total <a href="https://redirect.github.com/apache/arrow-rs/issues/8662">#8662</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Improve <code>parse_data_type</code> for <code>List</code>, <code>ListView</code>, <code>LargeList</code>, <code>LargeListView</code>, <code>FixedSizeList</code>, <code>Union</code>, <code>Map</code>, <code>RunEndCoded</code>. <a href="https://redirect.github.com/apache/arrow-rs/issues/8648">#8648</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/apache/arrow-rs/commit/5faa7248bd0d2efec9e072c3f58014691c4bddf3"><code>5faa724</code></a> Prepare for <code>57.0.0</code> release (<a href="https://redirect.github.com/apache/arrow-rs/issues/8644">#8644</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/0c33129b890027b6d421fdfe153e192196b65ad9"><code>0c33129</code></a> parquet-rewrite: add write_batch_size and compression_level config (<a href="https://redirect.github.com/apache/arrow-rs/issues/8642">#8642</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/2f96204ec4a38246bd14ed9b45252498326a5b27"><code>2f96204</code></a> Adding <code>try_append_value</code> implementation to <code>ByteViewBuilder</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8594">#8594</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/f131b5469655c2a1afc3b23ce5e3f850d6a389cf"><code>f131b54</code></a> bench: create <code>zip</code> kernel benchmarks (<a href="https://redirect.github.com/apache/arrow-rs/issues/8654">#8654</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/d49f017fe1c6712ba32e2222c6f031278b588ca5"><code>d49f017</code></a> Introduce a ThriftProtocolError to avoid allocating and formattings strings f...</li> <li><a href="https://github.com/apache/arrow-rs/commit/a85d8b025610ad7eea2266cb4ac383f7fd0c767e"><code>a85d8b0</code></a> [Variant] impl <code>PartialEq</code> and <code>FromIterator\<Option<..>></code> for <code>VariantArray</code> ...</li> <li><a href="https://github.com/apache/arrow-rs/commit/2ca3d609cc99e0c153d1ab2e62af16a4b3f79a12"><code>2ca3d60</code></a> fix: incorrect assertion in <code>BitChunks::new</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8620">#8620</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/5a384f4c3ccd397dcb8763d89e958da3fa4c666c"><code>5a384f4</code></a> Undeprecate <code>ArrowWriter::into_serialized_writer</code> and add docs (<a href="https://redirect.github.com/apache/arrow-rs/issues/8621">#8621</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/f3baa809ef1a21fcfa23268ec7c92896353d94f2"><code>f3baa80</code></a> [thrift-remodel] Add macro to reduce boilerplate necessary to implement Thrif...</li> <li><a href="https://github.com/apache/arrow-rs/commit/caeb4d2e88fb0fea57f1d30b139be1fd6195f174"><code>caeb4d2</code></a> feat: Improve DataType display for <code>RunEndEncoded</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8596">#8596</a>)</li> <li>Additional commits viewable in <a href="https://github.com/apache/arrow-rs/compare/56.2.0...57.0.0">compare view</a></li> </ul> </details> <br /> Updates `arrow-buffer` from 56.2.0 to 57.0.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/releases">arrow-buffer's releases</a>.</em></p> <blockquote> <h2>arrow 57.0.0</h2> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/57.0.0">57.0.0</a> (2025-10-19)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/56.2.0...57.0.0">Full Changelog</a></p> <p><strong>Breaking changes:</strong></p> <ul> <li>Use <code>Arc<FileEncryptionProperties></code> everywhere to be be consistent with <code>FileDecryptionProperties</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8626">#8626</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>feat: Improve DataType display for <code>RunEndEncoded</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8596">#8596</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Add <code>ArrowError::AvroError</code>, remaining types and roundtrip tests to <code>arrow-avro</code>, <a href="https://redirect.github.com/apache/arrow-rs/pull/8595">#8595</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/jecsand838">jecsand838</a>)</li> <li>[thrift-remodel] Refactor Thrift encryption and store encodings as bitmask <a href="https://redirect.github.com/apache/arrow-rs/pull/8587">#8587</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>feat: Enhance <code>Map</code> display formatting in DataType <a href="https://redirect.github.com/apache/arrow-rs/pull/8570">#8570</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>feat: Enhance DataType display formatting for <code>ListView</code> and <code>LargeListView</code> variants <a href="https://redirect.github.com/apache/arrow-rs/pull/8569">#8569</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Use custom thrift parser for parquet metadata (phase 1 of Thrift remodel) <a href="https://redirect.github.com/apache/arrow-rs/pull/8530">#8530</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>refactor: improve display formatting for Union <a href="https://redirect.github.com/apache/arrow-rs/pull/8529">#8529</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Use <code>Arc<FileDecryptionProperties></code> to reduce size of ParquetMetadata and avoid copying when <code>encryption</code> is enabled <a href="https://redirect.github.com/apache/arrow-rs/pull/8470">#8470</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>Fix for column name based projection mask creation <a href="https://redirect.github.com/apache/arrow-rs/pull/8447">#8447</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>Improve Display formatting of DataType::Timestamp <a href="https://redirect.github.com/apache/arrow-rs/pull/8425">#8425</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Use more compact Debug formatting of Field <a href="https://redirect.github.com/apache/arrow-rs/pull/8424">#8424</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Reuse zstd compression context when writing IPC <a href="https://redirect.github.com/apache/arrow-rs/pull/8405">#8405</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow-flight">arrow-flight</a>] (<a href="https://github.com/albertlockett">albertlockett</a>)</li> <li>[Decimal] Add scale argument to validation functions to ensure accurate error logging <a href="https://redirect.github.com/apache/arrow-rs/pull/8396">#8396</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Quote <code>DataType::Struct</code> field names in <code>Display</code> formatting <a href="https://redirect.github.com/apache/arrow-rs/pull/8291">#8291</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Improve <code>Display</code> for <code>DataType</code> and <code>Field</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8290">#8290</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Bump pyo3 to 0.26.0 <a href="https://redirect.github.com/apache/arrow-rs/pull/8286">#8286</a> (<a href="https://github.com/mbrobbel">mbrobbel</a>)</li> </ul> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Added Avro support (new <code>arrow-avro</code> crate) <a href="https://redirect.github.com/apache/arrow-rs/issues/4886">#4886</a></li> <li>parquet-rewrite: supports compression level and write batch size <a href="https://redirect.github.com/apache/arrow-rs/issues/8639">#8639</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/blob/main/CHANGELOG.md">arrow-buffer's changelog</a>.</em></p> <blockquote> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/57.1.0">57.1.0</a> (2025-11-20)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/57.0.0...57.1.0">Full Changelog</a></p> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Eliminate bound checks in filter kernels <a href="https://redirect.github.com/apache/arrow-rs/issues/8865">#8865</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Respect page index policy option for ParquetObjectReader when it's not skip <a href="https://redirect.github.com/apache/arrow-rs/issues/8856">#8856</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Speed up collect_bool and remove <code>unsafe</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/8848">#8848</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Error reading parquet FileMetaData with empty lists encoded as element-type=0 <a href="https://redirect.github.com/apache/arrow-rs/issues/8826">#8826</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>ValueStatistics methods can't be used from generic context in external crate <a href="https://redirect.github.com/apache/arrow-rs/issues/8823">#8823</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Custom Pretty-Printing Implementation for Column when Formatting Record Batches <a href="https://redirect.github.com/apache/arrow-rs/issues/8821">#8821</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Parquet-concat: supports bloom filter and page index <a href="https://redirect.github.com/apache/arrow-rs/issues/8804">#8804</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>[Parquet] virtual row group number support <a href="https://redirect.github.com/apache/arrow-rs/issues/8800">#8800</a></li> <li>[Variant] Enforce shredded-type validation in <code>shred_variant</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/8795">#8795</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Simplify decision logic to call <code>FilterBuilder::optimize</code> or not <a href="https://redirect.github.com/apache/arrow-rs/issues/8781">#8781</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Add variant to arrow for DataType::{Binary, LargeBinary, BinaryView} <a href="https://redirect.github.com/apache/arrow-rs/issues/8767">#8767</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Provide algorithm that allows zipping arrays whose values are not prealigned <a href="https://redirect.github.com/apache/arrow-rs/issues/8752">#8752</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Parquet] ParquetMetadataReader decodes too much metadata under point-get scenerio <a href="https://redirect.github.com/apache/arrow-rs/issues/8751">#8751</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li><code>arrow-json</code> supports encoding binary arrays, but not decoding <a href="https://redirect.github.com/apache/arrow-rs/issues/8736">#8736</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Allow <code>FilterPredicate</code> instances to be reused for RecordBatches <a href="https://redirect.github.com/apache/arrow-rs/issues/8692">#8692</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>ArrowJsonBatch::from_batch is incomplete <a href="https://redirect.github.com/apache/arrow-rs/issues/8684">#8684</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>parquet-layout: More info about layout including footer size, page index, bloom filter? <a href="https://redirect.github.com/apache/arrow-rs/issues/8682">#8682</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Rewrite <code>ParquetRecordBatchStream</code> (async API) in terms of the PushDecoder <a href="https://redirect.github.com/apache/arrow-rs/issues/8677">#8677</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>[JSON] Add encoding for binary view <a href="https://redirect.github.com/apache/arrow-rs/issues/8674">#8674</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Refactor arrow-cast decimal casting to unify the rescale logic used in Parquet variant casts <a href="https://redirect.github.com/apache/arrow-rs/issues/8670">#8670</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Support Uuid/<code>FixedSizeBinary(16)</code> shredding <a href="https://redirect.github.com/apache/arrow-rs/issues/8665">#8665</a></li> <li>[Parquet]There should be an encoding counter to know how many encodings the repo supports in total <a href="https://redirect.github.com/apache/arrow-rs/issues/8662">#8662</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Improve <code>parse_data_type</code> for <code>List</code>, <code>ListView</code>, <code>LargeList</code>, <code>LargeListView</code>, <code>FixedSizeList</code>, <code>Union</code>, <code>Map</code>, <code>RunEndCoded</code>. <a href="https://redirect.github.com/apache/arrow-rs/issues/8648">#8648</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/apache/arrow-rs/commit/5faa7248bd0d2efec9e072c3f58014691c4bddf3"><code>5faa724</code></a> Prepare for <code>57.0.0</code> release (<a href="https://redirect.github.com/apache/arrow-rs/issues/8644">#8644</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/0c33129b890027b6d421fdfe153e192196b65ad9"><code>0c33129</code></a> parquet-rewrite: add write_batch_size and compression_level config (<a href="https://redirect.github.com/apache/arrow-rs/issues/8642">#8642</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/2f96204ec4a38246bd14ed9b45252498326a5b27"><code>2f96204</code></a> Adding <code>try_append_value</code> implementation to <code>ByteViewBuilder</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8594">#8594</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/f131b5469655c2a1afc3b23ce5e3f850d6a389cf"><code>f131b54</code></a> bench: create <code>zip</code> kernel benchmarks (<a href="https://redirect.github.com/apache/arrow-rs/issues/8654">#8654</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/d49f017fe1c6712ba32e2222c6f031278b588ca5"><code>d49f017</code></a> Introduce a ThriftProtocolError to avoid allocating and formattings strings f...</li> <li><a href="https://github.com/apache/arrow-rs/commit/a85d8b025610ad7eea2266cb4ac383f7fd0c767e"><code>a85d8b0</code></a> [Variant] impl <code>PartialEq</code> and <code>FromIterator\<Option<..>></code> for <code>VariantArray</code> ...</li> <li><a href="https://github.com/apache/arrow-rs/commit/2ca3d609cc99e0c153d1ab2e62af16a4b3f79a12"><code>2ca3d60</code></a> fix: incorrect assertion in <code>BitChunks::new</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8620">#8620</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/5a384f4c3ccd397dcb8763d89e958da3fa4c666c"><code>5a384f4</code></a> Undeprecate <code>ArrowWriter::into_serialized_writer</code> and add docs (<a href="https://redirect.github.com/apache/arrow-rs/issues/8621">#8621</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/f3baa809ef1a21fcfa23268ec7c92896353d94f2"><code>f3baa80</code></a> [thrift-remodel] Add macro to reduce boilerplate necessary to implement Thrif...</li> <li><a href="https://github.com/apache/arrow-rs/commit/caeb4d2e88fb0fea57f1d30b139be1fd6195f174"><code>caeb4d2</code></a> feat: Improve DataType display for <code>RunEndEncoded</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8596">#8596</a>)</li> <li>Additional commits viewable in <a href="https://github.com/apache/arrow-rs/compare/56.2.0...57.0.0">compare view</a></li> </ul> </details> <br /> Updates `arrow-schema` from 56.2.0 to 57.0.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/releases">arrow-schema's releases</a>.</em></p> <blockquote> <h2>arrow 57.0.0</h2> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/57.0.0">57.0.0</a> (2025-10-19)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/56.2.0...57.0.0">Full Changelog</a></p> <p><strong>Breaking changes:</strong></p> <ul> <li>Use <code>Arc<FileEncryptionProperties></code> everywhere to be be consistent with <code>FileDecryptionProperties</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8626">#8626</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>feat: Improve DataType display for <code>RunEndEncoded</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8596">#8596</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Add <code>ArrowError::AvroError</code>, remaining types and roundtrip tests to <code>arrow-avro</code>, <a href="https://redirect.github.com/apache/arrow-rs/pull/8595">#8595</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/jecsand838">jecsand838</a>)</li> <li>[thrift-remodel] Refactor Thrift encryption and store encodings as bitmask <a href="https://redirect.github.com/apache/arrow-rs/pull/8587">#8587</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>feat: Enhance <code>Map</code> display formatting in DataType <a href="https://redirect.github.com/apache/arrow-rs/pull/8570">#8570</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>feat: Enhance DataType display formatting for <code>ListView</code> and <code>LargeListView</code> variants <a href="https://redirect.github.com/apache/arrow-rs/pull/8569">#8569</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Use custom thrift parser for parquet metadata (phase 1 of Thrift remodel) <a href="https://redirect.github.com/apache/arrow-rs/pull/8530">#8530</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>refactor: improve display formatting for Union <a href="https://redirect.github.com/apache/arrow-rs/pull/8529">#8529</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Use <code>Arc<FileDecryptionProperties></code> to reduce size of ParquetMetadata and avoid copying when <code>encryption</code> is enabled <a href="https://redirect.github.com/apache/arrow-rs/pull/8470">#8470</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>Fix for column name based projection mask creation <a href="https://redirect.github.com/apache/arrow-rs/pull/8447">#8447</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>Improve Display formatting of DataType::Timestamp <a href="https://redirect.github.com/apache/arrow-rs/pull/8425">#8425</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Use more compact Debug formatting of Field <a href="https://redirect.github.com/apache/arrow-rs/pull/8424">#8424</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Reuse zstd compression context when writing IPC <a href="https://redirect.github.com/apache/arrow-rs/pull/8405">#8405</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow-flight">arrow-flight</a>] (<a href="https://github.com/albertlockett">albertlockett</a>)</li> <li>[Decimal] Add scale argument to validation functions to ensure accurate error logging <a href="https://redirect.github.com/apache/arrow-rs/pull/8396">#8396</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Quote <code>DataType::Struct</code> field names in <code>Display</code> formatting <a href="https://redirect.github.com/apache/arrow-rs/pull/8291">#8291</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Improve <code>Display</code> for <code>DataType</code> and <code>Field</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8290">#8290</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Bump pyo3 to 0.26.0 <a href="https://redirect.github.com/apache/arrow-rs/pull/8286">#8286</a> (<a href="https://github.com/mbrobbel">mbrobbel</a>)</li> </ul> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Added Avro support (new <code>arrow-avro</code> crate) <a href="https://redirect.github.com/apache/arrow-rs/issues/4886">#4886</a></li> <li>parquet-rewrite: supports compression level and write batch size <a href="https://redirect.github.com/apache/arrow-rs/issues/8639">#8639</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/blob/main/CHANGELOG.md">arrow-schema's changelog</a>.</em></p> <blockquote> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/57.1.0">57.1.0</a> (2025-11-20)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/57.0.0...57.1.0">Full Changelog</a></p> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Eliminate bound checks in filter kernels <a href="https://redirect.github.com/apache/arrow-rs/issues/8865">#8865</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Respect page index policy option for ParquetObjectReader when it's not skip <a href="https://redirect.github.com/apache/arrow-rs/issues/8856">#8856</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Speed up collect_bool and remove <code>unsafe</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/8848">#8848</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Error reading parquet FileMetaData with empty lists encoded as element-type=0 <a href="https://redirect.github.com/apache/arrow-rs/issues/8826">#8826</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>ValueStatistics methods can't be used from generic context in external crate <a href="https://redirect.github.com/apache/arrow-rs/issues/8823">#8823</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Custom Pretty-Printing Implementation for Column when Formatting Record Batches <a href="https://redirect.github.com/apache/arrow-rs/issues/8821">#8821</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Parquet-concat: supports bloom filter and page index <a href="https://redirect.github.com/apache/arrow-rs/issues/8804">#8804</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>[Parquet] virtual row group number support <a href="https://redirect.github.com/apache/arrow-rs/issues/8800">#8800</a></li> <li>[Variant] Enforce shredded-type validation in <code>shred_variant</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/8795">#8795</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Simplify decision logic to call <code>FilterBuilder::optimize</code> or not <a href="https://redirect.github.com/apache/arrow-rs/issues/8781">#8781</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Add variant to arrow for DataType::{Binary, LargeBinary, BinaryView} <a href="https://redirect.github.com/apache/arrow-rs/issues/8767">#8767</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Provide algorithm that allows zipping arrays whose values are not prealigned <a href="https://redirect.github.com/apache/arrow-rs/issues/8752">#8752</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Parquet] ParquetMetadataReader decodes too much metadata under point-get scenerio <a href="https://redirect.github.com/apache/arrow-rs/issues/8751">#8751</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li><code>arrow-json</code> supports encoding binary arrays, but not decoding <a href="https://redirect.github.com/apache/arrow-rs/issues/8736">#8736</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Allow <code>FilterPredicate</code> instances to be reused for RecordBatches <a href="https://redirect.github.com/apache/arrow-rs/issues/8692">#8692</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>ArrowJsonBatch::from_batch is incomplete <a href="https://redirect.github.com/apache/arrow-rs/issues/8684">#8684</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>parquet-layout: More info about layout including footer size, page index, bloom filter? <a href="https://redirect.github.com/apache/arrow-rs/issues/8682">#8682</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Rewrite <code>ParquetRecordBatchStream</code> (async API) in terms of the PushDecoder <a href="https://redirect.github.com/apache/arrow-rs/issues/8677">#8677</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>[JSON] Add encoding for binary view <a href="https://redirect.github.com/apache/arrow-rs/issues/8674">#8674</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Refactor arrow-cast decimal casting to unify the rescale logic used in Parquet variant casts <a href="https://redirect.github.com/apache/arrow-rs/issues/8670">#8670</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Support Uuid/<code>FixedSizeBinary(16)</code> shredding <a href="https://redirect.github.com/apache/arrow-rs/issues/8665">#8665</a></li> <li>[Parquet]There should be an encoding counter to know how many encodings the repo supports in total <a href="https://redirect.github.com/apache/arrow-rs/issues/8662">#8662</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Improve <code>parse_data_type</code> for <code>List</code>, <code>ListView</code>, <code>LargeList</code>, <code>LargeListView</code>, <code>FixedSizeList</code>, <code>Union</code>, <code>Map</code>, <code>RunEndCoded</code>. <a href="https://redirect.github.com/apache/arrow-rs/issues/8648">#8648</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/apache/arrow-rs/commit/5faa7248bd0d2efec9e072c3f58014691c4bddf3"><code>5faa724</code></a> Prepare for <code>57.0.0</code> release (<a href="https://redirect.github.com/apache/arrow-rs/issues/8644">#8644</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/0c33129b890027b6d421fdfe153e192196b65ad9"><code>0c33129</code></a> parquet-rewrite: add write_batch_size and compression_level config (<a href="https://redirect.github.com/apache/arrow-rs/issues/8642">#8642</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/2f96204ec4a38246bd14ed9b45252498326a5b27"><code>2f96204</code></a> Adding <code>try_append_value</code> implementation to <code>ByteViewBuilder</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8594">#8594</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/f131b5469655c2a1afc3b23ce5e3f850d6a389cf"><code>f131b54</code></a> bench: create <code>zip</code> kernel benchmarks (<a href="https://redirect.github.com/apache/arrow-rs/issues/8654">#8654</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/d49f017fe1c6712ba32e2222c6f031278b588ca5"><code>d49f017</code></a> Introduce a ThriftProtocolError to avoid allocating and formattings strings f...</li> <li><a href="https://github.com/apache/arrow-rs/commit/a85d8b025610ad7eea2266cb4ac383f7fd0c767e"><code>a85d8b0</code></a> [Variant] impl <code>PartialEq</code> and <code>FromIterator\<Option<..>></code> for <code>VariantArray</code> ...</li> <li><a href="https://github.com/apache/arrow-rs/commit/2ca3d609cc99e0c153d1ab2e62af16a4b3f79a12"><code>2ca3d60</code></a> fix: incorrect assertion in <code>BitChunks::new</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8620">#8620</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/5a384f4c3ccd397dcb8763d89e958da3fa4c666c"><code>5a384f4</code></a> Undeprecate <code>ArrowWriter::into_serialized_writer</code> and add docs (<a href="https://redirect.github.com/apache/arrow-rs/issues/8621">#8621</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/f3baa809ef1a21fcfa23268ec7c92896353d94f2"><code>f3baa80</code></a> [thrift-remodel] Add macro to reduce boilerplate necessary to implement Thrif...</li> <li><a href="https://github.com/apache/arrow-rs/commit/caeb4d2e88fb0fea57f1d30b139be1fd6195f174"><code>caeb4d2</code></a> feat: Improve DataType display for <code>RunEndEncoded</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8596">#8596</a>)</li> <li>Additional commits viewable in <a href="https://github.com/apache/arrow-rs/compare/56.2.0...57.0.0">compare view</a></li> </ul> </details> <br /> Updates `arrow-select` from 56.2.0 to 57.0.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/releases">arrow-select's releases</a>.</em></p> <blockquote> <h2>arrow 57.0.0</h2> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/57.0.0">57.0.0</a> (2025-10-19)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/56.2.0...57.0.0">Full Changelog</a></p> <p><strong>Breaking changes:</strong></p> <ul> <li>Use <code>Arc<FileEncryptionProperties></code> everywhere to be be consistent with <code>FileDecryptionProperties</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8626">#8626</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>feat: Improve DataType display for <code>RunEndEncoded</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8596">#8596</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Add <code>ArrowError::AvroError</code>, remaining types and roundtrip tests to <code>arrow-avro</code>, <a href="https://redirect.github.com/apache/arrow-rs/pull/8595">#8595</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/jecsand838">jecsand838</a>)</li> <li>[thrift-remodel] Refactor Thrift encryption and store encodings as bitmask <a href="https://redirect.github.com/apache/arrow-rs/pull/8587">#8587</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>feat: Enhance <code>Map</code> display formatting in DataType <a href="https://redirect.github.com/apache/arrow-rs/pull/8570">#8570</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>feat: Enhance DataType display formatting for <code>ListView</code> and <code>LargeListView</code> variants <a href="https://redirect.github.com/apache/arrow-rs/pull/8569">#8569</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Use custom thrift parser for parquet metadata (phase 1 of Thrift remodel) <a href="https://redirect.github.com/apache/arrow-rs/pull/8530">#8530</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>refactor: improve display formatting for Union <a href="https://redirect.github.com/apache/arrow-rs/pull/8529">#8529</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Use <code>Arc<FileDecryptionProperties></code> to reduce size of ParquetMetadata and avoid copying when <code>encryption</code> is enabled <a href="https://redirect.github.com/apache/arrow-rs/pull/8470">#8470</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>Fix for column name based projection mask creation <a href="https://redirect.github.com/apache/arrow-rs/pull/8447">#8447</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/etseidl">etseidl</a>)</li> <li>Improve Display formatting of DataType::Timestamp <a href="https://redirect.github.com/apache/arrow-rs/pull/8425">#8425</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Use more compact Debug formatting of Field <a href="https://redirect.github.com/apache/arrow-rs/pull/8424">#8424</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Reuse zstd compression context when writing IPC <a href="https://redirect.github.com/apache/arrow-rs/pull/8405">#8405</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow-flight">arrow-flight</a>] (<a href="https://github.com/albertlockett">albertlockett</a>)</li> <li>[Decimal] Add scale argument to validation functions to ensure accurate error logging <a href="https://redirect.github.com/apache/arrow-rs/pull/8396">#8396</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/Weijun-H">Weijun-H</a>)</li> <li>Quote <code>DataType::Struct</code> field names in <code>Display</code> formatting <a href="https://redirect.github.com/apache/arrow-rs/pull/8291">#8291</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Improve <code>Display</code> for <code>DataType</code> and <code>Field</code> <a href="https://redirect.github.com/apache/arrow-rs/pull/8290">#8290</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] (<a href="https://github.com/emilk">emilk</a>)</li> <li>Bump pyo3 to 0.26.0 <a href="https://redirect.github.com/apache/arrow-rs/pull/8286">#8286</a> (<a href="https://github.com/mbrobbel">mbrobbel</a>)</li> </ul> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Added Avro support (new <code>arrow-avro</code> crate) <a href="https://redirect.github.com/apache/arrow-rs/issues/4886">#4886</a></li> <li>parquet-rewrite: supports compression level and write batch size <a href="https://redirect.github.com/apache/arrow-rs/issues/8639">#8639</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/blob/main/CHANGELOG.md">arrow-select's changelog</a>.</em></p> <blockquote> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/57.1.0">57.1.0</a> (2025-11-20)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/57.0.0...57.1.0">Full Changelog</a></p> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Eliminate bound checks in filter kernels <a href="https://redirect.github.com/apache/arrow-rs/issues/8865">#8865</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Respect page index policy option for ParquetObjectReader when it's not skip <a href="https://redirect.github.com/apache/arrow-rs/issues/8856">#8856</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Speed up collect_bool and remove <code>unsafe</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/8848">#8848</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Error reading parquet FileMetaData with empty lists encoded as element-type=0 <a href="https://redirect.github.com/apache/arrow-rs/issues/8826">#8826</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>ValueStatistics methods can't be used from generic context in external crate <a href="https://redirect.github.com/apache/arrow-rs/issues/8823">#8823</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Custom Pretty-Printing Implementation for Column when Formatting Record Batches <a href="https://redirect.github.com/apache/arrow-rs/issues/8821">#8821</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Parquet-concat: supports bloom filter and page index <a href="https://redirect.github.com/apache/arrow-rs/issues/8804">#8804</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>[Parquet] virtual row group number support <a href="https://redirect.github.com/apache/arrow-rs/issues/8800">#8800</a></li> <li>[Variant] Enforce shredded-type validation in <code>shred_variant</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/8795">#8795</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Simplify decision logic to call <code>FilterBuilder::optimize</code> or not <a href="https://redirect.github.com/apache/arrow-rs/issues/8781">#8781</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Add variant to arrow for DataType::{Binary, LargeBinary, BinaryView} <a href="https://redirect.github.com/apache/arrow-rs/issues/8767">#8767</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Provide algorithm that allows zipping arrays whose values are not prealigned <a href="https://redirect.github.com/apache/arrow-rs/issues/8752">#8752</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Parquet] ParquetMetadataReader decodes too much metadata under point-get scenerio <a href="https://redirect.github.com/apache/arrow-rs/issues/8751">#8751</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li><code>arrow-json</code> supports encoding binary arrays, but not decoding <a href="https://redirect.github.com/apache/arrow-rs/issues/8736">#8736</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Allow <code>FilterPredicate</code> instances to be reused for RecordBatches <a href="https://redirect.github.com/apache/arrow-rs/issues/8692">#8692</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>ArrowJsonBatch::from_batch is incomplete <a href="https://redirect.github.com/apache/arrow-rs/issues/8684">#8684</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>parquet-layout: More info about layout including footer size, page index, bloom filter? <a href="https://redirect.github.com/apache/arrow-rs/issues/8682">#8682</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Rewrite <code>ParquetRecordBatchStream</code> (async API) in terms of the PushDecoder <a href="https://redirect.github.com/apache/arrow-rs/issues/8677">#8677</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>[JSON] Add encoding for binary view <a href="https://redirect.github.com/apache/arrow-rs/issues/8674">#8674</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Refactor arrow-cast decimal casting to unify the rescale logic used in Parquet variant casts <a href="https://redirect.github.com/apache/arrow-rs/issues/8670">#8670</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Support Uuid/<code>FixedSizeBinary(16)</code> shredding <a href="https://redirect.github.com/apache/arrow-rs/issues/8665">#8665</a></li> <li>[Parquet]There should be an encoding counter to know how many encodings the repo supports in total <a href="https://redirect.github.com/apache/arrow-rs/issues/8662">#8662</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Improve <code>parse_data_type</code> for <code>List</code>, <code>ListView</code>, <code>LargeList</code>, <code>LargeListView</code>, <code>FixedSizeList</code>, <code>Union</code>, <code>Map</code>, <code>RunEndCoded</code>. <a href="https://redirect.github.com/apache/arrow-rs/issues/8648">#8648</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/apache/arrow-rs/commit/5faa7248bd0d2efec9e072c3f58014691c4bddf3"><code>5faa724</code></a> Prepare for <code>57.0.0</code> release (<a href="https://redirect.github.com/apache/arrow-rs/issues/8644">#8644</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/0c33129b890027b6d421fdfe153e192196b65ad9"><code>0c33129</code></a> parquet-rewrite: add write_batch_size and compression_level config (<a href="https://redirect.github.com/apache/arrow-rs/issues/8642">#8642</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/2f96204ec4a38246bd14ed9b45252498326a5b27"><code>2f96204</code></a> Adding <code>try_append_value</code> implementation to <code>ByteViewBuilder</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8594">#8594</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/f131b5469655c2a1afc3b23ce5e3f850d6a389cf"><code>f131b54</code></a> bench: create <code>zip</code> kernel benchmarks (<a href="https://redirect.github.com/apache/arrow-rs/issues/8654">#8654</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/d49f017fe1c6712ba32e2222c6f031278b588ca5"><code>d49f017</code></a> Introduce a ThriftProtocolError to avoid allocating and formattings strings f...</li> <li><a href="https://github.com/apache/arrow-rs/commit/a85d8b025610ad7eea2266cb4ac383f7fd0c767e"><code>a85d8b0</code></a> [Variant] impl <code>PartialEq</code> and <code>FromIterator\<Option<..>></code> for <code>VariantArray</code> ...</li> <li><a href="https://github.com/apache/arrow-rs/commit/2ca3d609cc99e0c153d1ab2e62af16a4b3f79a12"><code>2ca3d60</code></a> fix: incorrect assertion in <code>BitChunks::new</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8620">#8620</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/5a384f4c3ccd397dcb8763d89e958da3fa4c666c"><code>5a384f4</code></a> Undeprecate <code>ArrowWriter::into_serialized_writer</code> and add docs (<a href="https://redirect.github.com/apache/arrow-rs/issues/8621">#8621</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/f3baa809ef1a21fcfa23268ec7c92896353d94f2"><code>f3baa80</code></a> [thrift-remodel] Add macro to reduce boilerplate necessary to implement Thrif...</li> <li><a href="https://github.com/apache/arrow-rs/commit/caeb4d2e88fb0fea57f1d30b139be1fd6195f174"><code>caeb4d2</code></a> feat: Improve DataType display for <code>RunEndEncoded</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/8596">#8596</a>)</li> <li>Additional commits viewable in <a href="https://github.com/apache/arrow-rs/compare/56.2.0...57.0.0">compare view</a></li> </ul> </details> <br /> Updates `datafusion` from 50.3.0 to 51.0.0 <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/apache/datafusion/commit/fd35a09438a2b4841431f5e86ffef378cbbda7c9"><code>fd35a09</code></a> [branch-51] Update changelog for 51.0.0 RC2 (<a href="https://redirect.github.com/apache/datafusion/issues/18705">#18705</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/187c6b25f56dcd8f6f1ac2edf71d4e4e18a44e2c"><code>187c6b2</code></a> [branch-51] bugfix: select_columns should validate column names (<a href="https://redirect.github.com/apache/datafusion/issues/18624">#18624</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/6dcde50500cc59d0bd68fcad3fa2263264f1ab39"><code>6dcde50</code></a> [branch-51]: Add timezone to date_trunc fast path (<a href="https://redirect.github.com/apache/datafusion/issues/18596">#18596</a>) (<a href="https://redirect.github.com/apache/datafusion/issues/18629">#18629</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/5c743d766068a89dbaa7c624d2eb0c1553b50c42"><code>5c743d7</code></a> [branch-51] bugfix: correct regression on TableType in into_view in DF51 (<a href="https://redirect.github.com/apache/datafusion/issues/18">#18</a>...</li> <li><a href="https://github.com/apache/datafusion/commit/49e347bb9690b19d94fb1e46f093e58106c622ad"><code>49e347b</code></a> [branch-51] Update Changelog (<a href="https://redirect.github.com/apache/datafusion/issues/18592">#18592</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/2677c27541d9ec568434b8b99f136e45c3d383bf"><code>2677c27</code></a> [branch-51] Revert rewrite for coalesce, <code>nvl</code> and <code>nvl2</code> simplification (<a href="https://redirect.github.com/apache/datafusion/issues/18">#18</a>...</li> <li><a href="https://github.com/apache/datafusion/commit/ff96b3b72cec5bb6550cfc2f7633fa3ff162b86d"><code>ff96b3b</code></a> [branch-51] Update version to 51.0.0, add Changelog (<a href="https://redirect.github.com/apache/datafusion/issues/18551">#18551</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/c826009586b55a1688dd0c119974d4913072f342"><code>c826009</code></a> Refactor <code>log()</code> signature to use coercion API + fixes (<a href="https://redirect.github.com/apache/datafusion/issues/18519">#18519</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/2233796e789bf2a230067f915ec952da8eeec988"><code>2233796</code></a> minor: Remove inconsistent comment (<a href="https://redirect.github.com/apache/datafusion/issues/18539">#18539</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/fe24690997597bb8402acd456b45a9075683e178"><code>fe24690</code></a> Add comments to Cargo.toml about workspace overrides (<a href="https://redirect.github.com/apache/datafusion/issues/18526">#18526</a>)</li> <li>Additional commits viewable in <a href="https://github.com/apache/datafusion/compare/50.3.0...51.0.0">compare view</a></li> </ul> </details> <br /> Updates `datafusion-substrait` from 50.3.0 to 51.0.0 <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/apache/datafusion/commit/fd35a09438a2b4841431f5e86ffef378cbbda7c9"><code>fd35a09</code></a> [branch-51] Update changelog for 51.0.0 RC2 (<a href="https://redirect.github.com/apache/datafusion/issues/18705">#18705</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/187c6b25f56dcd8f6f1ac2edf71d4e4e18a44e2c"><code>187c6b2</code></a> [branch-51] bugfix: select_columns should validate column names (<a href="https://redirect.github.com/apache/datafusion/issues/18624">#18624</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/6dcde50500cc59d0bd68fcad3fa2263264f1ab39"><code>6dcde50</code></a> [branch-51]: Add timezone to date_trunc fast path (<a href="https://redirect.github.com/apache/datafusion/issues/18596">#18596</a>) (<a href="https://redirect.github.com/apache/datafusion/issues/18629">#18629</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/5c743d766068a89dbaa7c624d2eb0c1553b50c42"><code>5c743d7</code></a> [branch-51] bugfix: correct regression on TableType in into_view in DF51 (<a href="https://redirect.github.com/apache/datafusion/issues/18">#18</a>...</li> <li><a href="https://github.com/apache/datafusion/commit/49e347bb9690b19d94fb1e46f093e58106c622ad"><code>49e347b</code></a> [branch-51] Update Changelog (<a href="https://redirect.github.com/apache/datafusion/issues/18592">#18592</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/2677c27541d9ec568434b8b99f136e45c3d383bf"><code>2677c27</code></a> [branch-51] Revert rewrite for coalesce, <code>nvl</code> and <code>nvl2</code> simplification (<a href="https://redirect.github.com/apache/datafusion/issues/18">#18</a>...</li> <li><a href="https://github.com/apache/datafusion/commit/ff96b3b72cec5bb6550cfc2f7633fa3ff162b86d"><code>ff96b3b</code></a> [branch-51] Update version to 51.0.0, add Changelog (<a href="https://redirect.github.com/apache/datafusion/issues/18551">#18551</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/c826009586b55a1688dd0c119974d4913072f342"><code>c826009</code></a> Refactor <code>log()</code> signature to use coercion API + fixes (<a href="https://redirect.github.com/apache/datafusion/issues/18519">#18519</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/2233796e789bf2a230067f915ec952da8eeec988"><code>2233796</code></a> minor: Remove inconsistent comment (<a href="https://redirect.github.com/apache/datafusion/issues/18539">#18539</a>)</li> <li><a href="https://github.com/apache/datafusion/commit/fe24690997597bb8402acd456b45a9075683e178"><code>fe24690</code></a> Add comments to Cargo.toml about workspace overrides (<a href="https://redirect.github.com/apache/datafusion/issues/18526">#18526</a>)</li> <li>Additional commits viewable in <a href="https://github.com/apache/datafusion/compare/50.3.0...51.0.0">compare view</a></li> </ul> </details> <br /> Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore <dependency name> major version` will close this group upd... _Description has been truncated_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
