This is an automated email from the ASF dual-hosted git repository.
alamb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/arrow-rs.git
The following commit(s) were added to refs/heads/master by this push:
new 774b721452 Prepare arrow/parquet `53.0.0` release (#6338)
774b721452 is described below
commit 774b721452877346cf7fa95c3b0deb181955bea8
Author: Andrew Lamb <[email protected]>
AuthorDate: Sat Aug 31 10:00:13 2024 -0400
Prepare arrow/parquet `53.0.0` release (#6338)
* Update version to 53.0.0
* Update changelog script
* Update CHANGELOG.md
* update changelog
---
CHANGELOG-old.md | 67 ++++++++++++++
CHANGELOG.md | 189 ++++++++++++++++++++++++++++-----------
Cargo.toml | 32 +++----
dev/release/update_change_log.sh | 4 +-
4 files changed, 223 insertions(+), 69 deletions(-)
diff --git a/CHANGELOG-old.md b/CHANGELOG-old.md
index 8b70fea397..4808cde703 100644
--- a/CHANGELOG-old.md
+++ b/CHANGELOG-old.md
@@ -19,6 +19,73 @@
# Historical Changelog
+## [52.2.0](https://github.com/apache/arrow-rs/tree/52.2.0) (2024-07-24)
+
+[Full Changelog](https://github.com/apache/arrow-rs/compare/52.1.0...52.2.0)
+
+**Implemented enhancements:**
+
+- Faster min/max for string/binary view arrays
[\#6088](https://github.com/apache/arrow-rs/issues/6088)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Support casting to/from Utf8View
[\#6076](https://github.com/apache/arrow-rs/issues/6076)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Min/max support for String/BinaryViewArray
[\#6052](https://github.com/apache/arrow-rs/issues/6052)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Improve performance of constructing `ByteView`s for small strings
[\#6034](https://github.com/apache/arrow-rs/issues/6034)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Fast UTF-8 validation when reading StringViewArray from Parquet
[\#5995](https://github.com/apache/arrow-rs/issues/5995)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Optimize StringView row decoding
[\#5945](https://github.com/apache/arrow-rs/issues/5945)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Implementing `deduplicate` / `intern` functionality for StringView
[\#5910](https://github.com/apache/arrow-rs/issues/5910)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Add `FlightSqlServiceClient::new_from_inner`
[\#6003](https://github.com/apache/arrow-rs/pull/6003)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
+- Complete `StringViewArray` and `BinaryViewArray` parquet decoder:
[\#6004](https://github.com/apache/arrow-rs/pull/6004)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Add begin/end\_transaction methods in FlightSqlServiceClient
[\#6026](https://github.com/apache/arrow-rs/pull/6026)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
+- Read Parquet statistics as arrow `Arrays`
[\#6046](https://github.com/apache/arrow-rs/pull/6046)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([efredine](https://github.com/efredine))
+
+**Fixed bugs:**
+
+- Panic in `ParquetMetadata::memory_size` if no min/max set
[\#6091](https://github.com/apache/arrow-rs/issues/6091)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- BinaryViewArray doesn't roundtrip a single `Some(&[])` through parquet
[\#6086](https://github.com/apache/arrow-rs/issues/6086)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Parquet `ColumnIndex` for null columns is written even when statistics are
disabled [\#6010](https://github.com/apache/arrow-rs/issues/6010)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+
+**Documentation updates:**
+
+- Fix typo in GenericByteViewArray documentation
[\#6054](https://github.com/apache/arrow-rs/pull/6054)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([progval](https://github.com/progval))
+- Minor: Improve parquet PageIndex documentation
[\#6042](https://github.com/apache/arrow-rs/pull/6042)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+
+**Closed issues:**
+
+- Potential performance improvements for reading Parquet to
StringViewArray/BinaryViewArray
[\#5904](https://github.com/apache/arrow-rs/issues/5904)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+
+**Merged pull requests:**
+
+- Faster `GenericByteView` construction
[\#6102](https://github.com/apache/arrow-rs/pull/6102)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Add benchmark to track byte-view construction performance
[\#6101](https://github.com/apache/arrow-rs/pull/6101)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Optimize `bool_or` using `max_boolean`
[\#6100](https://github.com/apache/arrow-rs/pull/6100)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([simonvandel](https://github.com/simonvandel))
+- Optimize `max_boolean` by operating on u64 chunks
[\#6098](https://github.com/apache/arrow-rs/pull/6098)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([simonvandel](https://github.com/simonvandel))
+- fix panic in `ParquetMetadata::memory_size`: check has\_min\_max\_set before
invoking min\(\)/max\(\) [\#6092](https://github.com/apache/arrow-rs/pull/6092)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([Fischer0522](https://github.com/Fischer0522))
+- Implement specialized min/max for `GenericBinaryView` \(`StringView` and
`BinaryView`\) [\#6089](https://github.com/apache/arrow-rs/pull/6089)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Add PartialEq to ParquetMetaData and FileMetadata
[\#6082](https://github.com/apache/arrow-rs/pull/6082)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([adriangb](https://github.com/adriangb))
+- Enable casting from Utf8View
[\#6077](https://github.com/apache/arrow-rs/pull/6077)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([a10y](https://github.com/a10y))
+- StringView support in arrow-csv
[\#6062](https://github.com/apache/arrow-rs/pull/6062)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([2010YOUY01](https://github.com/2010YOUY01))
+- Implement min max support for string/binary view types
[\#6053](https://github.com/apache/arrow-rs/pull/6053)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Minor: clarify the relationship between `file::metadata` and `format` in
docs [\#6049](https://github.com/apache/arrow-rs/pull/6049)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- Minor API adjustments for StringViewBuilder
[\#6047](https://github.com/apache/arrow-rs/pull/6047)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Add parquet `StatisticsConverter` for arrow reader
[\#6046](https://github.com/apache/arrow-rs/pull/6046)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([efredine](https://github.com/efredine))
+- Directly decode String/BinaryView types from arrow-row format
[\#6044](https://github.com/apache/arrow-rs/pull/6044)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Clean up unused code for view types in offset buffer
[\#6040](https://github.com/apache/arrow-rs/pull/6040)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Avoid using Buffer api that accidentally copies data
[\#6039](https://github.com/apache/arrow-rs/pull/6039)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([XiangpengHao](https://github.com/XiangpengHao))
+- MINOR: Fix `hashbrown` version in `arrow-array`, remove from `arrow-row`
[\#6035](https://github.com/apache/arrow-rs/pull/6035)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([mbrobbel](https://github.com/mbrobbel))
+- Improve performance reading `ByteViewArray` from parquet by removing an
implicit copy [\#6031](https://github.com/apache/arrow-rs/pull/6031)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Add begin/end\_transaction methods in FlightSqlServiceClient
[\#6026](https://github.com/apache/arrow-rs/pull/6026)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
+- Unsafe improvements: core `parquet` crate.
[\#6024](https://github.com/apache/arrow-rs/pull/6024)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([veluca93](https://github.com/veluca93))
+- Additional tests for parquet reader utf8 validation
[\#6023](https://github.com/apache/arrow-rs/pull/6023)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- Update zstd-sys requirement from \>=2.0.0, \<2.0.12 to \>=2.0.0, \<2.0.13
[\#6019](https://github.com/apache/arrow-rs/pull/6019)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([dependabot[bot]](https://github.com/apps/dependabot))
+- fix doc ci in latest rust nightly version
[\#6012](https://github.com/apache/arrow-rs/pull/6012)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([Rachelint](https://github.com/Rachelint))
+- Do not write `ColumnIndex` for null columns when not writing page statistics
[\#6011](https://github.com/apache/arrow-rs/pull/6011)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Fast utf8 validation when loading string view from parquet
[\#6009](https://github.com/apache/arrow-rs/pull/6009)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Deduplicate strings/binarys when building view types
[\#6005](https://github.com/apache/arrow-rs/pull/6005)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Complete `StringViewArray` and `BinaryViewArray` parquet decoder: implement
delta byte array and delta length byte array encoding
[\#6004](https://github.com/apache/arrow-rs/pull/6004)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Add `FlightSqlServiceClient::new_from_inner`
[\#6003](https://github.com/apache/arrow-rs/pull/6003)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
+- Rename `Schema::all_fields` to `flattened_fields`
[\#6001](https://github.com/apache/arrow-rs/pull/6001)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
+- Refine documentation and examples for `DataType`
[\#5997](https://github.com/apache/arrow-rs/pull/5997)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- implement `DataType::try_form(&str)`
[\#5994](https://github.com/apache/arrow-rs/pull/5994)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([samuelcolvin](https://github.com/samuelcolvin))
+- Implement dictionary support for reading ByteView from parquet
[\#5973](https://github.com/apache/arrow-rs/pull/5973)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
## [52.1.0](https://github.com/apache/arrow-rs/tree/52.1.0) (2024-07-02)
[Full Changelog](https://github.com/apache/arrow-rs/compare/52.0.0...52.1.0)
diff --git a/CHANGELOG.md b/CHANGELOG.md
index a1ac7deefc..72b8090197 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -19,73 +19,160 @@
# Changelog
-## [52.2.0](https://github.com/apache/arrow-rs/tree/52.2.0) (2024-07-24)
+## [53.0.0](https://github.com/apache/arrow-rs/tree/53.0.0) (2024-08-31)
-[Full Changelog](https://github.com/apache/arrow-rs/compare/52.1.0...52.2.0)
+[Full Changelog](https://github.com/apache/arrow-rs/compare/52.2.0...53.0.0)
+
+**Breaking changes:**
+
+- parquet\_derive: Match fields by name, support reading selected fields
rather than all [\#6269](https://github.com/apache/arrow-rs/pull/6269)
([double-free](https://github.com/double-free))
+- Update parquet object\_store dependency to 0.11.0
[\#6264](https://github.com/apache/arrow-rs/pull/6264)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- parquet Statistics - deprecate `has_*` APIs and add `_opt` functions that
return `Option<T>` [\#6216](https://github.com/apache/arrow-rs/pull/6216)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([Michael-J-Ward](https://github.com/Michael-J-Ward))
+- Expose bulk ingest in flight sql client and server
[\#6201](https://github.com/apache/arrow-rs/pull/6201)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([djanderson](https://github.com/djanderson))
+- Upgrade protobuf definitions to flightsql 17.0 \(\#6133\)
[\#6169](https://github.com/apache/arrow-rs/pull/6169)
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([alamb](https://github.com/alamb))
+- Remove automatic buffering in `ipc::reader::FileReader` for for consistent
buffering [\#6132](https://github.com/apache/arrow-rs/pull/6132)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([V0ldek](https://github.com/V0ldek))
+- No longer write Parquet column metadata after column chunks \*and\* in the
footer [\#6117](https://github.com/apache/arrow-rs/pull/6117)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
**Implemented enhancements:**
-- Faster min/max for string/binary view arrays
[\#6088](https://github.com/apache/arrow-rs/issues/6088)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
-- Support casting to/from Utf8View
[\#6076](https://github.com/apache/arrow-rs/issues/6076)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
-- Min/max support for String/BinaryViewArray
[\#6052](https://github.com/apache/arrow-rs/issues/6052)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
-- Improve performance of constructing `ByteView`s for small strings
[\#6034](https://github.com/apache/arrow-rs/issues/6034)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
-- Fast UTF-8 validation when reading StringViewArray from Parquet
[\#5995](https://github.com/apache/arrow-rs/issues/5995)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
-- Optimize StringView row decoding
[\#5945](https://github.com/apache/arrow-rs/issues/5945)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
-- Implementing `deduplicate` / `intern` functionality for StringView
[\#5910](https://github.com/apache/arrow-rs/issues/5910)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
-- Add `FlightSqlServiceClient::new_from_inner`
[\#6003](https://github.com/apache/arrow-rs/pull/6003)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
-- Complete `StringViewArray` and `BinaryViewArray` parquet decoder:
[\#6004](https://github.com/apache/arrow-rs/pull/6004)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Add begin/end\_transaction methods in FlightSqlServiceClient
[\#6026](https://github.com/apache/arrow-rs/pull/6026)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
-- Read Parquet statistics as arrow `Arrays`
[\#6046](https://github.com/apache/arrow-rs/pull/6046)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([efredine](https://github.com/efredine))
+- Derive `PartialEq` and `Eq` for `parquet::arrow::ProjectionMask`
[\#6329](https://github.com/apache/arrow-rs/issues/6329)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Allow converting empty `pyarrow.RecordBatch` to `arrow::RecordBatch`
[\#6318](https://github.com/apache/arrow-rs/issues/6318)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Parquet writer should not write any min/max data to ColumnIndex when all
values are null [\#6315](https://github.com/apache/arrow-rs/issues/6315)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Parquet: Add `union` method to `RowSelection`
[\#6307](https://github.com/apache/arrow-rs/issues/6307)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Support writing `UTC adjusted time` arrow array to parquet
[\#6277](https://github.com/apache/arrow-rs/issues/6277)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- A better way to resize the buffer for the snappy encode/decode
[\#6276](https://github.com/apache/arrow-rs/issues/6276)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- parquet\_derive: support reading selected columns from parquet file
[\#6268](https://github.com/apache/arrow-rs/issues/6268)
+- Tests for invalid parquet files
[\#6261](https://github.com/apache/arrow-rs/issues/6261)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Implement `date_part` for `Duration`
[\#6245](https://github.com/apache/arrow-rs/issues/6245)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Avoid unnecessary null buffer construction when converting arrays to a
different type [\#6243](https://github.com/apache/arrow-rs/issues/6243)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Add `parquet_opendal` in related projects
[\#6235](https://github.com/apache/arrow-rs/issues/6235)
+- Look into optimizing reading FixedSizeBinary arrays from parquet
[\#6219](https://github.com/apache/arrow-rs/issues/6219)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Add benchmarks for `BYTE_STREAM_SPLIT` encoded Parquet
`FIXED_LEN_BYTE_ARRAY` data
[\#6203](https://github.com/apache/arrow-rs/issues/6203)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Make it easy to write parquet to object\_store -- Implement
`AsyncFileWriter` for a type that implements `obj_store::MultipartUpload` for
`AsyncArrowWriter` [\#6200](https://github.com/apache/arrow-rs/issues/6200)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Remove test duplication in parquet statistics tets
[\#6185](https://github.com/apache/arrow-rs/issues/6185)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Support BinaryView Types in C Schema FFI
[\#6170](https://github.com/apache/arrow-rs/issues/6170)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- speedup take\_byte\_view kernel
[\#6167](https://github.com/apache/arrow-rs/issues/6167)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Add support for `StringView` and `BinaryView` statistics in
`StatisticsConverter` [\#6164](https://github.com/apache/arrow-rs/issues/6164)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Support casting `BinaryView` --\> `Utf8` and `LargeUtf8`
[\#6162](https://github.com/apache/arrow-rs/issues/6162)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Implement `filter` kernel specially for `FixedSizeByteArray`
[\#6153](https://github.com/apache/arrow-rs/issues/6153)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Use `LevelHistogram` throughout Parquet metadata
[\#6134](https://github.com/apache/arrow-rs/issues/6134)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Support DoPutStatementIngest from Arrow Flight SQL 17.0
[\#6124](https://github.com/apache/arrow-rs/issues/6124)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
+- ColumnMetaData should no longer be written inline with data
[\#6115](https://github.com/apache/arrow-rs/issues/6115)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Implement date\_part for `Interval`
[\#6113](https://github.com/apache/arrow-rs/issues/6113)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Implement `Into<Arc<dyn Array>>` for `ArrayData`
[\#6104](https://github.com/apache/arrow-rs/issues/6104)
+- Allow flushing or non-buffered writes from
`arrow::ipc::writer::StreamWriter`
[\#6099](https://github.com/apache/arrow-rs/issues/6099)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Default block\_size for `StringViewArray`
[\#6094](https://github.com/apache/arrow-rs/issues/6094)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Remove `Statistics::has_min_max_set` and `ValueStatistics::has_min_max_set`
and use `Option` instead
[\#6093](https://github.com/apache/arrow-rs/issues/6093)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Upgrade arrow-flight to tonic 0.12
[\#6072](https://github.com/apache/arrow-rs/issues/6072)
+- Improve speed of row converter by skipping utf8 checks
[\#6058](https://github.com/apache/arrow-rs/issues/6058)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Extend support for BYTE\_STREAM\_SPLIT to FIXED\_LEN\_BYTE\_ARRAY, INT32,
and INT64 primitive types
[\#6048](https://github.com/apache/arrow-rs/issues/6048)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Release arrow-rs / parquet minor version `52.2.0` \(August 2024\)
[\#5998](https://github.com/apache/arrow-rs/issues/5998)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
**Fixed bugs:**
-- Panic in `ParquetMetadata::memory_size` if no min/max set
[\#6091](https://github.com/apache/arrow-rs/issues/6091)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
-- BinaryViewArray doesn't roundtrip a single `Some(&[])` through parquet
[\#6086](https://github.com/apache/arrow-rs/issues/6086)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
-- Parquet `ColumnIndex` for null columns is written even when statistics are
disabled [\#6010](https://github.com/apache/arrow-rs/issues/6010)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Invalid `ColumnIndex` written in parquet
[\#6310](https://github.com/apache/arrow-rs/issues/6310)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- comparison\_kernels benchmarks panic
[\#6283](https://github.com/apache/arrow-rs/issues/6283)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Printing schema metadata includes possibly incorrect compression level
[\#6270](https://github.com/apache/arrow-rs/issues/6270)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Don't panic when creating `Field` from `FFI_ArrowSchema` with no name
[\#6251](https://github.com/apache/arrow-rs/issues/6251)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- lexsort\_to\_indices should not fallback to non-lexical sort if the datatype
is not supported [\#6226](https://github.com/apache/arrow-rs/issues/6226)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Parquet Statistics null\_count does not distinguish between `0` and not
specified [\#6215](https://github.com/apache/arrow-rs/issues/6215)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
+- Using a take kernel on a dense union can result in reaching "unreachable"
code [\#6206](https://github.com/apache/arrow-rs/issues/6206)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Adding sub day seconds to Date64 is ignored.
[\#6198](https://github.com/apache/arrow-rs/issues/6198)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- mismatch between parquet type `is_optional` codes and comment
[\#6191](https://github.com/apache/arrow-rs/issues/6191)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
**Documentation updates:**
-- Fix typo in GenericByteViewArray documentation
[\#6054](https://github.com/apache/arrow-rs/pull/6054)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([progval](https://github.com/progval))
-- Minor: Improve parquet PageIndex documentation
[\#6042](https://github.com/apache/arrow-rs/pull/6042)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- Minor: improve filter documentation
[\#6317](https://github.com/apache/arrow-rs/pull/6317)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Minor: Improve comments on GenericByteViewArray::bytes\_iter\(\),
prefix\_iter\(\) and suffix\_iter\(\)
[\#6306](https://github.com/apache/arrow-rs/pull/6306)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Minor: improve `RowFilter` and `ArrowPredicate` docs
[\#6301](https://github.com/apache/arrow-rs/pull/6301)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- Improve documentation for `MutableArrayData`
[\#6272](https://github.com/apache/arrow-rs/pull/6272)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Add examples to `StringViewBuilder` and `BinaryViewBuilder`
[\#6240](https://github.com/apache/arrow-rs/pull/6240)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- minor: enhance document for ParquetField
[\#6239](https://github.com/apache/arrow-rs/pull/6239)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([mapleFU](https://github.com/mapleFU))
+- Minor: Improve Type documentation
[\#6224](https://github.com/apache/arrow-rs/pull/6224)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Minor: Update `DateType::Date64` docs
[\#6223](https://github.com/apache/arrow-rs/pull/6223)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Add \(more\) Parquet Metadata Documentation
[\#6184](https://github.com/apache/arrow-rs/pull/6184)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- Add additional documentation and examples to `ArrayAccessor`
[\#6141](https://github.com/apache/arrow-rs/pull/6141)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Minor: improve comments in temporal.rs tests
[\#6140](https://github.com/apache/arrow-rs/pull/6140)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Minor: Update release schedule in README
[\#6125](https://github.com/apache/arrow-rs/pull/6125)
([alamb](https://github.com/alamb))
**Closed issues:**
-- Potential performance improvements for reading Parquet to
StringViewArray/BinaryViewArray
[\#5904](https://github.com/apache/arrow-rs/issues/5904)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
+- Simplify take octokit workflow
[\#6279](https://github.com/apache/arrow-rs/issues/6279)
+- Make the bearer token visible in FlightSqlServiceClient
[\#6253](https://github.com/apache/arrow-rs/issues/6253)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
+- Port `take` workflow to use `oktokit`
[\#6242](https://github.com/apache/arrow-rs/issues/6242)
+- Remove `SchemaBuilder` dependency from `StructArray` constructors
[\#6138](https://github.com/apache/arrow-rs/issues/6138)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
**Merged pull requests:**
-- Faster `GenericByteView` construction
[\#6102](https://github.com/apache/arrow-rs/pull/6102)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Add benchmark to track byte-view construction performance
[\#6101](https://github.com/apache/arrow-rs/pull/6101)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Optimize `bool_or` using `max_boolean`
[\#6100](https://github.com/apache/arrow-rs/pull/6100)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([simonvandel](https://github.com/simonvandel))
-- Optimize `max_boolean` by operating on u64 chunks
[\#6098](https://github.com/apache/arrow-rs/pull/6098)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([simonvandel](https://github.com/simonvandel))
-- fix panic in `ParquetMetadata::memory_size`: check has\_min\_max\_set before
invoking min\(\)/max\(\) [\#6092](https://github.com/apache/arrow-rs/pull/6092)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([Fischer0522](https://github.com/Fischer0522))
-- Implement specialized min/max for `GenericBinaryView` \(`StringView` and
`BinaryView`\) [\#6089](https://github.com/apache/arrow-rs/pull/6089)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Add PartialEq to ParquetMetaData and FileMetadata
[\#6082](https://github.com/apache/arrow-rs/pull/6082)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([adriangb](https://github.com/adriangb))
-- Enable casting from Utf8View
[\#6077](https://github.com/apache/arrow-rs/pull/6077)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([a10y](https://github.com/a10y))
-- StringView support in arrow-csv
[\#6062](https://github.com/apache/arrow-rs/pull/6062)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([2010YOUY01](https://github.com/2010YOUY01))
-- Implement min max support for string/binary view types
[\#6053](https://github.com/apache/arrow-rs/pull/6053)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Minor: clarify the relationship between `file::metadata` and `format` in
docs [\#6049](https://github.com/apache/arrow-rs/pull/6049)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
-- Minor API adjustments for StringViewBuilder
[\#6047](https://github.com/apache/arrow-rs/pull/6047)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Add parquet `StatisticsConverter` for arrow reader
[\#6046](https://github.com/apache/arrow-rs/pull/6046)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([efredine](https://github.com/efredine))
-- Directly decode String/BinaryView types from arrow-row format
[\#6044](https://github.com/apache/arrow-rs/pull/6044)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Clean up unused code for view types in offset buffer
[\#6040](https://github.com/apache/arrow-rs/pull/6040)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Avoid using Buffer api that accidentally copies data
[\#6039](https://github.com/apache/arrow-rs/pull/6039)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([XiangpengHao](https://github.com/XiangpengHao))
-- MINOR: Fix `hashbrown` version in `arrow-array`, remove from `arrow-row`
[\#6035](https://github.com/apache/arrow-rs/pull/6035)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([mbrobbel](https://github.com/mbrobbel))
-- Improve performance reading `ByteViewArray` from parquet by removing an
implicit copy [\#6031](https://github.com/apache/arrow-rs/pull/6031)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Add begin/end\_transaction methods in FlightSqlServiceClient
[\#6026](https://github.com/apache/arrow-rs/pull/6026)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
-- Unsafe improvements: core `parquet` crate.
[\#6024](https://github.com/apache/arrow-rs/pull/6024)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([veluca93](https://github.com/veluca93))
-- Additional tests for parquet reader utf8 validation
[\#6023](https://github.com/apache/arrow-rs/pull/6023)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
-- Update zstd-sys requirement from \>=2.0.0, \<2.0.12 to \>=2.0.0, \<2.0.13
[\#6019](https://github.com/apache/arrow-rs/pull/6019)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([dependabot[bot]](https://github.com/apps/dependabot))
-- fix doc ci in latest rust nightly version
[\#6012](https://github.com/apache/arrow-rs/pull/6012)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([Rachelint](https://github.com/Rachelint))
-- Do not write `ColumnIndex` for null columns when not writing page statistics
[\#6011](https://github.com/apache/arrow-rs/pull/6011)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
-- Fast utf8 validation when loading string view from parquet
[\#6009](https://github.com/apache/arrow-rs/pull/6009)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Deduplicate strings/binarys when building view types
[\#6005](https://github.com/apache/arrow-rs/pull/6005)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Complete `StringViewArray` and `BinaryViewArray` parquet decoder: implement
delta byte array and delta length byte array encoding
[\#6004](https://github.com/apache/arrow-rs/pull/6004)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
-- Add `FlightSqlServiceClient::new_from_inner`
[\#6003](https://github.com/apache/arrow-rs/pull/6003)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
-- Rename `Schema::all_fields` to `flattened_fields`
[\#6001](https://github.com/apache/arrow-rs/pull/6001)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([lewiszlw](https://github.com/lewiszlw))
-- Refine documentation and examples for `DataType`
[\#5997](https://github.com/apache/arrow-rs/pull/5997)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
-- implement `DataType::try_form(&str)`
[\#5994](https://github.com/apache/arrow-rs/pull/5994)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([samuelcolvin](https://github.com/samuelcolvin))
-- Implement dictionary support for reading ByteView from parquet
[\#5973](https://github.com/apache/arrow-rs/pull/5973)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Derive PartialEq and Eq for parquet::arrow::ProjectionMask
[\#6330](https://github.com/apache/arrow-rs/pull/6330)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([thinkharderdev](https://github.com/thinkharderdev))
+- Support zero column `RecordBatch`es in pyarrow integration \(use
RecordBatchOptions when converting a pyarrow RecordBatch\)
[\#6320](https://github.com/apache/arrow-rs/pull/6320)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([Michael-J-Ward](https://github.com/Michael-J-Ward))
+- Fix writing of invalid Parquet ColumnIndex when row group contains null
pages [\#6319](https://github.com/apache/arrow-rs/pull/6319)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([adriangb](https://github.com/adriangb))
+- Pass empty vectors as min/max for all null pages when building ColumnIndex
[\#6316](https://github.com/apache/arrow-rs/pull/6316)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Update tonic-build requirement from =0.12.0 to =0.12.2
[\#6314](https://github.com/apache/arrow-rs/pull/6314)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([dependabot[bot]](https://github.com/apps/dependabot))
+- Parquet: add `union` method to `RowSelection`
[\#6308](https://github.com/apache/arrow-rs/pull/6308)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([sdd](https://github.com/sdd))
+- Specialize filter for structs and sparse unions
[\#6304](https://github.com/apache/arrow-rs/pull/6304)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([gstvg](https://github.com/gstvg))
+- Err on `try_from_le_slice`
[\#6295](https://github.com/apache/arrow-rs/pull/6295)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([samuelcolvin](https://github.com/samuelcolvin))
+- fix reference in doctest to size\_of which is not imported by default
[\#6286](https://github.com/apache/arrow-rs/pull/6286)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([rtyler](https://github.com/rtyler))
+- Support writing UTC adjusted time arrays to parquet
[\#6278](https://github.com/apache/arrow-rs/pull/6278)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([aykut-bozkurt](https://github.com/aykut-bozkurt))
+- Minor: `pub use ByteView` in arrow and improve documentation
[\#6275](https://github.com/apache/arrow-rs/pull/6275)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Fix accessing name from ffi schema
[\#6273](https://github.com/apache/arrow-rs/pull/6273)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([kylebarron](https://github.com/kylebarron))
+- Do not print compression level in schema printer
[\#6271](https://github.com/apache/arrow-rs/pull/6271)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([ttencate](https://github.com/ttencate))
+- ci: use octokit to add assignee
[\#6267](https://github.com/apache/arrow-rs/pull/6267)
([dsgibbons](https://github.com/dsgibbons))
+- Add tests for bad parquet files
[\#6262](https://github.com/apache/arrow-rs/pull/6262)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- Add `Statistics::distinct_count_opt` and deprecate
`Statistics::distinct_count`
[\#6259](https://github.com/apache/arrow-rs/pull/6259)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- Minor: move `FallibleRequestStream` and `FallibleTonicResponseStream` to a
module [\#6258](https://github.com/apache/arrow-rs/pull/6258)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([alamb](https://github.com/alamb))
+- Make the bearer token visible in FlightSqlServiceClient
[\#6254](https://github.com/apache/arrow-rs/pull/6254)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([ccciudatu](https://github.com/ccciudatu))
+- Use `unary()` for array conversion in Parquet array readers, speed up
`Decimal128`, `Decimal256` and `Float16`
[\#6252](https://github.com/apache/arrow-rs/pull/6252)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([etseidl](https://github.com/etseidl))
+- Update tower requirement from 0.4.13 to 0.5.0
[\#6250](https://github.com/apache/arrow-rs/pull/6250)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([dependabot[bot]](https://github.com/apps/dependabot))
+- Implement date\_part for durations
[\#6246](https://github.com/apache/arrow-rs/pull/6246)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([nrc](https://github.com/nrc))
+- Remove unnecessary null buffer construction when converting arrays to a
different type [\#6244](https://github.com/apache/arrow-rs/pull/6244)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([etseidl](https://github.com/etseidl))
+- Implement PartialEq for GenericByteViewArray
[\#6241](https://github.com/apache/arrow-rs/pull/6241)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([alamb](https://github.com/alamb))
+- Minor: Remove non standard footer from LICENSE.txt / reference to Apache
Aurora [\#6237](https://github.com/apache/arrow-rs/pull/6237)
([alamb](https://github.com/alamb))
+- docs: Add parquet\_opendal in related projects
[\#6236](https://github.com/apache/arrow-rs/pull/6236)
([Xuanwo](https://github.com/Xuanwo))
+- Avoid infinite loop in bad parquet by checking the number of rep levels
[\#6232](https://github.com/apache/arrow-rs/pull/6232)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([jp0317](https://github.com/jp0317))
+- Specialize Prefix/Suffix Match for `Like/ILike` between Array and Scalar for
StringViewArray [\#6231](https://github.com/apache/arrow-rs/pull/6231)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([xinlifoobar](https://github.com/xinlifoobar))
+- fix: lexsort\_to\_indices should not fallback to non-lexical sort if the
datatype is not supported
[\#6225](https://github.com/apache/arrow-rs/pull/6225)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([viirya](https://github.com/viirya))
+- Modest improvement to FixedLenByteArray BYTE\_STREAM\_SPLIT arrow decoder
[\#6222](https://github.com/apache/arrow-rs/pull/6222)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Improve performance of `FixedLengthBinary` decoding
[\#6220](https://github.com/apache/arrow-rs/pull/6220)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Update documentation for Parquet BYTE\_STREAM\_SPLIT encoding
[\#6212](https://github.com/apache/arrow-rs/pull/6212)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Improve interval parsing
[\#6211](https://github.com/apache/arrow-rs/pull/6211)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([samuelcolvin](https://github.com/samuelcolvin))
+- minor: Suggest take on interleave docs
[\#6210](https://github.com/apache/arrow-rs/pull/6210)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([gstvg](https://github.com/gstvg))
+- fix: Correctly handle take on dense union of a single selected type
[\#6209](https://github.com/apache/arrow-rs/pull/6209)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([gstvg](https://github.com/gstvg))
+- Add time dictionary coercions
[\#6208](https://github.com/apache/arrow-rs/pull/6208)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([adriangb](https://github.com/adriangb))
+- fix\(arrow\): restrict the range of temporal values produced via `data_gen`
[\#6205](https://github.com/apache/arrow-rs/pull/6205)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([kyle-mccarthy](https://github.com/kyle-mccarthy))
+- Add benchmarks for `BYTE_STREAM_SPLIT` encoded Parquet
`FIXED_LEN_BYTE_ARRAY` data
[\#6204](https://github.com/apache/arrow-rs/pull/6204)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Move `ParquetMetadataWriter` to its own module, update documentation
[\#6202](https://github.com/apache/arrow-rs/pull/6202)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- Add `ThriftMetadataWriter` for writing Parquet metadata
[\#6197](https://github.com/apache/arrow-rs/pull/6197)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([adriangb](https://github.com/adriangb))
+- Update zstd-sys requirement from \>=2.0.0, \<2.0.13 to \>=2.0.0, \<2.0.14
[\#6196](https://github.com/apache/arrow-rs/pull/6196)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([dependabot[bot]](https://github.com/apps/dependabot))
+- fix parquet type `is_optional` comments
[\#6192](https://github.com/apache/arrow-rs/pull/6192)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([jp0317](https://github.com/jp0317))
+- Remove duplicated statistics tests in parquet
[\#6190](https://github.com/apache/arrow-rs/pull/6190)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([Kev1n8](https://github.com/Kev1n8))
+- Benchmarks for `bool_and`
[\#6189](https://github.com/apache/arrow-rs/pull/6189)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([simonvandel](https://github.com/simonvandel))
+- Fix typo in documentation of Float64Array
[\#6188](https://github.com/apache/arrow-rs/pull/6188)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([mesejo](https://github.com/mesejo))
+- Make it clear that `StatisticsConverter` can not panic
[\#6187](https://github.com/apache/arrow-rs/pull/6187)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([alamb](https://github.com/alamb))
+- add filter benchmark for `FixedSizeBinaryArray`
[\#6186](https://github.com/apache/arrow-rs/pull/6186)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([chloro-pn](https://github.com/chloro-pn))
+- Update sysinfo requirement from 0.30.12 to 0.31.2
[\#6182](https://github.com/apache/arrow-rs/pull/6182)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([dependabot[bot]](https://github.com/apps/dependabot))
+- Add support for `StringView` and `BinaryView` statistics in
`StatisticsConverter` [\#6181](https://github.com/apache/arrow-rs/pull/6181)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([Kev1n8](https://github.com/Kev1n8))
+- Support casting between BinaryView \<--\> Utf8 and LargeUtf8
[\#6180](https://github.com/apache/arrow-rs/pull/6180)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([xinlifoobar](https://github.com/xinlifoobar))
+- Implement specialized filter kernel for `FixedSizeByteArray`
[\#6178](https://github.com/apache/arrow-rs/pull/6178)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([chloro-pn](https://github.com/chloro-pn))
+- Support `StringView` and `BinaryView` in CDataInterface
[\#6171](https://github.com/apache/arrow-rs/pull/6171)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([a10y](https://github.com/a10y))
+- Optimize `take` kernel for `BinaryViewArray` and `StringViewArray`
[\#6168](https://github.com/apache/arrow-rs/pull/6168)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([a10y](https://github.com/a10y))
+- Support Parquet `BYTE_STREAM_SPLIT` for INT32, INT64, and
FIXED\_LEN\_BYTE\_ARRAY primitive types
[\#6159](https://github.com/apache/arrow-rs/pull/6159)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Fix comparison kernel benchmarks
[\#6147](https://github.com/apache/arrow-rs/pull/6147)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([samuelcolvin](https://github.com/samuelcolvin))
+- improve `LIKE` regex performance up to 12x
[\#6145](https://github.com/apache/arrow-rs/pull/6145)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([samuelcolvin](https://github.com/samuelcolvin))
+- Optimize `min_boolean` and `bool_and`
[\#6144](https://github.com/apache/arrow-rs/pull/6144)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([simonvandel](https://github.com/simonvandel))
+- Reduce bounds check in `RowIter`, add `unsafe Rows::row_unchecked`
[\#6142](https://github.com/apache/arrow-rs/pull/6142)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Minor: Simplify `StructArray` constructors
[\#6139](https://github.com/apache/arrow-rs/pull/6139)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([Rafferty97](https://github.com/Rafferty97))
+- Implement exponential block size growing strategy for `StringViewBuilder`
[\#6136](https://github.com/apache/arrow-rs/pull/6136)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([XiangpengHao](https://github.com/XiangpengHao))
+- Use `LevelHistogram` in `PageIndex`
[\#6135](https://github.com/apache/arrow-rs/pull/6135)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Add ArrowError::ArithmeticError
[\#6130](https://github.com/apache/arrow-rs/pull/6130)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([andygrove](https://github.com/andygrove))
+- Improve `LIKE` performance for "contains" style queries
[\#6128](https://github.com/apache/arrow-rs/pull/6128)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([samuelcolvin](https://github.com/samuelcolvin))
+- Add `BooleanArray::new_from_packed` and `BooleanArray::new_from_u8`
[\#6127](https://github.com/apache/arrow-rs/pull/6127)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([chloro-pn](https://github.com/chloro-pn))
+- improvements to `(i)starts_with` and `(i)ends_with` performance
[\#6118](https://github.com/apache/arrow-rs/pull/6118)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([samuelcolvin](https://github.com/samuelcolvin))
+- Fix Clippy for the Rust 1.80 release
[\#6116](https://github.com/apache/arrow-rs/pull/6116)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
[[arrow-flight](https://github.com/apache/arrow-rs/labels/arrow-flight)]
([alamb](https://github.com/alamb))
+- added a flush method to IPC writers
[\#6108](https://github.com/apache/arrow-rs/pull/6108)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([V0ldek](https://github.com/V0ldek))
+- Add support for level histograms added in PARQUET-2261 to `ParquetMetaData`
[\#6105](https://github.com/apache/arrow-rs/pull/6105)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([etseidl](https://github.com/etseidl))
+- Implement date\_part for intervals
[\#6071](https://github.com/apache/arrow-rs/pull/6071)
[[arrow](https://github.com/apache/arrow-rs/labels/arrow)]
([nrc](https://github.com/nrc))
+- feat\(parquet\): Implement AsyncFileWriter for
`object_store::buffered::BufWriter`
[\#6013](https://github.com/apache/arrow-rs/pull/6013)
[[parquet](https://github.com/apache/arrow-rs/labels/parquet)]
([Xuanwo](https://github.com/Xuanwo))
diff --git a/Cargo.toml b/Cargo.toml
index bb7038a34b..3b274d5834 100644
--- a/Cargo.toml
+++ b/Cargo.toml
@@ -62,7 +62,7 @@ exclude = [
]
[workspace.package]
-version = "52.2.0"
+version = "53.0.0"
homepage = "https://github.com/apache/arrow-rs"
repository = "https://github.com/apache/arrow-rs"
authors = ["Apache Arrow <[email protected]>"]
@@ -77,20 +77,20 @@ edition = "2021"
rust-version = "1.62"
[workspace.dependencies]
-arrow = { version = "52.2.0", path = "./arrow", default-features = false }
-arrow-arith = { version = "52.2.0", path = "./arrow-arith" }
-arrow-array = { version = "52.2.0", path = "./arrow-array" }
-arrow-buffer = { version = "52.2.0", path = "./arrow-buffer" }
-arrow-cast = { version = "52.2.0", path = "./arrow-cast" }
-arrow-csv = { version = "52.2.0", path = "./arrow-csv" }
-arrow-data = { version = "52.2.0", path = "./arrow-data" }
-arrow-ipc = { version = "52.2.0", path = "./arrow-ipc" }
-arrow-json = { version = "52.2.0", path = "./arrow-json" }
-arrow-ord = { version = "52.2.0", path = "./arrow-ord" }
-arrow-row = { version = "52.2.0", path = "./arrow-row" }
-arrow-schema = { version = "52.2.0", path = "./arrow-schema" }
-arrow-select = { version = "52.2.0", path = "./arrow-select" }
-arrow-string = { version = "52.2.0", path = "./arrow-string" }
-parquet = { version = "52.2.0", path = "./parquet", default-features = false }
+arrow = { version = "53.0.0", path = "./arrow", default-features = false }
+arrow-arith = { version = "53.0.0", path = "./arrow-arith" }
+arrow-array = { version = "53.0.0", path = "./arrow-array" }
+arrow-buffer = { version = "53.0.0", path = "./arrow-buffer" }
+arrow-cast = { version = "53.0.0", path = "./arrow-cast" }
+arrow-csv = { version = "53.0.0", path = "./arrow-csv" }
+arrow-data = { version = "53.0.0", path = "./arrow-data" }
+arrow-ipc = { version = "53.0.0", path = "./arrow-ipc" }
+arrow-json = { version = "53.0.0", path = "./arrow-json" }
+arrow-ord = { version = "53.0.0", path = "./arrow-ord" }
+arrow-row = { version = "53.0.0", path = "./arrow-row" }
+arrow-schema = { version = "53.0.0", path = "./arrow-schema" }
+arrow-select = { version = "53.0.0", path = "./arrow-select" }
+arrow-string = { version = "53.0.0", path = "./arrow-string" }
+parquet = { version = "53.0.0", path = "./parquet", default-features = false }
chrono = { version = "0.4.34", default-features = false, features = ["clock"] }
diff --git a/dev/release/update_change_log.sh b/dev/release/update_change_log.sh
index 20d7e60aa4..5c4fcb53ad 100755
--- a/dev/release/update_change_log.sh
+++ b/dev/release/update_change_log.sh
@@ -29,8 +29,8 @@
set -e
-SINCE_TAG="52.1.0"
-FUTURE_RELEASE="52.2.0"
+SINCE_TAG="52.2.0"
+FUTURE_RELEASE="53.0.0"
SOURCE_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
SOURCE_TOP_DIR="$(cd "${SOURCE_DIR}/../../" && pwd)"