pitrou commented on code in PR #418:
URL: https://github.com/apache/arrow-site/pull/418#discussion_r1360306746
##########
_posts/2023-10-11-14.0.0-release.md:
##########
@@ -0,0 +1,132 @@
+---
+layout: post
+title: "Apache Arrow 14.0.0 Release"
+date: "2023-10-11 00:00:00"
+author: pmc
+categories: [release]
+---
+<!--
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+
+The Apache Arrow team is pleased to announce the 14.0.0 release. This covers
+over 3 months of development work and includes [**XXX resolved issues**][1]
+from [**YYY distinct contributors**][2]. See the [Install
Page](https://arrow.apache.org/install/)
+to learn how to get the libraries for your platform.
+
+The release notes below are not exhaustive and only expose selected highlights
+of the release. Many other bugfixes and improvements have been made: we refer
+you to the [complete changelog][3].
+
+## Community
+
+Since the 13.0.0 release, Metehan Yildirim and Oleks V. have been invited to
be committers.
+
+Thanks for your contributions and participation in the project!
+
+## Columnar Format Notes
+
+A `VariableShapeTensorType` was added to the Arrow specification as a
canonical extension type
([GH-24868](https://github.com/apache/arrow/issues/24868)).
+
+Motivated by recent innovations in DuckDB and Meta's Velox engine, new "view"
data types were added to the Arrow columnar format spec.
+
+* 16-byte StringView and BinaryView data types, which enable better buffer
reuse, faster "false" string comparisons (due to maintaining a prefix) and
short string inlining
([GH-35627](https://github.com/apache/arrow/issues/35627)).
+* ListView and LargeListView types for more performant "out-of-order" building
and processing of lists and better buffer reuse
([GH-37876](https://github.com/apache/arrow/issues/37876)).
+
+## Arrow Flight RPC notes
+
+A new RPC method was added to allow polling for completion in long-running
queries as an alternative to the blocking GetFlightInfo call
([GH-36155](https://github.com/apache/arrow/issues/36155)). Also,
`app_metadata` was added to `FlightInfo` and `FlightEndpoint`
([GH-37635](https://github.com/apache/arrow/issues/37635)).
+
+In C++ and Python, an experimental asynchronous GetFlightInfo call was added
to the client-side API
([GH-36512](https://github.com/apache/arrow/issues/36512)). `ServerCallContext`
now exposes conveniences to send headers/trailers without having to use
middleware ([GH-36952](https://github.com/apache/arrow/issues/36952)). The
implementation was fixed to not reject unknown field tags to enable
interoperability with future versions of Flight that could add new fields
([GH-36975](https://github.com/apache/arrow/issues/36975)). The CMake
configuration was fixed to correctly require linking to Arrow Flight RPC when
using Arrow Flight SQL
([GH-37406](https://github.com/apache/arrow/issues/37406)).
+
+In Go, the underlying generated Protobuf code is now exposed for easier
low-level integrations with Flight
([GH-36893](https://github.com/apache/arrow/issues/36893)).
+
+In Java, the stateful "login" authentication APIs using the Handshake RPC are
deprecated; they will not be removed, but they should not be used unless you
specifically want the old behavior
([GH-37722](https://github.com/apache/arrow/issues/37722)). Utilities were
added to help implement basic Flight SQL services for unit testing
([GH-37795](https://github.com/apache/arrow/issues/37795)).
+
+## C++ notes
Review Comment:
```suggestion
## C++ notes
Experimental APIs for exporting and importing non-CPU arrays using the C
Device Data Interface
have been added (GH-36488), together with an experimental API for device
synchronization
(GH-36103).
Initial compatibility with Emscripten without threading support has been
added (GH-35176).
### Compute layer
New compute functions:
* a `cumulative_mean` function on numeric data (GH-36931).
Improved compute functions:
* rounding functions now work natively on integer inputs instead of casting
them to floats (GH-35273);
* the `divide` function now supports duration inputs (GH-36789);
* `take` and `filter` now support sparse unions in addition to dense unions
(GH-36905);
* `if_else`, `coalesce`, `choose` and `case_when` now support duration
inputs (GH-37028);
* casting between fixed-size lists and variable-size lists is now supported
(GH-20086);
* casting from strings to dates is now supported (GH-37411);
* `mean` on integer inputs now uses a floating-point representation for its
intermediate sum,
avoiding integer overflow on large inputs (GH-34909).
### Datasets
Support for writing encrypted Parquet datasets has been added (GH-29238).
### Gandiva
Gandiva now supports linking dynamically to LLVM on non-Windows platforms
(GH-37410).
Previously, Gandiva would always link LLVM statically into `libgandiva`.
### Parquet
RLE is used by default when encoding boolean values if v2 data pages are
enabled
(GH-36882).
Page indexes can now be encrypted as per the specification (GH-34950).
A bug in the DELTA_BINARY_PACKED encoder leading to suboptimal column sizes
was fixed (GH-37939).
### Substrait
It is now possible to serialize and deserialize individual expressions using
Substrait,
not only full query plans (GH-33985).
### Miscellaneous
A new `CodecOptions` class allows customizing compression parameters
per-codec (GH-35287).
The environment variable `AWS_ENDPOINT_URL` is now respected when resolving
S3 URIs (GH-36770).
Recursively listing S3 filesystem trees should now issue fewer requests,
leading to improved performance (GH-34213).
Comparing a `ChunkedArray` to itself now behaves correctly with NaN values
(GH-37515).
The use of BMI2 instructions on x86 was incorrectly guarded: those
instructions
could be executed on platforms without BMI2 support, leading to crashes.
This has been fixed (GH-37017).
```
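For readers wondering why the `mean` item in the suggestion matters, here is a minimal pure-Python sketch of the idea. It is not Arrow's C++ implementation: the helper names (`wrap_int64`, `mean_with_int64_sum`, `mean_with_float_sum`) are hypothetical, and the wrapping 64-bit accumulator is only an assumption used to make overflow visible in Python, where integers are otherwise unbounded.

```python
INT64_MIN = -(2**63)
INT64_MAX = 2**63 - 1


def wrap_int64(value):
    """Wrap a Python int into the signed 64-bit range (two's-complement style).

    Assumption for illustration only: Python ints never overflow, so the
    wraparound is emulated by hand.
    """
    return (value - INT64_MIN) % 2**64 + INT64_MIN


def mean_with_int64_sum(values):
    """Mean computed with an emulated 64-bit integer intermediate sum."""
    total = 0
    for v in values:
        total = wrap_int64(total + v)
    return total / len(values)


def mean_with_float_sum(values):
    """Mean computed with a floating-point intermediate sum."""
    total = 0.0
    for v in values:
        total += float(v)
    return total / len(values)


values = [INT64_MAX, INT64_MAX]
wrapped = mean_with_int64_sum(values)  # intermediate sum wraps around
floated = mean_with_float_sum(values)  # intermediate sum stays in range

print(wrapped, floated)
```

With two very large inputs, the integer accumulator wraps and yields a meaningless negative mean, while the floating-point accumulator stays close to the true value at the cost of some precision in the low-order bits.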
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.