AlenkaF commented on code in PR #418:
URL: https://github.com/apache/arrow-site/pull/418#discussion_r1360226790
##########
_posts/2023-10-11-14.0.0-release.md:
##########
@@ -0,0 +1,132 @@
+---
+layout: post
+title: "Apache Arrow 14.0.0 Release"
+date: "2023-10-11 00:00:00"
+author: pmc
+categories: [release]
+---
+<!--
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+
+The Apache Arrow team is pleased to announce the 14.0.0 release. This covers
+over 3 months of development work and includes [**XXX resolved issues**][1]
+from [**YYY distinct contributors**][2]. See the [Install
Page](https://arrow.apache.org/install/)
+to learn how to get the libraries for your platform.
+
+The release notes below are not exhaustive and only expose selected highlights
+of the release. Many other bugfixes and improvements have been made: we refer
+you to the [complete changelog][3].
+
+## Community
+
+Since the 13.0.0 release, Metehan Yildirim and Oleks V. have been invited to
be committers.
+
+Thanks for your contributions and participation in the project!
+
+## Columnar Format Notes
+
+A `VariableShapeTensorType` was added to the Arrow specification as a
canonical extension type.
([GH-24868](https://github.com/apache/arrow/issues/24868)).
+
+Motivated by recent innovations in DuckDB and Meta's Velox engine, new "view"
data types were added to the Arrow columnar format spec.
+
+* 16-byte StringView and BinaryView data type which enables better buffer
reuse, faster "false" string comparisons (due to maintaining a prefix) and
short string inlining.
([GH-35627](https://github.com/apache/arrow/issues/35627)).
+* ListView and LargeListView types for more performant "out-of-order" building
and processing of lists and better buffer reuse
([GH-37876](https://github.com/apache/arrow/issues/37876)).
+
+## Arrow Flight RPC notes
+
+A new RPC method was added to allow polling for completion in long-running
queries as an alternative to the blocking GetFlightInfo call
([GH-36155](https://github.com/apache/arrow/issues/36155)). Also,
`app_metadata` was added to `FlightInfo` and `FlightEndpoint`
([GH-37635](https://github.com/apache/arrow/issues/37635)).
+
+In C++ and Python, an experimental asynchronous GetFlightInfo call was added
to the client-side API
([GH-36512](https://github.com/apache/arrow/issues/36512)). `ServerCallContext`
now exposes conveniences to send headers/trailers without having to use
middleware ([GH-36952](https://github.com/apache/arrow/issues/36952)). The
implementation was fixed to not reject unknown field tags to enable
interoperability with future versions of Flight that could add new fields
([GH-36975](https://github.com/apache/arrow/issues/36975)). The CMake
configuration was fixed to correctly require linking to Arrow Flight RPC when
using Arrow Flight SQL
([GH-37406](https://github.com/apache/arrow/issues/37406)).
+
+In Go, the underlying generated Protobuf code is now exposed for easier
low-level integrations with Flight
([GH-36893](https://github.com/apache/arrow/issues/36893)).
+
+In Java, the stateful "login" authentication APIs using the Handshake RPC are
deprecated; it will not be removed, but it should not be used unless you
specifically want the old behavior
([GH-37722](https://github.com/apache/arrow/issues/37722)). Utilities were
added to help implement basic Flight SQL services for unit testing
([GH-37795](https://github.com/apache/arrow/issues/37795)).
+
+## C++ notes
+
+## C# notes
+
+## Go notes
+
+## Java notes
+
+Java 21 is enabled and validated in CI
([GH-37914](https://github.com/apache/arrow/issues/37914)).
+
+The Gandiva module implemented a breaking change by moving `Types.proto` into
a subfolder ([GH-37893](https://github.com/apache/arrow/issues/37893)).
+
+`DefaultVectorComparators` added support for `LargeVarCharVector`,
`LargeVarBinaryVector`
([GH-25659](https://github.com/apache/arrow/issues/25659)) and for `BitVector`,
`DateDayVector`, `DateMilliVector`
+`Decimal256Vector`, `DecimalVector`, `DurationVector`, `IntervalDayVector`,
`TimeMicroVector`, `TimeMilliVector`, `TimeNanoVector`, `TimeSecVector`,
`TimeStampVector` ([GH-37701](https://github.com/apache/arrow/issues/37701)).
+
+A bug was fixed in `VectorAppender` to prevent resizing the data buffer twice
when appending variable-length vectors
([GH-37829](https://github.com/apache/arrow/issues/37829)).
+
+`VarCharWriter` added support for writing from `Text` and `String`
([GH-37706](https://github.com/apache/arrow/issues/37706)). `VarBinaryWriter`
added support for writing from `byte[]` and `ByteBuffer`
([GH-37705](https://github.com/apache/arrow/issues/37705)).
+
+The JDBC driver will now ignore username and password authentication if a
token is provided ([GH-37073](https://github.com/apache/arrow/issues/37073)).
+
+A bug was fixed in the Java C-Data interface when importing a vector with an
empty array ([GH-37056](https://github.com/apache/arrow/issues/37056)).
+
+A bug was fixed in the S3 file system implementation when closing the
connection ([GH-36069](https://github.com/apache/arrow/issues/36069)).
+
+Arrow datasets now support Substrait `ExtendedExpression`s as inputs to filter
and project operations
([GH-34252](https://github.com/apache/arrow/issues/34252)).
+
+## JavaScript notes
+
+* GH-21815: [JS] Add support for Duration type #37341
+* GH-31621: [JS] Fix Union null bitmaps #37122
+
+## Python notes
Review Comment:
```suggestion
## Python notes
Compatibility notes:
* Support for Python 3.12 was added
[GH-37880](https://github.com/apache/arrow/issues/37880)
* Support for Cython 3 was added
[GH-37742](https://github.com/apache/arrow/issues/37742)
* PyArrow is now compatible with numpy 2.0
[GH-37574](https://github.com/apache/arrow/issues/37574)
* `pyarrow.compute.CumulativeSumOptions` has been deprecated, use
`pyarrow.compute.CumulativeOptions` instead
[GH-36240](https://github.com/apache/arrow/issues/36240)
New features:
* Allow type promotion added on `pyarrow.concat_tables`
[GH-36845](https://github.com/apache/arrow/issues/36845)
* Support for vector function UDF was added
[GH-36672](https://github.com/apache/arrow/issues/36672)
Other improvements:
* `pyarrow.MapScalar.as_py`can now be called with custom field name
[GH-36809](https://github.com/apache/arrow/issues/36809)
* The default of `pre_buffer` is now set to `True` for reading Parquet when
using `pyarrow.dataset` directly. This can give significant speed-up on
filesystems like S3 and is now aligned to `pyarrow.parquet.read_table`
interface [GH-36765](https://github.com/apache/arrow/issues/36765)
* Path to timezone database can now be set through python API
([GH-35600](https://github.com/apache/arrow/issues/35600), [GH-38145]
(https://github.com/apache/arrow/issues/38145))
Relevant bug fixes:
* String to date cast kernel was added to fix python scalar cast regression
[GH-37411](https://github.com/apache/arrow/issues/37411)
* Fix conversion from Python to Arrow when chunking large nested structs
[GH-32439](https://github.com/apache/arrow/issues/32439)
* Fix segfault when passing table as argument to `pyarrow.Table.filter`
[GH-37650](https://github.com/apache/arrow/issues/37650)
* `use_threads` keyword was added to the `group_by` method on
`pyarrow.Table` which gets passed through to the
`pyarrow.acero.Declaration.to_table` call. Specifing `use_threads=False`allows
to get stable ordering of the output
[GH-36709](https://github.com/apache/arrow/issues/36709)
* Fix printable representation for `pyarrow.TimestampScalar` when values are
outside datetime range [GH-36323](https://github.com/apache/arrow/issues/36323)
* Empty dataframes with zero chunks can now be consumed by the Dataframe
Interchange Protocol implementation
[GH-37050](https://github.com/apache/arrow/issues/37050)
* Fix dtype information for categorical columns in the Dataframe Interchange
Protocol implementation [GH-38034](https://github.com/apache/arrow/issues/38034)
* Boolean columns with bitsize 1 are now supported in `from_dataframe`of the
Dataframe Interchange Protocol
[GH-37145](https://github.com/apache/arrow/issues/37145)
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]