AlenkaF commented on code in PR #418:
URL: https://github.com/apache/arrow-site/pull/418#discussion_r1360226790


##########
_posts/2023-10-11-14.0.0-release.md:
##########
@@ -0,0 +1,132 @@
+---
+layout: post
+title: "Apache Arrow 14.0.0 Release"
+date: "2023-10-11 00:00:00"
+author: pmc
+categories: [release]
+---
+<!--
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements.  See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License.  You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+
+The Apache Arrow team is pleased to announce the 14.0.0 release. This covers
+over 3 months of development work and includes [**XXX resolved issues**][1]
+from [**YYY distinct contributors**][2]. See the [Install 
Page](https://arrow.apache.org/install/)
+to learn how to get the libraries for your platform.
+
+The release notes below are not exhaustive and only expose selected highlights
+of the release. Many other bugfixes and improvements have been made: we refer
+you to the [complete changelog][3].
+
+## Community
+
+Since the 13.0.0 release, Metehan Yildirim and Oleks V. have been invited to 
be committers.
+
+Thanks for your contributions and participation in the project!
+
+## Columnar Format Notes
+
+A `VariableShapeTensorType` was added to the Arrow specification as a 
canonical extension type. 
([GH-24868](https://github.com/apache/arrow/issues/24868)).
+
+Motivated by recent innovations in DuckDB and Meta's Velox engine, new "view" 
data types were added to the Arrow columnar format spec. 
+
+* 16-byte StringView and BinaryView data type which enables better buffer 
reuse, faster "false" string comparisons (due to maintaining a prefix) and 
short string inlining. 
([GH-35627](https://github.com/apache/arrow/issues/35627)).
+* ListView and LargeListView types for more performant "out-of-order" building 
and processing of lists and better buffer reuse 
([GH-37876](https://github.com/apache/arrow/issues/37876)).
+
+## Arrow Flight RPC notes
+
+A new RPC method was added to allow polling for completion in long-running 
queries as an alternative to the blocking GetFlightInfo call 
([GH-36155](https://github.com/apache/arrow/issues/36155)). Also, 
`app_metadata` was added to `FlightInfo` and `FlightEndpoint` 
([GH-37635](https://github.com/apache/arrow/issues/37635)).
+
+In C++ and Python, an experimental asynchronous GetFlightInfo call was added 
to the client-side API 
([GH-36512](https://github.com/apache/arrow/issues/36512)). `ServerCallContext` 
now exposes conveniences to send headers/trailers without having to use 
middleware ([GH-36952](https://github.com/apache/arrow/issues/36952)). The 
implementation was fixed to not reject unknown field tags to enable 
interoperability with future versions of Flight that could add new fields 
([GH-36975](https://github.com/apache/arrow/issues/36975)). The CMake 
configuration was fixed to correctly require linking to Arrow Flight RPC when 
using Arrow Flight SQL 
([GH-37406](https://github.com/apache/arrow/issues/37406)). 
+
+In Go, the underlying generated Protobuf code is now exposed for easier 
low-level integrations with Flight 
([GH-36893](https://github.com/apache/arrow/issues/36893)). 
+
+In Java, the stateful "login" authentication APIs using the Handshake RPC are 
deprecated; it will not be removed, but it should not be used unless you 
specifically want the old behavior 
([GH-37722](https://github.com/apache/arrow/issues/37722)). Utilities were 
added to help implement basic Flight SQL services for unit testing 
([GH-37795](https://github.com/apache/arrow/issues/37795)).
+
+## C++ notes
+
+## C# notes
+
+## Go notes
+
+## Java notes
+
+Java 21 is enabled and validated in CI 
([GH-37914](https://github.com/apache/arrow/issues/37914)).
+
+The Gandiva module implemented a breaking change by moving `Types.proto` into 
a subfolder ([GH-37893](https://github.com/apache/arrow/issues/37893)).
+
+`DefaultVectorComparators` added support for `LargeVarCharVector`, 
`LargeVarBinaryVector` 
([GH-25659](https://github.com/apache/arrow/issues/25659)) and for `BitVector`, 
`DateDayVector`, `DateMilliVector`
+`Decimal256Vector`, `DecimalVector`, `DurationVector`, `IntervalDayVector`, 
`TimeMicroVector`, `TimeMilliVector`, `TimeNanoVector`, `TimeSecVector`, 
`TimeStampVector` ([GH-37701](https://github.com/apache/arrow/issues/37701)).
+
+A bug was fixed in `VectorAppender` to prevent resizing the data buffer twice 
when appending variable-length vectors 
([GH-37829](https://github.com/apache/arrow/issues/37829)).
+
+`VarCharWriter` added support for writing from `Text` and `String` 
([GH-37706](https://github.com/apache/arrow/issues/37706)). `VarBinaryWriter` 
added support for writing from `byte[]` and `ByteBuffer` 
([GH-37705](https://github.com/apache/arrow/issues/37705)).
+
+The JDBC driver will now ignore username and password authentication if a 
token is provided ([GH-37073](https://github.com/apache/arrow/issues/37073)).
+
+A bug was fixed in the Java C-Data interface when importing a vector with an 
empty array ([GH-37056](https://github.com/apache/arrow/issues/37056)).
+
+A bug was fixed in the S3 file system implementation when closing the 
connection ([GH-36069](https://github.com/apache/arrow/issues/36069)).
+
+Arrow datasets now support Substrait `ExtendedExpression`s as inputs to filter 
and project operations 
([GH-34252](https://github.com/apache/arrow/issues/34252)).
+
+## JavaScript notes
+
+* GH-21815: [JS] Add support for Duration type #37341
+* GH-31621: [JS] Fix Union null bitmaps #37122
+
+## Python notes

Review Comment:
   ```suggestion
   ## Python notes
   
   Compatibility notes:
   * Support for Python 3.12 was added 
[GH-37880](https://github.com/apache/arrow/issues/37880)
   * Support for Cython 3 was added 
[GH-37742](https://github.com/apache/arrow/issues/37742)
   * PyArrow is now compatible with numpy 2.0 
[GH-37574](https://github.com/apache/arrow/issues/37574)
   * `pyarrow.compute.CumulativeSumOptions` has been deprecated, use 
`pyarrow.compute.CumulativeOptions` instead 
[GH-36240](https://github.com/apache/arrow/issues/36240)
   
   New features:
   * Allow type promotion added on `pyarrow.concat_tables` 
[GH-36845](https://github.com/apache/arrow/issues/36845)
   * Support for vector function UDF was added 
[GH-36672](https://github.com/apache/arrow/issues/36672)
   
   Other improvements:
   * `pyarrow.MapScalar.as_py`can now be called with custom field name 
[GH-36809](https://github.com/apache/arrow/issues/36809)
   * The default of `pre_buffer` is now set to `True` for reading Parquet when 
using `pyarrow.dataset` directly. This can give significant speed-up on 
filesystems like S3 and is now aligned to `pyarrow.parquet.read_table` 
interface [GH-36765](https://github.com/apache/arrow/issues/36765)
   * Path to timezone database can now be set through python API 
([GH-35600](https://github.com/apache/arrow/issues/35600), [GH-38145] 
(https://github.com/apache/arrow/issues/38145))
   
   Relevant bug fixes:
   * String to date cast kernel was added to fix python scalar cast regression 
[GH-37411](https://github.com/apache/arrow/issues/37411)
   * Fix conversion from Python to Arrow when chunking large nested structs 
[GH-32439](https://github.com/apache/arrow/issues/32439)
   * Fix segfault when passing table as argument to `pyarrow.Table.filter` 
[GH-37650](https://github.com/apache/arrow/issues/37650)
   * `use_threads` keyword was added to the `group_by` method on 
`pyarrow.Table` which gets passed through to the 
`pyarrow.acero.Declaration.to_table` call. Specifing `use_threads=False`allows 
to get stable ordering of the output 
[GH-36709](https://github.com/apache/arrow/issues/36709)
   * Fix printable representation for `pyarrow.TimestampScalar` when values are 
outside datetime range [GH-36323](https://github.com/apache/arrow/issues/36323)
   * Empty dataframes with zero chunks can now be consumed by the Dataframe 
Interchange Protocol implementation 
[GH-37050](https://github.com/apache/arrow/issues/37050) 
   * Fix dtype information for categorical columns in the Dataframe Interchange 
Protocol implementation [GH-38034](https://github.com/apache/arrow/issues/38034)
   * Boolean columns with bitsize 1 are now supported in `from_dataframe`of the 
Dataframe Interchange Protocol 
[GH-37145](https://github.com/apache/arrow/issues/37145)
   
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to