mrkn commented on a change in pull request #178:
URL: https://github.com/apache/arrow-site/pull/178#discussion_r790217861



##########
File path: _posts/2022-01-19-7.0.0-release.md
##########
@@ -0,0 +1,227 @@
+---
+layout: post
+title: "Apache Arrow 7.0.0 Release"
+date: "2022-01-19 00:00:00 -0600"
+author: pmc
+categories: [release]
+---
+<!--
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements.  See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License.  You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+
+The Apache Arrow team is pleased to announce the 7.0.0 release. This covers
+over 3 months of development work and includes [**474 resolved issues**][1]
+from [**?? distinct contributors**][2]. See the Install Page to learn how to
+get the libraries for your platform.
+
+The release notes below are not exhaustive and only expose selected highlights
+of the release. Many other bugfixes and improvements have been made: we refer
+you to the [complete changelog][3].
+
+## Community
+
+Since the 6.0.1 release, Rémi Dattai and Alessandro Molina have been invited 
to be committers.
+Daniël Heres and Yibo Cai have joined the Project Management Committee (PMC).
+Thanks for your contributions and participation in the project!
+
+## Columnar Format Notes
+
+TBD
+
+## Arrow Flight RPC notes
+
+TBD
+
+## C++ notes
+
+A set of CMake presets has been added to ease building Arrow in a number
+of cases (ARROW-14678, ARROW-14714).
+
+The `arrow::BitUtil` namespace has been renamed to `arrow::bit_util`
+(ARROW-13494).
+
+Concatenation of union arrays is now supported (ARROW-4975).
+
+`StructType` gained three convenience methods to add, change and remove
+a given field (ARROW-11424).
+
+The `Datum` kind `COLLECTION` has been removed as it was entirely unused
+in the codebase (ARROW-13598).
+
+### Compute Layer
+
+A number of compute functions have been added:
+
+- functions operating on strings: "binary_reverse" (ARROW-14306),
+  "string_repeat" (ARROW-12712), "utf8_normalize" (ARROW-14205);
+- "fill_null_forward", "fill_null_backward" (ARROW-1699);
+- "ceil_temporal", "floor_temporal", "round_temporal" to adjust temporal input
+  to an integral multiple of a given unit (ARROW-14822);
+- "year_month_day" to extract the calendar components of the input 
(ARROW-15032);
+- "random" to general random floating-point values between 0 and 1 
(ARROW-12404);
+- "indices_nonzero" to return the indices in the input where there are
+  non-zero, non-null values (ARROW-13035).
+
+Decimal data is now supported as input of the arithmetic kernels
+(ARROW-13130).
+
+Dictionary data is now supported as input of the hash join execution node
+(ARROW-14181).
+
+Residual predicates have been implemented in the hash join node
+(ARROW-13643).
+
+The "list_parent_indices" function now always returns int64 data
+regardless of the input type (ARROW-14592).
+
+Month-day-nano interval data is now supported as input of the same functions
+as other interval types (ARROW-13989).
+
+### CSV
+
+The CSV writer got additional configuration options:
+- the string representation of null values (ARROW-14905);
+- the quoting strategy: always / never / as needed (ARROW-14905);
+- the end of line character(s) (ARROW-14907)
+
+### Dataset Layer
+
+"SkyHook", a dataset addition that offloads fragment scan operations to a
+Ceph distributed storage, was contributed (ARROW-13607).
+
+The dataset writer now exposes options `min_rows_per_group` and
+`max_rows_per_group` to control the size of row groups created (ARROW-14426).
+
+### IO and Filesystem Layer
+
+A critical bug in the AWS SDK for C++ that risks losing data in S3 multipart
+uploads has been circumvented (ARROW-14523).
+
+The Google Cloud Storage filesystem is now featureful enough to pass all
+generic filesystem tests (ARROW-14924).
+
+The OpenAppendStream method of filesystems has been un-deprecated; however,
+it still cannot be implemented for all filesystem backends (ARROW-14969).
+
+A new function `arrow::fs::ResolveS3BucketRegion` allows resolving the
+region where a particular S3 bucket resides (ARROW-15165).
+
+The S3 filesystem now sets the Content-Type of output files to
+"application/octet-stream" (instead of "application/xml" previously)
+if not explicitly specified by the caller (ARROW-15306).
+
+### IPC
+
+Fine-grained I/O (coalescing) is now enabled in the synchronous (ARROW-12683)
+and asynchronous (ARROW-14577) IPC reader.
+
+It is now possible to set the compression level when using LZ4 compression
+(ARROW-9648).
+
+### Parquet
+
+DELTA_BYTE_ARRAY-encoded data can now be read from (but not written to)
+bytearray columns in Parquet files (PARQUET-492).
+
+## C# notes
+
+TBD
+
+## Go notes
+
+### Bug Fixes
+
+TBD
+
+### Enhancements
+
+TBD
+
+## Java notes
+
+TBD
+
+## JavaScript notes
+
+TBD
+
+## Python notes
+
+TBD
+
+## R notes
+
+TBD
+
+## Ruby and C GLib notes
+
+### Ruby
+
+There are two new contributors @okadakk and @simpl1g .
+
+The updates of Red Arrow consists of the following improvements:
+- Addition of a [cookbook section](https://github.com/apache/arrow/pull/11584) 
to the [main repository](https://github.com/apache/arrow/tree/master/ruby)
+- Improved [Arrow::Function#execute 
useability](https://github.com/apache/arrow/pull/12101)
+- Support for [table joins](https://github.com/apache/arrow/pull/12108)
+- A [MacOS Development guide](https://github.com/apache/arrow/pull/11181)
+- Enable separate use of [min and 
max](https://github.com/apache/arrow/pull/11179)
+- Add support for loading [Arrow::Table from a 
URI](https://github.com/apache/arrow/pull/12128)

Review comment:
       ```suggestion
   ### Ruby
   
   There are two new contributors @okadakk and @simpl1g .
   
   The updates of Red Arrow consists of the following improvements:
   
   - `Arrow::Function#execute` now accepts an instance of an `Arrow::Column` as 
its argument [(ARROW-14551)](https://issues.apache.org/jira/browse/ARROW-14551)
   - `Arrow::Table.load` now supports `.arrows` files to load 
[(ARROW-15356)](https://issues.apache.org/jira/browse/ARROW-15356)
   - Add support loading `Arrow::Table` by a `URI` in `Arrow::Table.load` 
[(ARROW-14562)](https://issues.apache.org/jira/browse/ARROW-14562)
   - `Arrow::Table` now supports to join two tables 
[(ARROW-14531)](https://issues.apache.org/jira/browse/ARROW-14531)
   - `Arrow::Function#execute` gets more easier to use than before 
[(ARROW-15274)](https://issues.apache.org/jira/browse/ARROW-15274)
   - `Arrow::SortKey#name` has been renamed to `Arrow::SortKey#target` 
[(ARROW-14784)](https://issues.apache.org/jira/browse/ARROW-14784)
   - Add Cookbook section to documentation 
[(ARROW-14636)](https://issues.apache.org/jira/browse/ARROW-14636)
   - Support the explicit initialization of S3 API by the `Arrow.s3_initialize` 
method [(ARROW-14637)](https://issues.apache.org/jira/browse/ARROW-14637)
   - On macOS, stop specifying the version of openssl package explicitly when 
building the extension library 
[(ARROW-14619)](https://issues.apache.org/jira/browse/ARROW-14619)
   ```




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to