This is an automated email from the ASF dual-hosted git repository.
romainfrancois pushed a change to branch ARROW_13112_altrep_strings
in repository https://gitbox.apache.org/repos/asf/arrow.git.
discard 841e241 lint
discard 24256d0 use static methods in AltrepArrayString
discard 53cc439 unused
add b0d89db ARROW-13680: [C++] Create an asynchronous nursery to simplify
capture logic
add 4b5ed4e ARROW-13138: [C++][R] Implement extract temporal components
(year, month, day, etc) from date32/64 types
add bb1ef85 ARROW-13033: [C++] Kernel to localize naive timestamps to a
timezone (preserving clock-time)
add 9aee524 ARROW-11885: [R] Turn off some capabilities when
LIBARROW_MINIMAL=true
add 0c41e0b ARROW-13842: [C++] Bump vendored date library
add 946bdcf ARROW-13963: [Go] Minor: Add bitmap reader/writer impl from
go Parquet module to Arrow Bitutil package
add 4fe6fae ARROW-13961: [C++] Fix use of non-const references,
declaration without initialization
add 66d7dd4 ARROW-13962: [R] Catch up on the NEWS
add 04515de MINOR: [R] Exclude some paths from the cpp rsync
add 56411f5 ARROW-13940: [R] Turn on multithreading with Arrow engine
queries
add 42d10c3 ARROW-13964: MINOR: [Go][Parquet] remove base bitmap
reader/writer from parquet module, use arrow bitutil ones
add 3bbec3f ARROW-13942: [Dev] Update cmake_format usage in autotune
comment bot
add 3db4854 ARROW-13778: [R] Handle complex summarize expressions
add fa7cff6 ARROW-1565: [C++] Implement TopK/BottomK
add bae7e2b MINOR: [Doc][Python] Fix typo ParquetFileForma (#11137)
add db5b848 ARROW-13979: [Go] Enable -race for go tests
add c091e6d ARROW-13859: [Java] Add code coverage support
add e8ab3ae ARROW-13733 [Java]: Allow JDBC adapters to reuse vector
schema roots
add 1049dde ARROW-13544 [Java]: Remove APIs that have been deprecated for
long (Changes to ArrowBuf)
add 74f020d ARROW-13974: [C++] Resolve follow-up reviews for TopK/BottomK
add 293f856 ARROW-13966: [C++] Support decimals in comparisons
add 9122149 ARROW-13937: [C++][Compute] Add explicit output values to
sign function and fix unary type checks
add f2cb977 ARROW-13646: [Go][Parquet] adding the parquet metadata package
add dfaa415 ARROW-13983: [C++] Avoid raising error if fadvise() isn't
supported
add 0610998 ARROW-13978: [C++] Bump gtest to 1.11 to unbreak builds with
recent clang
add 52904d6 ARROW-13958: [Python] Migrate Python ORC bindings to use new
Result-based APIs
add 376cb45 ARROW-12744: [C++][Compute] Add rounding kernel
add 87b2fcd ARROW-12087: [C++] Allow sorting durations, timestamps with
timezones
add 1cbc4a2 ARROW-13904: [R] Implement ModeOptions
add f3d3c68 ARROW-13905: [R] Implement ReplaceSliceOptions
add 0b6f531 ARROW-13906: [R] Implement PartitionNthOptions
add 672149b ARROW-13869: [R] Implement options for non-bound
MatchSubstringOptions kernels
add 8875d5c ARROW-13908: [R] Implement ExtractRegexOptions
add b06e3b6 ARROW-13626: [R] Bindings for log base b
add f8f2617 ARROW-13777: [R] mutate after group_by should be ok as long
as there are only scalar functions
add a3097af ARROW-13976: [C++] Add path to libjvm.so in ARM CPU
add 9cf2372 ARROW-13959: [R] Update tests for extracting components from
date32 objects
add b05b169 ARROW-13996: [Go][Parquet] Fix file offsets in go impl
add 3dcf31c ARROW-13699: [Python][Docs] Improve filesystem documentation
add 44801ec ARROW-13982: [C++] Don't stall in async scanner if a fragment
generates no batches
add 2fb8201 ARROW-13877: [C++] Support FixedSizeList in generic list
kernels
add 1f7ee39 ARROW-13897: [Python] Correct TimestampScalar.as_py() and
DurationScalar.as_py() docstrings
add 012248a ARROW-12657: [C++] Adding String hex to numeric conversion
add 1d139fe ARROW-13881: [C++][FlightRPC][Packaging] Ensure Flight is
packaged with advanced TLS options on Windows
add 03bf88b ARROW-14005: [R] Fix tests for PartitionNthOptions so that
can run on various platformsFix partition_nth_indices test
add 1919c33 ARROW-13594: [CI] Temporarily disable turbodbc integration
tests in nightly builds
add 2c405da ARROW-13882: [C++] Improve min_max/hash_min_max type support
add 6d4d700 ARROW-14007: [C++] Fix compiler warnings in decimal promotion
helper
add 80e0e30 ARROW-13785: [C++] Add methods to print exec nodes/plans
add 1130d18 ARROW-13999: [C++] Fix bundled LZ4 build on MinGW
add 5260fd5 ARROW-13965: [C++] dynamic_casts in parquet
TypedColumnWriterImpl impacting performance
add b599a05 ARROW-13849: [C++] Wrap min_max with min/max functions
add 0dd68c2 MINOR: [R] Remove a lintr warning in r/R/util.R
add d01d598 ARROW-14003: [C++][Python] Not providing a sort_key in the
"select_k_unstable" kernel crashes
add 6cff6a3 ARROW-13816: [Go][C] Implement Consumer APIs for C Data
Interface in Go
add 4131fde ARROW-14001: [Go] Fixing AppendBoolean function in
BitmapWriter
add 7bf5609 ARROW-14009: [C++] Seed parallellism in SourceNode
add e5f3e04 ARROW-14016: [C++] Wrong type_name used for directory
partitioning
add cafdb12 ARROW-14030: [GLib] Use arrow::Result based ORC API
add be79766 ARROW-14031: [Ruby] Use min and max separately
add c57d7f3 ARROW-14023: [Ruby] Arrow::Table#slice accepts Hash
add a09f205 ARROW-14033: [Ruby][Doc] Add macOS development guide for Red
Arrow
add d72d3f3 ARROW-13654: [C++][Parquet] Avoid infinite loop when
appending a FileMetaData to itself
add 7128b5e ARROW-14002: [Python] Support tuples in unify_schemas
add 1bef74b ARROW-12673: [C++] Add callback to handle incorrect column
counts
add 33c45e4 ARROW-13550: [R] Support .groups argument to
dplyr::summarize()
add bdb2f74 ARROW-12669:[C++][Python] Implement a new scalar function:
list_element
add b29c302 ARROW-14033: [Ruby] Append OpenSSL's .pc path automatically
on macOS with Homebrew
add 3ddd3b6 ARROW-14013: [C++][Docs] Add instructions for Fedora
add 6a6b464 ARROW-13990: [R] Bindings for round kernels
add 87e2ad5 ARROW-13573: [C++] Support dictionaries natively in case_when
add 006394a ARROW-14049: [C++][Java] Upgrade ORC to 1.7.0
add e09b309 ARROW-14040: [C++] Fix result order dependence in scanner test
add d0a4263 ARROW-13852: [R] Handle Dataset schema metadata in ExecPlan
add 44a38b3 ARROW-13502: [R] Bindings for min/max aggregation
add 6ed712a ARROW-13670: [C++] add virtual destructors
add ea1c020 ARROW-13755: [Python] Allow writing datasets using a
partitioning that only specifies field_names
add 0614895 ARROW-13652: [Python] Expose copy_files in pyarrow.fs
add 6c23e17 ARROW-14043: [Python] Allow unsigned integer index type in
dictionary() type factory function
add 18756c2 ARROW-10415: [R] Support for dplyr::distinct()
add 1aae71f ARROW-14055: [Docs] Add canonical url to the sphinx docs
add cecca46 ARROW-14050: [C++] Make TDigest/Quantile kernels return nulls
instead
add ce34ea1 ARROW-14062: [Format] Initial arrow-internal specification of
compute IR
add 8a1da8a ARROW-14057: [C++] Bump aws-c-common version
add 416655d ARROW-14053: [C++][CSV] Use atomic counter for async tests
add 47f7e13 ARROW-13967: [Go] Implement Concatenate function for
array.Interface
add ec7aeb5 PARQUET-2089: [C++] Align RowGroup file_offset with
specification
add 251ca7a ARROW-13327: [C++][Python] Improve consistency of explicit
C++ types in PyArrow files
add 18ca1e9 ARROW-13532: [C++][Compute] - adding set membership type
filtering to hash table interface
add 3317f83 ARROW-14056: [Doc][C++] Document ArrayData
add 075c6c6 ARROW-14064: [CI] Use Debian 11
add 261bb7b unused
add 8c6321f use static methods in AltrepArrayString
add f763d31 lint
add 0d2a9bf use static methods for primitive too
add bfd426a Move Min(), Max() and Sum() inside AltrepArrayPrimitive
add 72fe1a4 Materialize() returns data2
add f546247 factor out shared code between altrep types.
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (841e241)
\
N -- N -- N refs/heads/ARROW_13112_altrep_strings (f546247)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
.env | 2 +-
.gitattributes | 2 +-
.github/workflows/comment_bot.yml | 4 +-
.github/workflows/cpp.yml | 2 +-
.github/workflows/go.yml | 35 +-
.github/workflows/js.yml | 2 +-
.github/workflows/r.yml | 2 +-
c_glib/arrow-glib/orc-file-reader.cpp | 53 +-
c_glib/test/dataset/test-partitioning.rb | 2 +-
ci/appveyor-cpp-setup.bat | 6 +
.../debian-10-go-cgo-python.dockerfile} | 30 +-
ci/docker/debian-10-go.dockerfile | 3 +-
ci/docker/debian-11-cpp.dockerfile | 98 +
.../debian-11-go-cgo-python.dockerfile} | 30 +-
...an-10-go.dockerfile => debian-11-go.dockerfile} | 3 +-
...an-10-js.dockerfile => debian-11-js.dockerfile} | 0
ci/scripts/PKGBUILD | 6 +-
ci/scripts/{go_test.sh => go_cgo_python_test.sh} | 33 +-
ci/scripts/go_test.sh | 16 +-
ci/scripts/msys2_setup.sh | 1 +
cpp/apidoc/Doxyfile | 4 +-
cpp/build-support/build-lz4-lib.sh | 11 +-
cpp/build-support/lint_exclusions.txt | 1 +
cpp/build-support/update-flatbuffers.sh | 45 +-
cpp/cmake_modules/ThirdpartyToolchain.cmake | 5 +-
cpp/examples/arrow/compute_register_example.cc | 2 +-
cpp/src/arrow/CMakeLists.txt | 1 +
cpp/src/arrow/array.h | 12 +
cpp/src/arrow/array/array_binary.h | 6 +
cpp/src/arrow/array/array_binary_test.cc | 13 +-
cpp/src/arrow/array/array_decimal.h | 6 +
cpp/src/arrow/array/array_list_test.cc | 34 +
cpp/src/arrow/array/array_nested.cc | 5 +
cpp/src/arrow/array/array_nested.h | 13 +
cpp/src/arrow/array/array_primitive.h | 84 +-
cpp/src/arrow/array/array_test.cc | 12 +-
cpp/src/arrow/array/builder_base.cc | 10 +-
cpp/src/arrow/array/builder_base.h | 13 +-
cpp/src/arrow/array/builder_dict.cc | 39 +-
cpp/src/arrow/array/builder_dict.h | 110 +
cpp/src/arrow/array/concatenate_test.cc | 5 +-
cpp/src/arrow/builder.cc | 269 +--
cpp/src/arrow/compute/api_scalar.cc | 136 ++
cpp/src/arrow/compute/api_scalar.h | 149 +-
cpp/src/arrow/compute/api_vector.cc | 18 +
cpp/src/arrow/compute/api_vector.h | 69 +-
cpp/src/arrow/compute/exec.cc | 5 -
cpp/src/arrow/compute/exec/aggregate_node.cc | 62 +-
cpp/src/arrow/compute/exec/exec_plan.cc | 44 +-
cpp/src/arrow/compute/exec/exec_plan.h | 9 +-
cpp/src/arrow/compute/exec/filter_node.cc | 5 +-
cpp/src/arrow/compute/exec/key_encode.cc | 16 +-
cpp/src/arrow/compute/exec/key_map.cc | 700 ++++--
cpp/src/arrow/compute/exec/key_map.h | 100 +-
cpp/src/arrow/compute/exec/key_map_avx2.cc | 193 +-
cpp/src/arrow/compute/exec/plan_test.cc | 94 +
cpp/src/arrow/compute/exec/project_node.cc | 20 +-
cpp/src/arrow/compute/exec/sink_node.cc | 7 +-
cpp/src/arrow/compute/exec/source_node.cc | 30 +-
cpp/src/arrow/compute/exec/test_util.cc | 2 +-
cpp/src/arrow/compute/exec/union_node.cc | 2 +-
cpp/src/arrow/compute/function.h | 2 +-
cpp/src/arrow/compute/function_test.cc | 20 +-
cpp/src/arrow/compute/kernel.h | 11 +-
cpp/src/arrow/compute/kernels/CMakeLists.txt | 2 +
cpp/src/arrow/compute/kernels/aggregate_basic.cc | 67 +-
.../arrow/compute/kernels/aggregate_basic_avx2.cc | 22 +-
.../compute/kernels/aggregate_basic_avx512.cc | 17 +-
.../compute/kernels/aggregate_basic_internal.h | 141 +-
cpp/src/arrow/compute/kernels/aggregate_internal.h | 4 +
.../arrow/compute/kernels/aggregate_quantile.cc | 42 +-
cpp/src/arrow/compute/kernels/aggregate_tdigest.cc | 24 +-
cpp/src/arrow/compute/kernels/aggregate_test.cc | 303 ++-
cpp/src/arrow/compute/kernels/codegen_internal.cc | 226 +-
cpp/src/arrow/compute/kernels/codegen_internal.h | 41 +-
cpp/src/arrow/compute/kernels/hash_aggregate.cc | 286 ++-
.../arrow/compute/kernels/hash_aggregate_test.cc | 208 +-
cpp/src/arrow/compute/kernels/scalar_arithmetic.cc | 459 +++-
.../compute/kernels/scalar_arithmetic_test.cc | 313 ++-
.../arrow/compute/kernels/scalar_cast_numeric.cc | 7 +-
cpp/src/arrow/compute/kernels/scalar_cast_test.cc | 53 +-
cpp/src/arrow/compute/kernels/scalar_compare.cc | 12 +-
.../arrow/compute/kernels/scalar_compare_test.cc | 114 +
cpp/src/arrow/compute/kernels/scalar_if_else.cc | 37 +-
.../arrow/compute/kernels/scalar_if_else_test.cc | 193 ++
cpp/src/arrow/compute/kernels/scalar_nested.cc | 134 ++
.../arrow/compute/kernels/scalar_nested_test.cc | 68 +
cpp/src/arrow/compute/kernels/scalar_string.cc | 2 +
cpp/src/arrow/compute/kernels/scalar_temporal.cc | 484 ++--
.../arrow/compute/kernels/scalar_temporal_test.cc | 206 +-
cpp/src/arrow/compute/kernels/select_k_test.cc | 716 ++++++
cpp/src/arrow/compute/kernels/test_util.cc | 92 +-
cpp/src/arrow/compute/kernels/test_util.h | 95 +
cpp/src/arrow/compute/kernels/util_internal.h | 13 +-
cpp/src/arrow/compute/kernels/vector_nested.cc | 28 +-
.../arrow/compute/kernels/vector_nested_test.cc | 46 +
cpp/src/arrow/compute/kernels/vector_sort.cc | 649 +++++-
cpp/src/arrow/compute/kernels/vector_sort_test.cc | 146 +-
.../arrow/compute/kernels/vector_topk_benchmark.cc | 59 +
cpp/src/arrow/csv/invalid_row.h | 56 +
cpp/src/arrow/csv/options.h | 3 +
cpp/src/arrow/csv/parser.cc | 67 +-
cpp/src/arrow/csv/parser.h | 41 +-
cpp/src/arrow/csv/parser_test.cc | 121 +
cpp/src/arrow/csv/reader.cc | 9 +-
cpp/src/arrow/csv/reader_test.cc | 113 +-
cpp/src/arrow/csv/test_common.cc | 10 +-
cpp/src/arrow/csv/test_common.h | 4 +-
cpp/src/arrow/dataset/partition.cc | 2 +-
cpp/src/arrow/dataset/partition.h | 2 +-
cpp/src/arrow/dataset/scanner.cc | 15 +-
cpp/src/arrow/dataset/scanner_test.cc | 77 +-
cpp/src/arrow/dataset/test_util.h | 26 +-
cpp/src/arrow/flight/CMakeLists.txt | 3 +
cpp/src/arrow/io/compressed_test.cc | 6 +
cpp/src/arrow/io/file.cc | 23 +-
cpp/src/arrow/io/hdfs_internal.cc | 61 +-
cpp/src/arrow/ipc/json_simple.cc | 19 +
cpp/src/arrow/ipc/json_simple.h | 5 +
cpp/src/arrow/ipc/json_simple_test.cc | 24 +
cpp/src/arrow/pretty_print_test.cc | 7 +-
cpp/src/arrow/python/arrow_to_pandas.cc | 1 +
cpp/src/arrow/python/python_test.cc | 2 +-
cpp/src/arrow/scalar.cc | 3 +-
cpp/src/arrow/scalar.h | 13 +
cpp/src/arrow/scalar_test.cc | 8 +
cpp/src/arrow/testing/gtest_compat.h | 4 +
cpp/src/arrow/testing/gtest_util.cc | 9 +
cpp/src/arrow/testing/gtest_util.h | 14 +
cpp/src/arrow/type.cc | 125 ++
cpp/src/arrow/type.h | 454 ++--
cpp/src/arrow/type_fwd.h | 5 +-
cpp/src/arrow/type_traits.h | 4 +-
cpp/src/arrow/util/CMakeLists.txt | 1 +
cpp/src/arrow/util/async_generator.h | 44 +
cpp/src/arrow/util/async_generator_test.cc | 12 +
cpp/src/arrow/util/async_util.cc | 82 +
cpp/src/arrow/util/async_util.h | 132 ++
cpp/src/arrow/util/async_util_test.cc | 154 ++
cpp/src/arrow/util/compression_test.cc | 5 +
cpp/src/arrow/util/decimal.h | 23 +
.../gtest_compat.h => util/math_constants.h} | 19 +-
cpp/src/arrow/util/tdigest.cc | 5 +-
cpp/src/arrow/util/value_parsing.h | 43 +
cpp/src/arrow/util/value_parsing_benchmark.cc | 48 +
cpp/src/arrow/util/value_parsing_test.cc | 86 +
cpp/src/arrow/util/variant_test.cc | 37 +-
cpp/src/arrow/vendored/datetime/README.md | 9 +-
cpp/src/arrow/vendored/datetime/date.h | 466 +++-
cpp/src/arrow/vendored/datetime/ios.h | 4 +-
cpp/src/arrow/vendored/datetime/ios.mm | 110 +-
cpp/src/arrow/vendored/datetime/tz.cpp | 252 ++-
cpp/src/arrow/vendored/datetime/tz.h | 99 +-
cpp/src/arrow/vendored/datetime/tz_private.h | 8 +-
cpp/src/generated/Expression_generated.h | 1786 +++++++++++++++
cpp/src/generated/File_generated.h | 3 +
cpp/src/generated/Literal_generated.h | 2243 +++++++++++++++++++
cpp/src/generated/Message_generated.h | 12 +
cpp/src/generated/Plan_generated.h | 115 +
cpp/src/generated/Relation_generated.h | 1568 +++++++++++++
cpp/src/generated/Schema_generated.h | 238 +-
cpp/src/generated/SparseTensor_generated.h | 12 +
cpp/src/generated/Tensor_generated.h | 6 +
cpp/src/generated/feather_generated.h | 21 +
cpp/src/parquet/column_writer.cc | 45 +-
cpp/src/parquet/metadata.cc | 20 +-
cpp/src/parquet/metadata_test.cc | 23 +-
cpp/thirdparty/versions.txt | 12 +-
dev/release/rat_exclude_files.txt | 1 +
dev/tasks/conda-recipes/arrow-cpp/bld-arrow.bat | 1 +
dev/tasks/r/azure.linux.yml | 13 +-
dev/tasks/tasks.yml | 47 +-
docker-compose.yml | 41 +-
docs/source/conf.py | 4 +
docs/source/cpp/api/array.rst | 58 +-
docs/source/cpp/api/datatype.rst | 83 +-
docs/source/cpp/arrays.rst | 47 +-
docs/source/cpp/compute.rst | 350 ++-
docs/source/cpp/memory.rst | 24 +-
docs/source/developers/computeir.rst | 59 +
docs/source/developers/cpp/building.rst | 15 +-
docs/source/format/Columnar.rst | 2 +
docs/source/index.rst | 1 +
docs/source/python/api/compute.rst | 10 +-
docs/source/python/dataset.rst | 2 +-
docs/source/python/filesystems.rst | 73 +-
docs/source/status.rst | 40 +-
experimental/computeir/Expression.fbs | 209 ++
experimental/computeir/Literal.fbs | 205 ++
.../computeir/Plan.fbs | 19 +-
experimental/computeir/Relation.fbs | 209 ++
go/arrow/array/builder.go | 3 +
go/arrow/array/concat.go | 367 ++++
go/arrow/array/concat_test.go | 301 +++
go/arrow/bitutil/bitmaps.go | 438 ++++
go/arrow/bitutil/bitmaps_test.go | 358 +++
go/arrow/bitutil/bitutil.go | 61 +
go/arrow/bitutil/bitutil_test.go | 33 +
{cpp/src => go/arrow/cdata}/arrow/c/abi.h | 0
{cpp/src => go/arrow/cdata}/arrow/c/helpers.h | 0
go/arrow/cdata/cdata.go | 574 +++++
go/arrow/cdata/cdata_fulltest.c | 379 ++++
go/arrow/cdata/cdata_test.go | 621 ++++++
go/arrow/cdata/cdata_test_framework.go | 248 +++
go/arrow/cdata/interface.go | 161 ++
go/arrow/{ => cdata/test}/go.mod | 20 +-
go/arrow/{ => cdata/test}/go.sum | 13 -
go/arrow/cdata/test/test_cimport.go | 101 +
go/arrow/cdata/test/test_export_to_cgo.py | 95 +
go/arrow/cdata/utils.h | 56 +
go/arrow/datatype_fixedwidth.go | 2 +-
go/arrow/datatype_fixedwidth_test.go | 10 +-
go/arrow/go.mod | 2 +
go/arrow/go.sum | 58 +-
go/arrow/internal/testing/gen/random_array_gen.go | 307 +++
go/parquet/go.mod | 6 +-
go/parquet/go.sum | 79 +-
go/parquet/internal/encoding/boolean_encoder.go | 2 +-
go/parquet/internal/hashing/xxh3_memo_table.go | 8 +-
go/parquet/internal/utils/bit_benchmark_test.go | 105 +-
.../internal/utils/bit_block_counter_test.go | 4 +-
go/parquet/internal/utils/bit_run_reader_test.go | 2 +-
.../internal/utils/bit_set_run_reader_test.go | 33 +-
go/parquet/internal/utils/bitmap_reader.go | 72 -
go/parquet/internal/utils/bitmap_reader_test.go | 75 -
go/parquet/internal/utils/bitmap_writer.go | 160 +-
go/parquet/metadata/app_version.go | 184 ++
go/parquet/metadata/column_chunk.go | 423 ++++
go/parquet/metadata/file.go | 508 +++++
go/parquet/metadata/metadata_test.go | 346 +++
go/parquet/metadata/row_group.go | 177 ++
go/parquet/metadata/stat_compare_test.go | 268 +++
go/parquet/metadata/statistics.go | 552 +++++
go/parquet/metadata/statistics_test.go | 190 ++
go/parquet/metadata/statistics_types.gen.go | 2308 ++++++++++++++++++++
go/parquet/metadata/statistics_types.gen.go.tmpl | 420 ++++
go/parquet/reader_writer_properties_test.go | 2 +-
go/parquet/types.go | 33 +-
go/parquet/writer_properties.go | 6 +-
java/adapter/jdbc/pom.xml | 13 -
.../arrow/adapter/jdbc/ArrowVectorIterator.java | 50 +-
.../arrow/adapter/jdbc/JdbcToArrowConfig.java | 18 +-
.../adapter/jdbc/JdbcToArrowConfigBuilder.java | 9 +
.../adapter/jdbc/AbstractJdbcToArrowTest.java | 1 +
.../arrow/adapter/jdbc/JdbcToArrowConfigTest.java | 10 +-
.../arrow/adapter/jdbc/h2/JdbcToArrowTest.java | 27 +-
.../jdbc/h2/JdbcToArrowVectorIteratorTest.java | 50 +-
java/adapter/orc/pom.xml | 15 +-
java/memory/memory-core/pom.xml | 18 -
.../java/org/apache/arrow/memory/ArrowBuf.java | 43 -
java/memory/memory-netty/pom.xml | 29 -
java/memory/memory-unsafe/pom.xml | 19 -
java/pom.xml | 76 +-
java/vector/pom.xml | 19 +-
.../apache/arrow/vector/ipc/JsonFileWriter.java | 2 +-
python/pyarrow/_compute.pyx | 561 +++--
python/pyarrow/_dataset.pyx | 6 +-
python/pyarrow/_fs.pyx | 42 +
python/pyarrow/_orc.pxd | 18 +-
python/pyarrow/_orc.pyx | 21 +-
python/pyarrow/array.pxi | 7 +-
python/pyarrow/compute.py | 141 +-
python/pyarrow/dataset.py | 58 +-
python/pyarrow/fs.py | 68 +
python/pyarrow/includes/libarrow.pxd | 201 +-
python/pyarrow/includes/libarrow_dataset.pxd | 2 +-
python/pyarrow/includes/libarrow_fs.pxd | 13 +
python/pyarrow/scalar.pxi | 10 +-
python/pyarrow/table.pxi | 4 +-
python/pyarrow/tests/test_compute.py | 377 +++-
python/pyarrow/tests/test_dataset.py | 71 +
python/pyarrow/tests/test_fs.py | 109 +-
python/pyarrow/tests/test_schema.py | 4 +
python/pyarrow/tests/test_types.py | 8 +-
python/pyarrow/types.pxi | 9 +-
r/DESCRIPTION | 1 +
r/Makefile | 8 +-
r/NAMESPACE | 3 +
r/NEWS.md | 27 +-
r/R/arrow-package.R | 7 +-
r/R/dplyr-distinct.R | 46 +
r/R/dplyr-eval.R | 1 +
r/R/dplyr-functions.R | 53 +-
r/R/dplyr-group-by.R | 4 +
r/R/dplyr-mutate.R | 6 +-
r/R/dplyr-summarize.R | 208 +-
r/R/dplyr.R | 5 +
r/R/enums.R | 15 +
r/R/query-engine.R | 27 +-
r/R/util.R | 15 +-
r/configure | 3 +-
r/man/enums.Rd | 5 +
r/src/altrep.cpp | 592 +++--
r/src/compute-exec.cpp | 1 +
r/src/compute.cpp | 88 +-
r/tests/testthat/helper-expectation.R | 4 +-
r/tests/testthat/test-chunked-array.R | 10 +-
r/tests/testthat/test-compute-no-bindings.R | 153 +-
r/tests/testthat/test-dataset.R | 18 +-
r/tests/testthat/test-dplyr-arrange.R | 1 -
r/tests/testthat/test-dplyr-distinct.R | 104 +
r/tests/testthat/test-dplyr-group-by.R | 25 +-
r/tests/testthat/test-dplyr-lubridate.R | 158 +-
r/tests/testthat/test-dplyr-mutate.R | 47 +-
r/tests/testthat/test-dplyr-string-functions.R | 30 +-
r/tests/testthat/test-dplyr-summarize.R | 295 ++-
r/tests/testthat/test-dplyr.R | 135 +-
r/tests/testthat/test-metadata.R | 72 +-
r/tools/nixlibs.R | 26 +-
ruby/red-arrow/README.md | 23 +
ruby/red-arrow/ext/arrow/extconf.rb | 13 +
ruby/red-arrow/lib/arrow/group.rb | 11 +-
ruby/red-arrow/lib/arrow/table.rb | 33 +-
ruby/red-arrow/test/test-group.rb | 45 +-
ruby/red-arrow/test/test-table.rb | 98 +
315 files changed, 29127 insertions(+), 4214 deletions(-)
copy ci/{scripts/go_test.sh => docker/debian-10-go-cgo-python.dockerfile} (60%)
mode change 100755 => 100644
create mode 100644 ci/docker/debian-11-cpp.dockerfile
copy ci/{scripts/go_test.sh => docker/debian-11-go-cgo-python.dockerfile} (60%)
mode change 100755 => 100644
copy ci/docker/{debian-10-go.dockerfile => debian-11-go.dockerfile} (96%)
copy ci/docker/{debian-10-js.dockerfile => debian-11-js.dockerfile} (100%)
copy ci/scripts/{go_test.sh => go_cgo_python_test.sh} (71%)
create mode 100644 cpp/src/arrow/compute/kernels/select_k_test.cc
create mode 100644 cpp/src/arrow/compute/kernels/vector_topk_benchmark.cc
create mode 100644 cpp/src/arrow/csv/invalid_row.h
create mode 100644 cpp/src/arrow/util/async_util.cc
create mode 100644 cpp/src/arrow/util/async_util.h
create mode 100644 cpp/src/arrow/util/async_util_test.cc
copy cpp/src/arrow/{testing/gtest_compat.h => util/math_constants.h} (70%)
create mode 100644 cpp/src/generated/Expression_generated.h
create mode 100644 cpp/src/generated/Literal_generated.h
create mode 100644 cpp/src/generated/Plan_generated.h
create mode 100644 cpp/src/generated/Relation_generated.h
create mode 100644 docs/source/developers/computeir.rst
create mode 100644 experimental/computeir/Expression.fbs
create mode 100644 experimental/computeir/Literal.fbs
copy cpp/src/arrow/testing/gtest_compat.h => experimental/computeir/Plan.fbs
(69%)
create mode 100644 experimental/computeir/Relation.fbs
create mode 100644 go/arrow/array/concat.go
create mode 100644 go/arrow/array/concat_test.go
create mode 100644 go/arrow/bitutil/bitmaps.go
create mode 100644 go/arrow/bitutil/bitmaps_test.go
copy {cpp/src => go/arrow/cdata}/arrow/c/abi.h (100%)
copy {cpp/src => go/arrow/cdata}/arrow/c/helpers.h (100%)
create mode 100644 go/arrow/cdata/cdata.go
create mode 100644 go/arrow/cdata/cdata_fulltest.c
create mode 100644 go/arrow/cdata/cdata_test.go
create mode 100644 go/arrow/cdata/cdata_test_framework.go
create mode 100644 go/arrow/cdata/interface.go
copy go/arrow/{ => cdata/test}/go.mod (54%)
copy go/arrow/{ => cdata/test}/go.sum (92%)
create mode 100644 go/arrow/cdata/test/test_cimport.go
create mode 100644 go/arrow/cdata/test/test_export_to_cgo.py
create mode 100644 go/arrow/cdata/utils.h
create mode 100644 go/arrow/internal/testing/gen/random_array_gen.go
delete mode 100644 go/parquet/internal/utils/bitmap_reader.go
delete mode 100644 go/parquet/internal/utils/bitmap_reader_test.go
create mode 100644 go/parquet/metadata/app_version.go
create mode 100644 go/parquet/metadata/column_chunk.go
create mode 100644 go/parquet/metadata/file.go
create mode 100644 go/parquet/metadata/metadata_test.go
create mode 100644 go/parquet/metadata/row_group.go
create mode 100644 go/parquet/metadata/stat_compare_test.go
create mode 100644 go/parquet/metadata/statistics.go
create mode 100644 go/parquet/metadata/statistics_test.go
create mode 100644 go/parquet/metadata/statistics_types.gen.go
create mode 100644 go/parquet/metadata/statistics_types.gen.go.tmpl
create mode 100644 r/R/dplyr-distinct.R
create mode 100644 r/tests/testthat/test-dplyr-distinct.R