This is an automated email from the ASF dual-hosted git repository.
nevime pushed a change to branch rust-parquet-arrow-writer
in repository https://gitbox.apache.org/repos/asf/arrow.git.
omit 8ccd9c3 ARROW-7842: [Rust] [Parquet] Arrow list reader
omit 453f978 ARROW-10334: [Rust] [Parquet] NullArray roundtrip
omit ead5e14 ARROW-10225: [Rust] [Parquet] Fix null comparison in roundtrip
omit 3a22d3d ARROW-10168: [Rust] [Parquet] Schema roundtrip - use Arrow
schema from Parquet metadata when available
omit b7b45d1 ARROW-10191: [Rust] [Parquet] Add roundtrip Arrow -> Parquet
tests for all supported Arrow DataTypes
omit 6e237bc ARROW-8426: [Rust] [Parquet] Add support for writing
dictionary types
omit 2f81785 ARROW-10095: [Rust] Update rust-parquet-arrow-writer branch's
encode_arrow_schema with ipc changes
omit 923d23b ARROW-8423: [Rust] [Parquet] Serialize Arrow schema metadata
omit 2b8dc08 ARROW-8289: [Rust] Parquet Arrow writer with nested support
omit ab62c28 ARROW-10321: [C++] Use check_cxx_source_compiles for AVX512
detect in compiler
omit 487895f ARROW-10311: [Release] Update crossbow verification process
omit f58db45 ARROW-9898: [C++][Gandiva] Fix linking issue with
castINT/FLOAT functions
omit 2510f4f ARROW-10313: [C++] Faster UTF8 validation for small strings
omit 7189b91 ARROW-9475: [Java] Clean up usages of BaseAllocator, use
BufferAllocator in…
omit 18495e0 ARROW-10294: [Java] Resolve problems of DecimalVector APIs on
ArrowBufs
omit 1d10f22 ARROW-10236: [Rust] Add can_cast_types to arrow cast kernel,
use in DataFusion
omit 35ace39 ARROW-10174: [Java] Fix reading/writing dict structs
omit ed8b1bc ARROW-10145: [C++][Dataset] Assert integer overflow in
partitioning falls back to string
omit ac14e91 ARROW-9479: [JS] Fix Table.from for zero-item serialized
tables, Table.empty for schemas containing compound types (List, FixedSizeList,
Map)
omit 249adb4 ARROW-10270: [R] Fix CSV timestamp_parsers test on R-devel
omit ea29f65 ARROW-10292: [Rust] [DataFusion] Simplify merge
omit becf329 ARROW-10289: [Rust] Read dictionaries in IPC streams
omit 818593f ARROW-10295 [Rust] [DataFusion] Replace Rc<RefCell<>> by
Box<> in accumulators.
omit c5280a5 ARROW-10293: [Rust] [DataFusion] Fixed benchmarks
omit f07a415 ARROW-10263: [C++][Compute] Improve variance kernel numerical
stability
add 0aa2069 [Release] Update CHANGELOG.md for 2.0.0
add e46a3c6 [Release] Update .deb/.rpm changelogs for 2.0.0
add 5943421 [Release] Update versions for 2.0.0
add 4782866 [maven-release-plugin] prepare release apache-arrow-2.0.0
add b1f36ac [Release] Update versions for 3.0.0-SNAPSHOT
add f72575c [Release] Update .deb package names for 3.0.0
add a3a35b2 ARROW-10263: [C++][Compute] Improve variance kernel numerical
stability
add 91b5f07 ARROW-10293: [Rust] [DataFusion] Fixed benchmarks
add a030fc5 ARROW-10295 [Rust] [DataFusion] Replace Rc<RefCell<>> by
Box<> in accumulators.
add 34533b6 ARROW-10289: [Rust] Read dictionaries in IPC streams
add 7209ffc ARROW-10292: [Rust] [DataFusion] Simplify merge
add 9e671ac ARROW-10270: [R] Fix CSV timestamp_parsers test on R-devel
add 8f302d3 ARROW-9479: [JS] Fix Table.from for zero-item serialized
tables, Table.empty for schemas containing compound types (List, FixedSizeList,
Map)
add 03c7c02 ARROW-10145: [C++][Dataset] Assert integer overflow in
partitioning falls back to string
add a7ef5d2 ARROW-10174: [Java] Fix reading/writing dict structs
add 3f69ad2 ARROW-10236: [Rust] Add can_cast_types to arrow cast kernel,
use in DataFusion
add 22027c7 ARROW-10294: [Java] Resolve problems of DecimalVector APIs on
ArrowBufs
add cb58146 ARROW-9475: [Java] Clean up usages of BaseAllocator, use
BufferAllocator in…
add 3f96cc0 ARROW-10313: [C++] Faster UTF8 validation for small strings
add 36bf7a4 ARROW-9898: [C++][Gandiva] Fix linking issue with
castINT/FLOAT functions
add 2fe8b86 ARROW-10311: [Release] Update crossbow verification process
add 3dbea74 ARROW-10321: [C++] Use check_cxx_source_compiles for AVX512
detect in compiler
add be21209 ARROW-5350: [Rust] Allow filtering on simple lists
add 29e9d13 ARROW-10002: [Rust] Remove trait specialization from arrow
crate
add ad2a002 ARROW-10163: [Rust] [DataFusion] Add DictionaryArray coercion
support
add b653922 ARROW-10323: [Release][wheel] Add missing verification setup
step
add 5af12b0 ARROW-10340: [Packaging][deb][RPM] Use Python 3.8 for pygit2
add 6ad24c2 ARROW-10310: [C++][Gandiva] Add single argument round() in
Gandiva
add 139fb51 ARROW-10241: [C++][Compute] Add variance kernel benchmark
add 457935e ARROW-10106: [FlightRPC][Java] Expose onIsReady() callback
add 7944265 ARROW-10203: [Doc] Give guidance on big-endian support in the
contributors docs
add 0e13e28 ARROW-9164: [C++] Add embedded documentation to compute
functions
add 445251a ARROW-9991: [C++] Split kernels for strings/binary
add 04660f8 ARROW-10348: [C++] Fix crash on invalid Parquet data
add 7696b89 ARROW-10338: [Rust] Use const fn for applicable methods
add 2321542 ARROW-10318: [C++] Use pimpl idiom in CSV parser
add 84e4b15 ARROW-10320 [Rust] [DataFusion] Migrated from batch iterators
to batch streams.
add 760284f ARROW-10358: [R] Followups to 2.0.0 release
add f06c68a ARROW-10331: [Rust] [DataFusion] Re-organize DataFusion errors
add bb4f2a0 ARROW-9963: [Python] Recognize datetime.timezone.utc as UTC
on conversion python->pyarrow
add 16412a1 ARROW-10328: [C++] Vendor fast_float number parsing library
add 625bf3f ARROW-10363: [Python] Remove CMake bug workaround in manylinux
add e2d8dc3 ARROW-10304: [C++][Compute] Optimize variance kernel for
integers
add 5ad3c98 ARROW-10369: [Dev] Fix archery release utility test cases
add 843e8bb ARROW-9304: [C++] Add "AppendEmpty" builder APIs for use
inside StructBuilder::AppendNull
add 103489b ARROW-10208: [C++] Fix split string kernels on sliced input
add 39a7539 ARROW-10207: [C++] Allow precomputing output string/list
offsets in kernels
add 982ca83 ARROW-10365: [R] Remove duplicate setting of S3 flag on macOS
add 9cb0e9f ARROW-10249: [Rust] Support nested dictionaries inside list
arrays
add b3992ea ARROW-10333: [Java] Get rid of org.apache.arrow.util in vector
add 590541d ARROW-9747: [Java][C++] Initial Support for 256-bit Decimals
add c49b5da ARROW-10302: [Python] Don't double-package plasma-store-server
add eea8d35 ARROW-10375 [Rust] Removed PrimitiveArrayOps
add da50838 ARROW-10383: [Doc] fix typos
add aa2e6d4 ARROW-10384: [C++] Fix typos
add 221de70 ARROW-10382: [Rust] Fix typos
new 6dc26ed ARROW-8289: [Rust] Parquet Arrow writer with nested support
new dcec335 ARROW-8423: [Rust] [Parquet] Serialize Arrow schema metadata
new 98d4ffc ARROW-10095: [Rust] Update rust-parquet-arrow-writer branch's
encode_arrow_schema with ipc changes
new 5063854 ARROW-8426: [Rust] [Parquet] Add support for writing
dictionary types
new 6677a41 ARROW-10191: [Rust] [Parquet] Add roundtrip Arrow -> Parquet
tests for all supported Arrow DataTypes
new 96798dc ARROW-10168: [Rust] [Parquet] Schema roundtrip - use Arrow
schema from Parquet metadata when available
new 25d96c5 ARROW-10225: [Rust] [Parquet] Fix null comparison in roundtrip
new 0b5a846 ARROW-10334: [Rust] [Parquet] NullArray roundtrip
new 9ba2179 ARROW-7842: [Rust] [Parquet] Arrow list reader
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (8ccd9c3)
\
N -- N -- N refs/heads/rust-parquet-arrow-writer (9ba2179)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
The 9 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
CHANGELOG.md | 584 +++++++++++++++++
LICENSE.txt | 8 +
c_glib/configure.ac | 2 +-
c_glib/meson.build | 2 +-
c_glib/test/test-decimal128.rb | 2 +-
ci/conda_env_archery.yml | 2 +
ci/scripts/PKGBUILD | 2 +-
cpp/CMakeLists.txt | 2 +-
cpp/cmake_modules/FindPython3Alt.cmake | 19 +-
cpp/cmake_modules/UseCython.cmake | 2 +-
cpp/examples/arrow/row-wise-conversion-example.cc | 2 +-
cpp/src/arrow/array/array_base.cc | 4 +
cpp/src/arrow/array/array_decimal.cc | 18 +-
cpp/src/arrow/array/array_decimal.h | 16 +
cpp/src/arrow/array/array_dict_test.cc | 66 +-
cpp/src/arrow/array/array_struct_test.cc | 10 +-
cpp/src/arrow/array/array_test.cc | 118 +++-
cpp/src/arrow/array/array_union_test.cc | 86 ++-
cpp/src/arrow/array/builder_adaptive.h | 20 +
cpp/src/arrow/array/builder_base.h | 16 +
cpp/src/arrow/array/builder_binary.cc | 14 +
cpp/src/arrow/array/builder_binary.h | 24 +-
cpp/src/arrow/array/builder_decimal.cc | 35 +
cpp/src/arrow/array/builder_decimal.h | 29 +
cpp/src/arrow/array/builder_dict.cc | 2 +-
cpp/src/arrow/array/builder_dict.h | 34 +-
cpp/src/arrow/array/builder_nested.cc | 39 +-
cpp/src/arrow/array/builder_nested.h | 52 +-
cpp/src/arrow/array/builder_primitive.h | 34 +
cpp/src/arrow/array/builder_union.h | 60 +-
cpp/src/arrow/array/concatenate.cc | 2 +-
cpp/src/arrow/array/validate.cc | 7 +
cpp/src/arrow/builder.cc | 2 +
cpp/src/arrow/c/bridge.cc | 24 +-
cpp/src/arrow/c/bridge_test.cc | 18 +-
cpp/src/arrow/compare.cc | 16 +
cpp/src/arrow/compute/api_aggregate.h | 6 +-
cpp/src/arrow/compute/api_scalar.h | 19 +
cpp/src/arrow/compute/cast.cc | 20 +-
cpp/src/arrow/compute/exec.cc | 159 +++--
cpp/src/arrow/compute/exec_test.cc | 21 +-
cpp/src/arrow/compute/function.cc | 15 +
cpp/src/arrow/compute/function.h | 60 +-
cpp/src/arrow/compute/function_test.cc | 18 +-
cpp/src/arrow/compute/kernel_test.cc | 14 +-
cpp/src/arrow/compute/kernels/aggregate_basic.cc | 34 +-
.../arrow/compute/kernels/aggregate_benchmark.cc | 48 ++
cpp/src/arrow/compute/kernels/aggregate_mode.cc | 12 +-
cpp/src/arrow/compute/kernels/aggregate_test.cc | 100 ++-
cpp/src/arrow/compute/kernels/aggregate_var_std.cc | 96 ++-
cpp/src/arrow/compute/kernels/scalar_arithmetic.cc | 82 ++-
cpp/src/arrow/compute/kernels/scalar_boolean.cc | 66 +-
cpp/src/arrow/compute/kernels/scalar_cast_test.cc | 2 +-
cpp/src/arrow/compute/kernels/scalar_compare.cc | 50 +-
cpp/src/arrow/compute/kernels/scalar_fill_null.cc | 10 +-
cpp/src/arrow/compute/kernels/scalar_nested.cc | 11 +-
cpp/src/arrow/compute/kernels/scalar_set_lookup.cc | 27 +-
cpp/src/arrow/compute/kernels/scalar_string.cc | 617 ++++++++++++++++--
.../compute/kernels/scalar_string_benchmark.cc | 6 +
.../arrow/compute/kernels/scalar_string_internal.h | 44 --
.../arrow/compute/kernels/scalar_string_test.cc | 82 +++
cpp/src/arrow/compute/kernels/scalar_validity.cc | 21 +-
cpp/src/arrow/compute/kernels/vector_hash.cc | 28 +-
cpp/src/arrow/compute/kernels/vector_nested.cc | 21 +-
cpp/src/arrow/compute/kernels/vector_selection.cc | 46 +-
cpp/src/arrow/compute/kernels/vector_sort.cc | 36 +-
cpp/src/arrow/compute/registry.cc | 2 +
cpp/src/arrow/compute/registry_test.cc | 7 +-
cpp/src/arrow/csv/parser.cc | 725 ++++++++++-----------
cpp/src/arrow/csv/parser.h | 158 ++---
cpp/src/arrow/csv/parser_benchmark.cc | 94 ++-
cpp/src/arrow/dataset/dataset_test.cc | 2 +-
cpp/src/arrow/dataset/filter.cc | 1 +
cpp/src/arrow/dbi/hiveserver2/thrift/Types.thrift | 2 +-
cpp/src/arrow/ipc/json_simple.cc | 25 +-
cpp/src/arrow/ipc/json_simple_test.cc | 52 +-
cpp/src/arrow/ipc/message.h | 2 +-
cpp/src/arrow/ipc/metadata_internal.cc | 27 +-
cpp/src/arrow/ipc/reader.h | 2 +-
cpp/src/arrow/ipc/writer.cc | 2 +-
cpp/src/arrow/json/parser_test.cc | 13 +-
cpp/src/arrow/pretty_print.cc | 9 +-
cpp/src/arrow/pretty_print_test.cc | 22 +-
cpp/src/arrow/python/arrow_to_pandas.cc | 31 +-
cpp/src/arrow/python/datetime.cc | 10 +
cpp/src/arrow/python/decimal.cc | 44 +-
cpp/src/arrow/python/decimal.h | 18 +
cpp/src/arrow/python/inference.cc | 13 +-
cpp/src/arrow/python/python_test.cc | 67 +-
cpp/src/arrow/python/python_to_arrow.cc | 6 +
cpp/src/arrow/scalar.cc | 8 +
cpp/src/arrow/scalar.h | 11 +
cpp/src/arrow/scalar_test.cc | 21 +-
cpp/src/arrow/testing/gtest_util.cc | 3 +-
cpp/src/arrow/testing/json_internal.cc | 39 +-
cpp/src/arrow/type.cc | 38 +-
cpp/src/arrow/type.h | 37 +-
cpp/src/arrow/type_fwd.h | 26 +-
cpp/src/arrow/type_test.cc | 90 ++-
cpp/src/arrow/type_traits.h | 28 +-
cpp/src/arrow/util/basic_decimal.cc | 233 +++++--
cpp/src/arrow/util/basic_decimal.h | 104 ++-
cpp/src/arrow/util/bit_run_reader.h | 4 +-
cpp/src/arrow/util/bit_util_benchmark.cc | 2 +-
cpp/src/arrow/util/bit_util_test.cc | 2 +-
cpp/src/arrow/util/byte_stream_split.h | 10 +-
cpp/src/arrow/util/decimal.cc | 135 +++-
cpp/src/arrow/util/decimal.h | 63 ++
cpp/src/arrow/util/decimal_benchmark.cc | 20 +-
cpp/src/arrow/util/decimal_test.cc | 154 +++++
cpp/src/arrow/util/hashing.h | 4 +-
cpp/src/arrow/util/spaced.h | 2 +-
cpp/src/arrow/util/utf8.h | 53 +-
cpp/src/arrow/util/utf8_util_test.cc | 38 ++
cpp/src/arrow/util/value_parsing.cc | 61 +-
cpp/src/arrow/vendored/fast_float/README.md | 9 +
cpp/src/arrow/vendored/fast_float/ascii_number.h | 313 +++++++++
.../arrow/vendored/fast_float/decimal_to_binary.h | 167 +++++
cpp/src/arrow/vendored/fast_float/fast_float.h | 47 ++
cpp/src/arrow/vendored/fast_float/fast_table.h | 691 ++++++++++++++++++++
cpp/src/arrow/vendored/fast_float/float_common.h | 263 ++++++++
cpp/src/arrow/vendored/fast_float/parse_number.h | 118 ++++
cpp/src/arrow/vendored/fast_float/thompson_tao.h | 375 +++++++++++
cpp/src/arrow/visitor.cc | 3 +
cpp/src/arrow/visitor.h | 3 +
cpp/src/arrow/visitor_inline.h | 1 +
cpp/src/gandiva/engine.cc | 2 +-
cpp/src/gandiva/function_registry_arithmetic.cc | 4 +
cpp/src/gandiva/precompiled/extended_math_ops.cc | 24 +-
.../gandiva/precompiled/extended_math_ops_test.cc | 10 +
cpp/src/gandiva/precompiled/types.h | 4 +
cpp/src/parquet/arrow/reader.cc | 4 +
cpp/src/parquet/arrow/reader_internal.cc | 4 +-
cpp/src/parquet/arrow/schema.cc | 3 +-
cpp/src/parquet/arrow/writer.cc | 1 -
cpp/src/parquet/column_writer.cc | 15 +-
csharp/Directory.Build.props | 2 +-
dev/archery/archery/integration/datagen.py | 37 +-
dev/archery/archery/release.py | 5 +-
dev/archery/archery/tests/test_release.py | 24 +-
dev/release/rat_exclude_files.txt | 56 +-
dev/release/verify-release-candidate.sh | 2 +
dev/tasks/conda-recipes/arrow-cpp/build-pyarrow.sh | 1 +
dev/tasks/conda-recipes/arrow-cpp/meta.yaml | 1 +
dev/tasks/homebrew-formulae/apache-arrow.rb | 2 +-
.../homebrew-formulae/autobrew/apache-arrow.rb | 2 +-
.../apache-arrow-archive-keyring/debian/changelog | 6 +
.../yum/apache-arrow-release.spec.in | 3 +
.../apache-arrow/debian.ubuntu-xenial/changelog | 6 +
.../apache-arrow/debian.ubuntu-xenial/control | 86 +--
.../libarrow-cuda-glib300.install} | 0
.../libarrow-cuda300.install} | 0
.../libarrow-dataset-glib300.install} | 0
.../libarrow-dataset300.install} | 0
.../libarrow-glib300.install} | 0
.../libarrow-python300.install} | 0
.../libarrow300.install} | 0
.../libgandiva-glib300.install} | 0
.../libgandiva300.install} | 0
.../libparquet-glib300.install} | 0
.../libparquet300.install} | 0
.../libplasma-glib300.install} | 0
.../libplasma300.install} | 0
.../linux-packages/apache-arrow/debian/changelog | 6 +
.../linux-packages/apache-arrow/debian/control | 102 +--
.../libarrow-cuda-glib300.install} | 0
.../libarrow-cuda300.install} | 0
.../libarrow-dataset-glib300.install} | 0
.../libarrow-dataset300.install} | 0
...light200.install => libarrow-flight300.install} | 0
.../libarrow-glib300.install} | 0
...0.install => libarrow-python-flight300.install} | 0
.../libarrow-python300.install} | 0
.../libarrow300.install} | 0
.../libgandiva-glib300.install} | 0
.../libgandiva300.install} | 0
.../libparquet-glib300.install} | 0
.../libparquet300.install} | 0
.../libplasma-glib300.install} | 0
.../libplasma300.install} | 0
.../linux-packages/apache-arrow/yum/arrow.spec.in | 3 +
dev/tasks/linux-packages/github.linux.amd64.yml | 4 +-
dev/tasks/tasks.yml | 304 ++++-----
docs/source/cpp/compute.rst | 31 +
docs/source/developers/archery.rst | 2 +-
docs/source/developers/contributing.rst | 48 ++
docs/source/developers/docker.rst | 2 +-
docs/source/format/Columnar.rst | 2 +
docs/source/java/vector.rst | 2 +-
docs/source/python/api/compute.rst | 2 +-
docs/source/python/dataset.rst | 2 +-
docs/source/python/ipc.rst | 18 -
java/adapter/avro/pom.xml | 2 +-
.../java/org/apache/arrow/AvroToArrowUtils.java | 2 +-
java/adapter/jdbc/pom.xml | 2 +-
.../arrow/adapter/jdbc/JdbcToArrowUtils.java | 2 +-
java/adapter/orc/pom.xml | 2 +-
java/algorithm/pom.xml | 2 +-
.../algorithm/deduplicate/DeduplicationUtils.java | 2 +-
.../deduplicate/VectorRunDeduplicator.java | 2 +-
.../deduplicate/TestDeduplicationUtils.java | 2 +-
java/flight/flight-core/pom.xml | 2 +-
.../apache/arrow/flight/BackpressureStrategy.java | 172 +++++
.../org/apache/arrow/flight/FlightService.java | 17 +-
.../arrow/flight/OutboundStreamListener.java | 13 +
.../arrow/flight/OutboundStreamListenerImpl.java | 5 +
.../org/apache/arrow/flight/TestBackPressure.java | 159 ++++-
.../arrow/flight/perf/PerformanceTestServer.java | 141 ++--
java/flight/flight-grpc/pom.xml | 2 +-
java/format/pom.xml | 2 +-
java/gandiva/pom.xml | 2 +-
.../arrow/gandiva/evaluator/DecimalTypeUtil.java | 2 +-
.../gandiva/evaluator/ExpressionRegistry.java | 2 +-
.../gandiva/evaluator/DecimalTypeUtilTest.java | 2 +-
.../gandiva/evaluator/ProjectorDecimalTest.java | 34 +-
java/memory/memory-core/pom.xml | 2 +-
.../java/org/apache/arrow/util/AutoCloseables.java | 0
.../java/org/apache/arrow/util/Collections2.java | 0
java/memory/memory-netty/pom.xml | 2 +-
java/memory/memory-unsafe/pom.xml | 2 +-
java/memory/pom.xml | 2 +-
java/performance/pom.xml | 6 +-
java/plasma/pom.xml | 2 +-
java/pom.xml | 4 +-
java/tools/pom.xml | 2 +-
java/vector/pom.xml | 2 +-
java/vector/src/main/codegen/data/ArrowTypes.tdd | 2 +-
.../src/main/codegen/data/ValueVectorTypes.tdd | 17 +
.../codegen/templates/AbstractFieldWriter.java | 8 +-
.../templates/AbstractPromotableFieldWriter.java | 26 +-
.../src/main/codegen/templates/ArrowType.java | 18 +-
.../src/main/codegen/templates/ComplexCopier.java | 5 +-
.../src/main/codegen/templates/ComplexWriters.java | 27 +-
.../main/codegen/templates/DenseUnionReader.java | 4 +-
.../main/codegen/templates/DenseUnionVector.java | 15 +-
.../main/codegen/templates/DenseUnionWriter.java | 12 +-
.../main/codegen/templates/HolderReaderImpl.java | 5 +
.../src/main/codegen/templates/StructWriters.java | 4 +-
.../templates/UnionFixedSizeListWriter.java | 54 ++
.../main/codegen/templates/UnionListWriter.java | 46 ++
.../src/main/codegen/templates/UnionMapWriter.java | 15 +
.../src/main/codegen/templates/UnionReader.java | 8 +-
.../src/main/codegen/templates/UnionVector.java | 31 +-
.../src/main/codegen/templates/UnionWriter.java | 50 +-
.../org/apache/arrow/vector/BaseValueVector.java | 2 +-
.../org/apache/arrow/vector/BitVectorHelper.java | 2 +-
.../java/org/apache/arrow/vector/BufferLayout.java | 5 +-
.../{DecimalVector.java => Decimal256Vector.java} | 91 +--
.../org/apache/arrow/vector/DecimalVector.java | 6 +-
.../java/org/apache/arrow/vector/TypeLayout.java | 2 +-
.../vector/complex/impl/PromotableWriter.java | 38 +-
.../apache/arrow/vector/ipc/JsonFileReader.java | 24 +-
.../apache/arrow/vector/ipc/JsonFileWriter.java | 13 +-
.../arrow/vector/ipc/message/ArrowRecordBatch.java | 2 +-
.../java/org/apache/arrow/vector/types/Types.java | 19 +
.../{ => vector}/util/DataSizeRoundingUtil.java | 2 +-
.../apache/arrow/vector/util/DecimalUtility.java | 50 +-
.../vector/validate/ValidateVectorTypeVisitor.java | 3 +-
...ecimalVector.java => TestDecimal256Vector.java} | 112 ++--
.../org/apache/arrow/vector/TestDecimalVector.java | 20 +-
.../org/apache/arrow/vector/TestTypeLayout.java | 6 +-
.../org/apache/arrow/vector/TestVectorAlloc.java | 2 +-
.../org/apache/arrow/vector/TestVectorReAlloc.java | 2 +-
.../vector/complex/impl/TestComplexCopier.java | 18 +-
.../vector/complex/writer/TestComplexWriter.java | 6 +-
.../apache/arrow/vector/types/pojo/TestSchema.java | 2 +-
.../arrow/vector/util/DecimalUtilityTest.java | 119 ++--
.../util/TestDataSizeRoundingUtil.java | 2 +-
js/package.json | 2 +-
matlab/CMakeLists.txt | 2 +-
python/manylinux1/build_arrow.sh | 8 -
python/manylinux1/scripts/install_cmake.sh | 2 +-
python/manylinux201x/build_arrow.sh | 8 -
python/manylinux201x/scripts/install_cmake.sh | 3 +-
python/pyarrow/__init__.py | 11 +-
python/pyarrow/_compute.pxd | 10 -
python/pyarrow/_compute.pyx | 254 ++++++--
python/pyarrow/_dataset.pyx | 11 +-
python/pyarrow/array.pxi | 9 +-
python/pyarrow/compute.py | 158 +++--
python/pyarrow/includes/libarrow.pxd | 52 +-
python/pyarrow/lib.pxd | 9 +
python/pyarrow/lib.pyx | 3 +-
python/pyarrow/plasma.py | 4 +
python/pyarrow/public-api.pxi | 4 +-
python/pyarrow/scalar.pxi | 23 +-
python/pyarrow/tests/strategies.py | 12 +-
python/pyarrow/tests/test_array.py | 10 +-
python/pyarrow/tests/test_compute.py | 125 +++-
python/pyarrow/tests/test_convert_builtin.py | 41 +-
python/pyarrow/tests/test_scalars.py | 23 +-
python/pyarrow/tests/test_schema.py | 1 +
python/pyarrow/tests/test_types.py | 33 +-
python/pyarrow/types.pxi | 49 +-
python/pyarrow/types.py | 17 +-
python/setup.py | 10 +-
r/.Rbuildignore | 1 +
r/.gitignore | 1 +
r/DESCRIPTION | 4 +-
r/NEWS.md | 5 +-
r/R/enums.R | 25 +-
r/R/parquet.R | 22 +-
r/man/write_parquet.Rd | 12 +-
r/tools/autobrew | 2 +-
r/tools/linuxlibs.R | 6 +
ruby/red-arrow-cuda/lib/arrow-cuda/version.rb | 2 +-
.../red-arrow-dataset/lib/arrow-dataset/version.rb | 2 +-
ruby/red-arrow/lib/arrow/version.rb | 2 +-
ruby/red-arrow/test/test-struct-array-builder.rb | 16 +-
ruby/red-arrow/test/test-struct-array.rb | 4 +-
ruby/red-gandiva/lib/gandiva/version.rb | 2 +-
ruby/red-parquet/lib/parquet/version.rb | 2 +-
ruby/red-plasma/lib/plasma/version.rb | 2 +-
rust/arrow-flight/Cargo.toml | 4 +-
rust/arrow-flight/src/utils.rs | 2 +-
rust/arrow/Cargo.toml | 2 +-
rust/arrow/examples/tensor_builder.rs | 2 +-
rust/arrow/src/array/array.rs | 271 ++++----
rust/arrow/src/array/builder.rs | 208 +++---
rust/arrow/src/array/data.rs | 33 +-
rust/arrow/src/array/equal.rs | 39 +-
rust/arrow/src/array/iterator.rs | 2 +-
rust/arrow/src/array/mod.rs | 3 +-
rust/arrow/src/array/union.rs | 6 +-
rust/arrow/src/bitmap.rs | 6 +-
rust/arrow/src/buffer.rs | 22 +-
rust/arrow/src/compute/kernels/arithmetic.rs | 8 +-
rust/arrow/src/compute/kernels/boolean.rs | 1 -
rust/arrow/src/compute/kernels/cast.rs | 4 +-
rust/arrow/src/compute/kernels/filter.rs | 373 ++++++++++-
rust/arrow/src/compute/kernels/sort.rs | 2 +-
rust/arrow/src/compute/kernels/take.rs | 2 +-
rust/arrow/src/datatypes.rs | 32 +-
rust/arrow/src/json/reader.rs | 595 +++++++++++++----
rust/arrow/src/lib.rs | 7 +-
rust/arrow/src/record_batch.rs | 2 +-
rust/arrow/src/tensor.rs | 4 +-
rust/arrow/src/util/bit_chunk_iterator.rs | 4 +-
rust/arrow/src/util/bit_util.rs | 4 +-
rust/arrow/src/util/display.rs | 2 +-
rust/arrow/test/data/list_string_dict_nested.json | 3 +
rust/benchmarks/Cargo.toml | 2 +-
rust/datafusion/Cargo.toml | 10 +-
rust/datafusion/README.md | 2 +-
rust/datafusion/examples/flight_server.rs | 2 +-
rust/datafusion/examples/simple_udaf.rs | 4 +-
rust/datafusion/examples/simple_udf.rs | 4 +-
rust/datafusion/src/datasource/csv.rs | 4 +-
rust/datafusion/src/datasource/memory.rs | 44 +-
rust/datafusion/src/datasource/parquet.rs | 11 +-
rust/datafusion/src/error.rs | 105 ++-
rust/datafusion/src/execution/context.rs | 37 +-
rust/datafusion/src/logical_plan/mod.rs | 28 +-
.../src/optimizer/projection_push_down.rs | 4 +-
rust/datafusion/src/optimizer/utils.rs | 8 +-
rust/datafusion/src/physical_plan/aggregates.rs | 10 +-
.../src/physical_plan/array_expressions.rs | 8 +-
rust/datafusion/src/physical_plan/common.rs | 46 +-
rust/datafusion/src/physical_plan/csv.rs | 39 +-
.../src/physical_plan/datetime_expressions.rs | 14 +-
.../src/physical_plan/distinct_expressions.rs | 6 +-
rust/datafusion/src/physical_plan/empty.rs | 16 +-
rust/datafusion/src/physical_plan/explain.rs | 14 +-
rust/datafusion/src/physical_plan/expressions.rs | 235 +++++--
rust/datafusion/src/physical_plan/filter.rs | 108 +--
rust/datafusion/src/physical_plan/functions.rs | 17 +-
rust/datafusion/src/physical_plan/group_scalar.rs | 14 +-
.../datafusion/src/physical_plan/hash_aggregate.rs | 232 ++++---
rust/datafusion/src/physical_plan/limit.rs | 39 +-
.../src/physical_plan/math_expressions.rs | 10 +-
rust/datafusion/src/physical_plan/memory.rs | 36 +-
rust/datafusion/src/physical_plan/merge.rs | 61 +-
rust/datafusion/src/physical_plan/mod.rs | 21 +-
rust/datafusion/src/physical_plan/parquet.rs | 48 +-
rust/datafusion/src/physical_plan/planner.rs | 40 +-
rust/datafusion/src/physical_plan/projection.rs | 74 ++-
rust/datafusion/src/physical_plan/sort.rs | 18 +-
.../src/physical_plan/string_expressions.rs | 6 +-
rust/datafusion/src/physical_plan/type_coercion.rs | 10 +-
rust/datafusion/src/scalar.rs | 17 +-
rust/datafusion/src/sql/parser.rs | 2 +-
rust/datafusion/src/sql/planner.rs | 52 +-
rust/datafusion/src/test/mod.rs | 1 -
rust/datafusion/src/variable/mod.rs | 2 +-
rust/datafusion/tests/sql.rs | 74 ++-
rust/datafusion/tests/user_defined_plan.rs | 115 ++--
rust/integration-testing/Cargo.toml | 2 +-
rust/parquet/Cargo.toml | 6 +-
rust/parquet/README.md | 4 +-
rust/parquet/src/arrow/array_reader.rs | 6 +-
rust/parquet/src/arrow/arrow_writer.rs | 2 +-
rust/parquet/src/arrow/converter.rs | 4 +-
rust/parquet_derive/Cargo.toml | 4 +-
rust/parquet_derive/README.md | 4 +-
rust/parquet_derive_test/Cargo.toml | 6 +-
395 files changed, 10971 insertions(+), 3342 deletions(-)
delete mode 100644 cpp/src/arrow/compute/kernels/scalar_string_internal.h
create mode 100644 cpp/src/arrow/vendored/fast_float/README.md
create mode 100644 cpp/src/arrow/vendored/fast_float/ascii_number.h
create mode 100644 cpp/src/arrow/vendored/fast_float/decimal_to_binary.h
create mode 100644 cpp/src/arrow/vendored/fast_float/fast_float.h
create mode 100644 cpp/src/arrow/vendored/fast_float/fast_table.h
create mode 100644 cpp/src/arrow/vendored/fast_float/float_common.h
create mode 100644 cpp/src/arrow/vendored/fast_float/parse_number.h
create mode 100644 cpp/src/arrow/vendored/fast_float/thompson_tao.h
rename
dev/tasks/linux-packages/apache-arrow/{debian/libarrow-cuda-glib200.install =>
debian.ubuntu-xenial/libarrow-cuda-glib300.install} (100%)
rename dev/tasks/linux-packages/apache-arrow/{debian/libarrow-cuda200.install
=> debian.ubuntu-xenial/libarrow-cuda300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian/libarrow-dataset-glib200.install
=> debian.ubuntu-xenial/libarrow-dataset-glib300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian/libarrow-dataset200.install =>
debian.ubuntu-xenial/libarrow-dataset300.install} (100%)
rename dev/tasks/linux-packages/apache-arrow/{debian/libarrow-glib200.install
=> debian.ubuntu-xenial/libarrow-glib300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian/libarrow-python200.install =>
debian.ubuntu-xenial/libarrow-python300.install} (100%)
rename dev/tasks/linux-packages/apache-arrow/{debian/libarrow200.install =>
debian.ubuntu-xenial/libarrow300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian/libgandiva-glib200.install =>
debian.ubuntu-xenial/libgandiva-glib300.install} (100%)
rename dev/tasks/linux-packages/apache-arrow/{debian/libgandiva200.install =>
debian.ubuntu-xenial/libgandiva300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian/libparquet-glib200.install =>
debian.ubuntu-xenial/libparquet-glib300.install} (100%)
rename dev/tasks/linux-packages/apache-arrow/{debian/libparquet200.install =>
debian.ubuntu-xenial/libparquet300.install} (100%)
rename dev/tasks/linux-packages/apache-arrow/{debian/libplasma-glib200.install
=> debian.ubuntu-xenial/libplasma-glib300.install} (100%)
rename dev/tasks/linux-packages/apache-arrow/{debian/libplasma200.install =>
debian.ubuntu-xenial/libplasma300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libarrow-cuda-glib200.install
=> debian/libarrow-cuda-glib300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libarrow-cuda200.install
=> debian/libarrow-cuda300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libarrow-dataset-glib200.install
=> debian/libarrow-dataset-glib300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libarrow-dataset200.install
=> debian/libarrow-dataset300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/debian/{libarrow-flight200.install =>
libarrow-flight300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libarrow-glib200.install
=> debian/libarrow-glib300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/debian/{libarrow-python-flight200.install
=> libarrow-python-flight300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libarrow-python200.install
=> debian/libarrow-python300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libarrow200.install
=> debian/libarrow300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libgandiva-glib200.install
=> debian/libgandiva-glib300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libgandiva200.install
=> debian/libgandiva300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libparquet-glib200.install
=> debian/libparquet-glib300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libparquet200.install
=> debian/libparquet300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libplasma-glib200.install
=> debian/libplasma-glib300.install} (100%)
rename
dev/tasks/linux-packages/apache-arrow/{debian.ubuntu-xenial/libplasma200.install
=> debian/libplasma300.install} (100%)
create mode 100644
java/flight/flight-core/src/main/java/org/apache/arrow/flight/BackpressureStrategy.java
rename java/{vector =>
memory/memory-core}/src/main/java/org/apache/arrow/util/AutoCloseables.java
(100%)
rename java/{vector =>
memory/memory-core}/src/main/java/org/apache/arrow/util/Collections2.java (100%)
copy java/vector/src/main/java/org/apache/arrow/vector/{DecimalVector.java =>
Decimal256Vector.java} (84%)
rename java/vector/src/main/java/org/apache/arrow/{ =>
vector}/util/DataSizeRoundingUtil.java (98%)
copy java/vector/src/test/java/org/apache/arrow/vector/{TestDecimalVector.java
=> TestDecimal256Vector.java} (75%)
rename java/vector/src/test/java/org/apache/arrow/{ =>
vector}/util/TestDataSizeRoundingUtil.java (98%)
create mode 100644 rust/arrow/test/data/list_string_dict_nested.json