Script 'mail_helper' called by obssrc
Hello community,

here is the log from the commit of package apache-arrow for openSUSE:Factory checked in at 2024-08-16 12:23:38
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Comparing /work/SRC/openSUSE:Factory/apache-arrow (Old)
 and /work/SRC/openSUSE:Factory/.apache-arrow.new.2698 (New)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Package is "apache-arrow"

Fri Aug 16 12:23:38 2024 rev:13 rq:1194086 version:17.0.0

Changes:
--------
--- /work/SRC/openSUSE:Factory/apache-arrow/apache-arrow.changes 2024-04-25 20:50:53.117812536 +0200
+++ /work/SRC/openSUSE:Factory/.apache-arrow.new.2698/apache-arrow.changes 2024-08-16 12:24:16.804603293 +0200
@@ -1,0 +2,181 @@
+Mon Aug 12 17:11:06 UTC 2024 - Ben Greiner <c...@bnavigator.de>
+
+- Update to 17.0.0
+  ## Bug Fixes
+  * [C++] Add option to string "center" kernel to control
+    left/right alignment on odd number of padding (#41449)
+  * [C++][Python] Fix casting to extension type with fixed size
+    list storage type (#42219)
+  * [C++] Replace null_count with MayHaveNulls in
+    ListArrayFromArray and MapArray (#41957)
+  * [C++][Python] RecordBatch.filter() segfaults if passed a
+    ChunkedArray (#40971)
+  * [C++][Parquet] Timestamp conversion from Parquet to Arrow does
+    not follow compatibility guidelines for convertedType
+  * [C++] Use LargeStringArray for casting when writing tables to
+    CSV (#40271)
+  * [C++][Python] Map child Array constructed from keys and items
+    shouldn't have offset (#40871)
+  * [C++] Fix compile warning with "implicitly-defined constructor
+    does not initialize" in encoding_benchmark (#41060)
+  * [C++] Get null_bit_id according to are_cols_in_encoding_order
+    in NullUpdateColumnToRow_avx2 (#40998)
+  * [C++] Clean up unused parameter warnings (#41111)
+  * [C++][Acero] Fix asof join race (#41614)
+  * [C++] support for single threaded joins (#41125)
+  * [C++] Fix hashjoin benchmark failed at make utf8's random
+    batches (#41195)
+  * [C++] Check to avoid copying when NullBitmapBuffer is Null
+    (#41452)
+  * [C++] Fix crash on invalid Parquet file (#41366)
+  * [C++][Parquet] More strict Parquet level checking (#41346)
+  * [C++][Gandiva] Fix gandiva cache size env var (#41330)
+  * [C++][CMake][Windows] Remove needless .dll suffix from link
+    libraries (#41341)
+  * [C++][CMake] Remove unused ARROW_NO_DEPRECATED_API (#41345)
+  * [C++][maybe_unused] with Arrow macro (#41359)
+  * [C++][Large] ListView and Map nested types for scalar_if_else's
+    kernel functions (#41419)
+  * [C++][Gandiva] Fix ascii_utf8 function to return same result on
+    x86 and Arm (#41434)
+  * [C++] Reuse deduplication logic for direct registration
+    (#41466)
+  * [C++] Clean up more redundant move warnings (#41487)
+  * [C++][Compute] Remove redundant logic for ArrayData as
+    ExecResults in ExecScalarCaseWhen (#41380)
+  * [C++][CMake] correctly use Protobuf_PROTOC_EXECUTABLE (#41582)
+  * [C++][CMake] Fix ARROW_USE_BOOST detect condition (#41622)
+  * [C++][Python] Add optional null_bitmap to MapArray::FromArrays
+    (#41757)
+  * [C++] macros.h: Fix ARROW_FORCE_INLINE for MSVC (#41712)
+  * [C++][Acero] Remove an useless parameter for QueryContext::Init
+    called in hash_join_benchmark (#41716)
+  * [C++] Fix the issue that temp vector stack may be under sized
+    (#41746)
+  * [C++] Check that extension metadata key is present before
+    attempting to delete it (#41763)
+  * [C++] Iterator releases its resource immediately when it reads
+    all values (#41824)
+  * [C++][Flight][Benchmark] Ensure waiting server ready (#41793)
+  * [C++] Fix avx2 gather offset larger than 2GB in
+    CompareColumnsToRows (#42188)
+  * [C++][S3] Fix potential deadlock when closing output stream
+    (#41876)
+  * [CI][C++] Clear cache for mamba on AppVeyor (#41977)
+  * [CI][Python][C++] Fix utf8proc detection for wheel on Windows
+    (#42022)
+  * [C++] Support list-views on list_slice (#42067)
+  * [C++] Fix an OTel test failure and remove needless logs
+    (#42122)
+  * [C++][FS][Azure] Ensure setting BlobSasBuilder::Protocol
+    (#42108)
+  * [C++] Support list-view typed arrays in array_take and
+    array_filter (#42117)
+  * [C++] Fix some potential uninitialized variable warnings
+    (#42207)
+  * [C++] Avoid invalid accesses in parquet-encoding-benchmark
+    (#42141)
+  * [C++] Use FetchContent for bundled ORC (#43011)
+  * [C++] Fix GetRecordBatchPayload crashes for device data
+    (#42199)
+  * [C++] Use non-stale c-ares download URL (#42250)
+  * [C++][Parquet] Check for valid ciphertext length to prevent
+    segfault (#43071)
+  * [C++][Compute] Mark KeyCompare.CompareColumnsToRowsLarge as
+    large memory test (#43128)
+  * [C++] Upgrade bundled google-cloud-cpp to 2.22.0 (#43136)
+  ## New Features and Improvements
+  * [C++][Compute] Implement Grouper::Reset (#41352)
+  * [Go][C++] Implement Flight SQL Bulk Ingestion (#38385)
+  * [C++][FS][Azure] Support azure cli auth (#41976)
+  * [C++][FS][Azure] Add support for environment credential
+    (#41715)
+  * [C++] Optimize Take for fixed-size types including nested
+    fixed-size lists (#41297)
+  * [C++][Device] Add Copy/View slice functions to a CPU pointer
+    (#41477)
+  * [C++] Add support for OpenTelemetry logging (#39905)
+  * [C++] Import/Export ArrowDeviceArrayStream (#40807)
+  * [C++] move LocalFileSystem to the registry (#40356)
+  * [C++] Make flatbuffers serialization more deterministic
+    (#40392)
+  * [C++][Gandiva] add RE2::Options set_dot_nl(true) for Like
+    function (#40970)
+  * [C++] Introduce portable compiler assumptions (#41021)
+  * [C++] Add a grouper benchmark for preventing performance
+    regression (#41036)
+  * [C++] Support flatten for combining nested list related types
+    (#41092)
+  * [C++] Clean up remaining tasks related to half float casts
+    (#41084)
+  * [C++][FS][Azure] Add support for CopyFile with hierarchical
+    namespace support (#41276)
+  * [C++] Add is_validity_defined_by_bitmap() predicate (#41115)
+  * [C++] IO: enhance boundary checking in CompressedInputStream
+    (#41117)
+  * [C++][Python] Expose recursive flatten for lists on
+    list_flatten kernel function and pyarrow bindings (#41295)
+  * [C++][Parquet][Doc] Denote PARQUET:field_id in parquet.rst
+    (#41187)
+  * [C++] Extract the kernel loops used for PrimitiveTakeExec and
+    generalize to any fixed-width type (#41373)
+  * [C++][Acero] Use per-node basis temp vector stack to mitigate
+    overflow (#41335)
+  * [C++][Parquet] Optimize DelimitRecords by batch execution when
+    max_rep_level > 1 (#41362)
+  * [C++][FS][Azure][Docs] Add AzureFileSystem to Filesystems API
+    reference (#41411)
+  * [C++] Use ASAN to poison temp vector stack memory (#41695)
+  * [C++][S3] Add a new option to check existence before CreateDir
+    (#41822)
+  * [C++][Parquet] Fix
+    DeltaLengthByteArrayEncoder::EstimatedDataEncodedSize (#41546)
+  * [C++] Thirdparty: Upgrade xsimd to 13.0.0 (#41548)
+  * [C++] Improve fixed_width_test_util.h (#41575)
+  * [C++] ChunkResolver: Implement ResolveMany and add unit tests
+    (#41561)
+  * [C++] fixed_width_internal.h: Simplify docstring and support
+    bit-sized types (BOOL) (#41597)
+  * [C++][Python] Extends the add_key_value to parquet::arrow and
+    PyArrow (#41633)
+  * [C++][CMake][Windows] Don't build needless object libraries
+    (#41658)
+  * [C++][Python] PrettyPrint non-cpu data by copying to default
+    CPU device (#42010)
+  * [C++][Parquet] Thrift: generate template method to accelerate
+    reading thrift (#41703)
+  * [C++][Parquet] Minor: moving EncodedStats by default rather
+    than copying (#41727)
+  * [C++][ORC] Ensure setting detected ORC version (#41767)
+  * [C++][Parquet] Add file metadata read/write benchmark (#41761)
+  * [C++] Make git-dependent definitions internal (#41781)
+  * [C++][S3] Remove GetBucketRegion hack for newer AWS SDK
+    versions (#41798)
+  * [C++][Parquet] normalize dictionary encoding to use
+    RLE_DICTIONARY (#41819)
+  * [C++] IPC: Minor enhance the code of writer (#41900)
+  * [C++] Fix ExecuteScalar deduce all_scalar with chunked_array
+    (#41925)
+  * [C++] Minor enhance code style for FixedShapeTensorType
+    (#41954)
+  * [C++] Follow up of adding null_bitmap to MapArray::FromArrays
+    (#41956)
+  * [C++] Misc changes making code around list-like types and
+    list-view types behave the same way (#41971)
+  * [C++] : kernel.cc: Remove defaults on switch so that compiler
+    can check full enum coverage for us (#41995)
+  * [C++][Parquet] ParquetFilePrinter::JSONPrint print length of
+    FLBA (#41981)
+  * [C++][CMake] Add preset for Valgrind (#42110)
+  * [C++] Move TakeXXX free functions into TakeMetaFunction and
+    make them private (#42127)
+  * [C++][FS][Azure] Validate
+    AzureOptions::{blob,dfs}_storage_scheme (#42135)
+  * [C++] list_parent_indices: Add support for list-view types
+    (#42236)
+  * [C++] Reduce the recursion of many-join test (#43042)
+  * [C++] Limit buffer size in BufferedInputStream::SetBufferSize
+    with raw_read_bound (#43064)
+- Require cmake lz4 for 1.10
+
+-------------------------------------------------------------------
--- /work/SRC/openSUSE:Factory/apache-arrow/python-pyarrow.changes 2024-04-25 20:50:53.661832295 +0200
+++ /work/SRC/openSUSE:Factory/.apache-arrow.new.2698/python-pyarrow.changes 2024-08-16 12:24:16.908607615 +0200
@@ -1,0 +2,99 @@
+Wed Aug 14 20:27:48 UTC 2024 - Ben Greiner <c...@bnavigator.de>
+
+- Update to 17.0.0
+  ## Bug Fixes
+  * [C++][Python] Fix casting to extension type with fixed size
+    list storage type (#42219)
+  * [Python] Include metadata when creating pa.schema from
+    PyCapsule (#41538)
+  * [C++][Python] RecordBatch.filter() segfaults if passed a
+    ChunkedArray (#40971)
+  * [Python] pa.array: add check for byte-swapped numpy arrays
+    inside python objects (#41549)
+  * [Python] Fix read_table for encrypted parquet (#39438)
+  * [Python] RunEndEncodedArray.from_arrays: bugfix for Array
+    arguments (#40560) (#41093)
+  * [C++][Python] Map child Array constructed from keys and items
+    shouldn't have offset (#40871)
+  * [Python] `test_numpy_array_protocol` test failures with numpy
+    2.0.0rc1
+  * [Python] Fix StructArray.sort() for by=None (#41495)
+  * [Python] Build with Python 3.13 (#42034)
+  * [Python] remove special methods related to buffers in python
+    <2.6 (#41492)
+  * [Python] Fix reading column index with decimal values (#41503)
+  * [Docs][Python] Remove duplicate contents (#41588)
+  * [C++][Python] Add optional null_bitmap to MapArray::FromArrays
+    (#41757)
+  * [Python][Parquet] Implement to_dict method on SortingColumn
+    (#41704)
+  * [Python] CMake: ignore Parquet encryption option if Parquet
+    itself is not enabled (fix Java integration build) (#41776)
+  * [Python] Disallow direct pa.RecordBatchReader() construction to
+    avoid segfaults (#41773)
+  * [Python] Fix RecordBatchReader.cast to support casting to equal
+    schema for all types (#42098)
+  * [Python] Fix tests when using NumPy 2.0 on Windows (#42099)
+  * [CI][Python] Use pip install -e instead of setup.py build_ext
+    --inplace for installing pyarrow on verification script (#42007)
+  * [CI][Python][C++] Fix utf8proc detection for wheel on Windows
+    (#42022)
+  * [Python][CI] Update expected output for numpy 2.0.0 (#42172)
+  ## New Features and Improvements
+  * [Python] Replace pandas.util.testing.rands with vendored
+    version (#42089)
+  * [Python] begin moving static settings to pyproject.toml
+    (#41041)
+  * [Python] Implement PyCapsule interface for Device data in
+    PyArrow (#40717)
+  * [Python] Expand the Arrow PyCapsule Interface with C Device
+    Data support (#40708)
+  * [Python] Let RecordBatch.filter accept a boolean expression in
+    addition to mask array (#43043)
+  * [Python] Fix pickling of LocalFileSystem for cython 2 (#41459)
+  * [Python] Expand the C Device Interface bindings to support
+    import on CUDA device (#40385)
+  * [Python] Allow passing a mapping of column names to
+    rename_columns (#40645)
+  * [Python][Packaging] Strip unnecessary symbols when building
+    wheels (#42028)
+  * [Python][Docs] Update PyArrow installation docs for conda
+    package split (#41135)
+  * [Python] Basic bindings for Device and MemoryManager classes
+    (#41685)
+  * [C++][Python] Expose recursive flatten for lists on
+    list_flatten kernel function and pyarrow bindings (#41295)
+  * [Python][Packaging] Ensure to build with released numpy 2.0
+    (instead of RC) in the wheel building workflows (#42194)
+  * [CI][Python] Add a job on ARM64 macOS (#41313)
+  * [CI][Python] Reduce CI time on macOS (#41378)
+  * [Python] Expose byte_width and bit_width of ExtensionType in
+    terms of the storage type (#41413)
+  * [Python] Update Python development guide about components being
+    enabled by default based on Arrow C++ (#41705)
+  * [Python] Building PyArrow: enable/disable python components by
+    default based on availability in Arrow C++ (#41494)
+  * [C++][Python] Extends the add_key_value to parquet::arrow and
+    PyArrow (#41633)
+  * [Python] Ensure Buffer methods don't crash with non-CPU data
+    (#41889)
+  * [C++][Python] PrettyPrint non-cpu data by copying to default
+    CPU device (#42010)
+  * [Python][Parquet] Update BYTE_STREAM_SPLIT description in
+    write_table() docstring (#41759)
+  * [Python] Add support for Pyodide (#37822)
+  * [Python] Fix pandas tests to follow downstream datetime64 unit
+    changes (#41979)
+  * [Python] Allow Array.filter() to take general array input
+    (#42051)
+  * [Python] Expose new FLOAT16 logical type in the pyarrow.parquet
+    bindings (#42103)
+  * [Python] Array gracefully fails on non-cpu device (#42113)
+  * [Python][Parquet] Pyarrow store decimal as integer (#42169)
+  * [Python] Add CI job for Numpy 1.X (#42189)
+  * [CI][Python] Pin openjdk=17 in python substrait integration
+    (#43051)
+- Drop pyarrow-pr41319-numpy2-tests.patch
+- Add pyarrow-pr433325-extradirs.patch gh#apache/arrow/pull/43325
+
+-------------------------------------------------------------------

Old:
----
  apache-arrow-16.0.0.tar.gz
  arrow-testing-16.0.0.tar.gz
  parquet-testing-16.0.0.tar.gz
  pyarrow-pr41319-numpy2-tests.patch

New:
----
  apache-arrow-17.0.0.tar.gz
  arrow-testing-17.0.0.tar.gz
  parquet-testing-17.0.0.tar.gz
  pyarrow-pr433325-extradirs.patch

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Other differences:
------------------
++++++ apache-arrow.spec ++++++
--- /var/tmp/diff_new_pack.LbwoBE/_old 2024-08-16 12:24:18.484673114 +0200
+++ /var/tmp/diff_new_pack.LbwoBE/_new 2024-08-16 12:24:18.488673281 +0200
@@ -22,13 +22,13 @@
 # Required for runtime dispatch, not yet packaged
 %bcond_with xsimd
-%define sonum 1600
+%define sonum 1700
 # See git submodule /testing pointing to the correct revision
-%define arrow_testing_commit 25d16511e8d42c2744a1d94d90169e3a36e92631
+%define arrow_testing_commit 735ae7128d571398dd798d7ff004adebeb342883
 # See git submodule /cpp/submodules/parquet-testing pointing to the correct revision
 %define parquet_testing_commit 74278bc4a1122d74945969e6dec405abd1533ec3
 Name: apache-arrow
-Version: 16.0.0
+Version: 17.0.0
 Release: 0
 Summary: A development platform for in-memory data
 License: Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT
@@ -51,6 +51,7 @@
 BuildRequires: pkgconfig
 BuildRequires: python-rpm-macros
 BuildRequires: python3-base
+BuildRequires: (cmake(lz4) >= 1.10 or (pkgconfig(liblz4) >= 1.8.3 with pkgconfig(liblz4) < 1.10))
 BuildRequires: cmake(Snappy) >= 1.1.7
 BuildRequires: cmake(absl)
 BuildRequires: cmake(double-conversion) >= 3.1.5
@@ -64,7 +65,6 @@
 BuildRequires: pkgconfig(libbrotlienc) >= 1.0.7
 BuildRequires: pkgconfig(libcares) >= 1.15.0
 BuildRequires: pkgconfig(libglog) >= 0.3.5
-BuildRequires: pkgconfig(liblz4) >= 1.8.3
 BuildRequires: pkgconfig(libopenssl)
 BuildRequires: pkgconfig(liburiparser) >= 0.9.3
 BuildRequires: pkgconfig(libutf8proc)

++++++ python-pyarrow.spec ++++++
--- /var/tmp/diff_new_pack.LbwoBE/_old 2024-08-16 12:24:18.528674943 +0200
+++ /var/tmp/diff_new_pack.LbwoBE/_new 2024-08-16 12:24:18.528674943 +0200
@@ -18,17 +18,20 @@
 %bcond_with xsimd
 %define plainpython python
+# See git submodule /testing pointing to the correct revision
+%define arrow_testing_commit 735ae7128d571398dd798d7ff004adebeb342883
 Name: python-pyarrow
-Version: 16.0.0
+Version: 17.0.0
 Release: 0
 Summary: Python library for Apache Arrow
 License: Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT
 URL: https://arrow.apache.org/
 # SourceRepository: https://github.com/apache/arrow
 Source0: apache-arrow-%{version}.tar.gz
+Source1: arrow-testing-%{version}.tar.gz
 Source99: python-pyarrow.rpmlintrc
-# PATCH-FIX-UPSTREAM pyarrow-pr41319-numpy2-tests.patch gh#apache/arrow#41319
-Patch0: pyarrow-pr41319-numpy2-tests.patch
+# PATCH-FIX-UPSTREAM pyarrow-pr433325-extradirs.patch gh#apache/arrow/pull/43325
+Patch0: pyarrow-pr433325-extradirs.patch
 BuildRequires: %{python_module Cython >= 0.29.31}
 BuildRequires: %{python_module devel >= 3.8}
 BuildRequires: %{python_module numpy-devel >= 1.25}
@@ -88,7 +91,8 @@
 platlib for consuming modules using cythonization.
 %prep
-%autosetup -p1 -n arrow-apache-arrow-%{version}
+%setup -n arrow-apache-arrow-%{version} -a1
+%autopatch -p1
 # we disabled the jemalloc backend in apache-arrow
 sed -i 's/should_have_jemalloc = sys.platform == "linux"/should_have_jemalloc = False/' python/pyarrow/tests/test_memory.py
@@ -122,8 +126,12 @@
 popd
 %check
-# flaky
+export ARROW_TEST_DATA="${PWD}/arrow-testing-%{arrow_testing_commit}/data"
+# flaky tests
 donttest="test_total_bytes_allocated"
+donttest="$donttest or test_batch_lifetime"
+# worker crashes, we don't have an s3 setup in obs anyway
+donttest="$donttest or test_s3fs_limited_permissions_create_bucket"
 %ifarch %{ix86} %{arm32}
 # tests conversion to 64bit datatypes
 donttest="$donttest or test_conversion"

++++++ apache-arrow-16.0.0.tar.gz -> apache-arrow-17.0.0.tar.gz ++++++
/work/SRC/openSUSE:Factory/apache-arrow/apache-arrow-16.0.0.tar.gz /work/SRC/openSUSE:Factory/.apache-arrow.new.2698/apache-arrow-17.0.0.tar.gz differ: char 13, line 1

++++++ arrow-testing-16.0.0.tar.gz -> arrow-testing-17.0.0.tar.gz ++++++
Binary files old/arrow-testing-25d16511e8d42c2744a1d94d90169e3a36e92631/data/arrow-ipc-stream/clusterfuzz-testcase-minimized-arrow-ipc-stream-fuzz-5048291196731392 and new/arrow-testing-735ae7128d571398dd798d7ff004adebeb342883/data/arrow-ipc-stream/clusterfuzz-testcase-minimized-arrow-ipc-stream-fuzz-5048291196731392 differ

++++++ parquet-testing-16.0.0.tar.gz -> parquet-testing-17.0.0.tar.gz ++++++
++++ no output (probably identical)

++++++ pyarrow-pr433325-extradirs.patch ++++++
From 888a5ae568d155d03fbff0db8849517fd24a99ff Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 18 Jul 2024 16:48:52 +0200
Subject: [PATCH 1/9] GH-43299: [Release][Packaging] Only include pyarrow and pyarrow.* when finding packages on setuptools

---
 python/pyproject.toml | 1 +
 1 file changed, 1 insertion(+)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d863bb3e5f0ac..d70b7fcce5903 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,6 +74,7 @@ include-package-data=true
 [tool.setuptools.packages.find]
 where = ["."]
+include = ["pyarrow", "pyarrow.*"]
 [tool.setuptools.package-data]
 pyarrow = ["*.pxd", "*.pyx", "includes/*.pxd"]

From 46d1afc62514ae04a3815aede7722ac5a9ecce64 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 18 Jul 2024 17:33:33 +0200
Subject: [PATCH 2/9] Update include

---
 python/pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d70b7fcce5903..d1c5a799f870f 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,7 +74,7 @@ include-package-data=true
 [tool.setuptools.packages.find]
 where = ["."]
-include = ["pyarrow", "pyarrow.*"]
+include = ["pyarrow*"]
 [tool.setuptools.package-data]
 pyarrow = ["*.pxd", "*.pyx", "includes/*.pxd"]

From d954d75432f05723fca0644842deafd941802842 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 18 Jul 2024 18:00:40 +0200
Subject: [PATCH 3/9] try again without *

---
 python/pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d1c5a799f870f..222f8d2ece681 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,7 +74,7 @@ include-package-data=true
 [tool.setuptools.packages.find]
 where = ["."]
-include = ["pyarrow*"]
+include = ["pyarrow"]
 [tool.setuptools.package-data]
 pyarrow = ["*.pxd", "*.pyx", "includes/*.pxd"]

From 2fa434ffc03cca1a251c80c51dd6e98f63db19d1 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 18 Jul 2024 18:36:55 +0200
Subject: [PATCH 4/9] Exclude tests from wheels

---
 python/pyproject.toml | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index 222f8d2ece681..45c3b60c8aeed 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -75,6 +75,8 @@ include-package-data=true
 [tool.setuptools.packages.find]
 where = ["."]
 include = ["pyarrow"]
+exclude = ["pyarrow.tests"]
+namespaces = false
 [tool.setuptools.package-data]
 pyarrow = ["*.pxd", "*.pyx", "includes/*.pxd"]

From 204a27b0534161a35e2d79241dcadd0471341c2a Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Fri, 19 Jul 2024 12:43:29 +0200
Subject: [PATCH 5/9] Try excluding pyarrow. and pyarrow/tests explicitly

---
 python/pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index 45c3b60c8aeed..d675f07a82391 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -75,7 +75,7 @@ include-package-data=true
 [tool.setuptools.packages.find]
 where = ["."]
 include = ["pyarrow"]
-exclude = ["pyarrow.tests"]
+exclude = ["pyarrow/tests", "pyarrow."]
 namespaces = false
 [tool.setuptools.package-data]

From a1d73a28e3d6e57366ff43d06389a2f3fa47c7de Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Fri, 19 Jul 2024 13:40:25 +0200
Subject: [PATCH 6/9] Try removing where from packages find

---
 python/pyproject.toml | 1 -
 1 file changed, 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d675f07a82391..9a91fd76a4a20 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -73,7 +73,6 @@ zip-safe=false
 include-package-data=true
 [tool.setuptools.packages.find]
-where = ["."]
 include = ["pyarrow"]
 exclude = ["pyarrow/tests", "pyarrow."]
 namespaces = false

From 346c0f1982735cac2a4b76a13efbb2a201bf158f Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Mon, 22 Jul 2024 14:46:54 +0200
Subject: [PATCH 7/9] Try with pyarrow.tests

---
 python/pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index 9a91fd76a4a20..d83cf8fe45d8c 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,7 +74,7 @@ include-package-data=true
 [tool.setuptools.packages.find]
 include = ["pyarrow"]
-exclude = ["pyarrow/tests", "pyarrow."]
+exclude = ["pyarrow.tests"]
 namespaces = false
 [tool.setuptools.package-data]

From f6273223a1b006406bf315f41424be03a51a3b1e Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Mon, 22 Jul 2024 15:34:31 +0200
Subject: [PATCH 8/9] Remove excludes

---
 python/pyproject.toml | 1 -
 1 file changed, 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d83cf8fe45d8c..7e14795428315 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,7 +74,6 @@ include-package-data=true
 [tool.setuptools.packages.find]
 include = ["pyarrow"]
-exclude = ["pyarrow.tests"]
 namespaces = false
 [tool.setuptools.package-data]

From f855f0c14fbc4703123e36924f1641cf4a48396a Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 25 Jul 2024 11:33:51 +0200
Subject: [PATCH 9/9] Remove PYARROW_INSTALL_TESTS and packages definition on setup.py

---
 ci/scripts/python_wheel_macos_build.sh | 1 -
 ci/scripts/python_wheel_manylinux_build.sh | 1 -
 ci/scripts/python_wheel_windows_build.bat | 1 -
 docs/source/developers/python.rst | 3 ---
 python/setup.py | 16 +---------------
 5 files changed, 1 insertion(+), 21 deletions(-)

diff --git a/ci/scripts/python_wheel_macos_build.sh b/ci/scripts/python_wheel_macos_build.sh
index 3ed9d5d8dd12f..6c314d0632f60 100755
--- a/ci/scripts/python_wheel_macos_build.sh
+++ b/ci/scripts/python_wheel_macos_build.sh
@@ -152,7 +152,6 @@ echo "=== (${PYTHON_VERSION}) Building wheel ==="
 export PYARROW_BUILD_TYPE=${CMAKE_BUILD_TYPE}
 export PYARROW_BUNDLE_ARROW_CPP=1
 export PYARROW_CMAKE_GENERATOR=${CMAKE_GENERATOR}
-export PYARROW_INSTALL_TESTS=1
 export PYARROW_WITH_ACERO=${ARROW_ACERO}
 export PYARROW_WITH_AZURE=${ARROW_AZURE}
 export PYARROW_WITH_DATASET=${ARROW_DATASET}
diff --git a/ci/scripts/python_wheel_manylinux_build.sh b/ci/scripts/python_wheel_manylinux_build.sh
index aa86494a9d47d..b5b45c54a800d 100755
--- a/ci/scripts/python_wheel_manylinux_build.sh
+++ b/ci/scripts/python_wheel_manylinux_build.sh
@@ -140,7 +140,6 @@ echo "=== (${PYTHON_VERSION}) Building wheel ==="
 export PYARROW_BUILD_TYPE=${CMAKE_BUILD_TYPE}
 export PYARROW_BUNDLE_ARROW_CPP=1
 export PYARROW_CMAKE_GENERATOR=${CMAKE_GENERATOR}
-export PYARROW_INSTALL_TESTS=1
 export PYARROW_WITH_ACERO=${ARROW_ACERO}
 export PYARROW_WITH_AZURE=${ARROW_AZURE}
 export PYARROW_WITH_DATASET=${ARROW_DATASET}
diff --git a/ci/scripts/python_wheel_windows_build.bat b/ci/scripts/python_wheel_windows_build.bat
index 54f02ec6f6ed0..1f1d5dca721d9 100644
--- a/ci/scripts/python_wheel_windows_build.bat
+++ b/ci/scripts/python_wheel_windows_build.bat
@@ -106,7 +106,6 @@ echo "=== (%PYTHON_VERSION%) Building wheel ==="
 set PYARROW_BUILD_TYPE=%CMAKE_BUILD_TYPE%
 set PYARROW_BUNDLE_ARROW_CPP=ON
 set PYARROW_CMAKE_GENERATOR=%CMAKE_GENERATOR%
-set PYARROW_INSTALL_TESTS=ON
 set PYARROW_WITH_ACERO=%ARROW_ACERO%
 set PYARROW_WITH_DATASET=%ARROW_DATASET%
 set PYARROW_WITH_FLIGHT=%ARROW_FLIGHT%
diff --git a/docs/source/developers/python.rst b/docs/source/developers/python.rst
index 2f3e892ce8ede..bed095b4b8d11 100644
--- a/docs/source/developers/python.rst
+++ b/docs/source/developers/python.rst
@@ -632,9 +632,6 @@ PyArrow are:
    * - ``PYARROW_BUNDLE_CYTHON_CPP``
      - Bundle the C++ files generated by Cython
      - ``0`` (``OFF``)
-   * - ``PYARROW_INSTALL_TESTS``
-     - Add the test to the python package
-     - ``1`` (``ON``)
    * - ``PYARROW_BUILD_VERBOSE``
      - Enable verbose output from Makefile builds
      - ``0`` (``OFF``)
diff --git a/python/setup.py b/python/setup.py
index 11cd7028023be..c4517d21c42f1 100755
--- a/python/setup.py
+++ b/python/setup.py
@@ -32,7 +32,7 @@ from distutils import sysconfig
 import pkg_resources
-from setuptools import setup, Extension, Distribution, find_namespace_packages
+from setuptools import setup, Extension, Distribution
 from Cython.Distutils import build_ext as _build_ext
 import Cython
@@ -371,21 +371,7 @@ def has_ext_modules(foo):
         return True
-if strtobool(os.environ.get('PYARROW_INSTALL_TESTS', '1')):
-    packages = find_namespace_packages(include=['pyarrow*'])
-    exclude_package_data = {}
-else:
-    packages = find_namespace_packages(include=['pyarrow*'],
-                                       exclude=["pyarrow.tests*"])
-    # setuptools adds back importable packages even when excluded.
-    # https://github.com/pypa/setuptools/issues/3260
-    # https://github.com/pypa/setuptools/issues/3340#issuecomment-1219383976
-    exclude_package_data = {"pyarrow": ["tests*"]}
-
-
 setup(
-    packages=packages,
-    exclude_package_data=exclude_package_data,
     distclass=BinaryDistribution,
     # Dummy extension to trigger build_ext
     ext_modules=[Extension('__dummy__', sources=[])],
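
Editor's note on the pyarrow-pr433325-extradirs.patch series above: the nine commits try several include/exclude values under [tool.setuptools.packages.find]. The following is a rough, standard-library-only sketch of how such glob patterns could select dotted package names. It assumes shell-style matching on the full dotted name, which is my reading of setuptools and not something this mail states; real setuptools discovery also walks the source tree and applies further rules. The candidate package names and the select() helper are hypothetical and exist only for illustration.

```python
from fnmatch import fnmatchcase

# Hypothetical dotted package names, for illustration only.
CANDIDATES = [
    "pyarrow",
    "pyarrow.tests",
    "pyarrow.vendored",
    "pyarrow_extra",
    "benchmarks",
]


def select(candidates, include, exclude=()):
    """Keep names that match any include glob and no exclude glob.

    Assumption: patterns are compared against the whole dotted name
    with case-sensitive shell-style globbing.
    """
    return [
        name
        for name in candidates
        if any(fnmatchcase(name, pat) for pat in include)
        and not any(fnmatchcase(name, pat) for pat in exclude)
    ]


if __name__ == "__main__":
    # The three include values tried in patches 1/9 to 3/9.
    for include in (["pyarrow", "pyarrow.*"], ["pyarrow*"], ["pyarrow"]):
        print(include, "->", select(CANDIDATES, include))
    # The include/exclude combination from the removed setup.py branch.
    print(select(CANDIDATES, ["pyarrow*"], exclude=["pyarrow.tests*"]))
```

Under that assumption, "pyarrow*" also matches an unrelated sibling such as pyarrow_extra, the pair "pyarrow", "pyarrow.*" is limited to the package and its subpackages, and a bare "pyarrow" matches only the top-level name.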