Script 'mail_helper' called by obssrc
Hello community,

here is the log from the commit of package apache-arrow for openSUSE:Factory 
checked in at 2024-08-16 12:23:38
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Comparing /work/SRC/openSUSE:Factory/apache-arrow (Old)
 and      /work/SRC/openSUSE:Factory/.apache-arrow.new.2698 (New)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Package is "apache-arrow"

Fri Aug 16 12:23:38 2024 rev:13 rq:1194086 version:17.0.0

Changes:
--------
--- /work/SRC/openSUSE:Factory/apache-arrow/apache-arrow.changes        
2024-04-25 20:50:53.117812536 +0200
+++ /work/SRC/openSUSE:Factory/.apache-arrow.new.2698/apache-arrow.changes      
2024-08-16 12:24:16.804603293 +0200
@@ -1,0 +2,181 @@
+Mon Aug 12 17:11:06 UTC 2024 - Ben Greiner <c...@bnavigator.de>
+
+- Update to 17.0.0
+  ## Bug Fixes
+  * [C++] Add option to string ‘center’ kernel to control
+    left/right alignment on odd number of padding (#41449)
+  * [C++][Python] Fix casting to extension type with fixed size
+    list storage type (#42219)
+  * [C++] Replace null_count with MayHaveNulls in
+    ListArrayFromArray and MapArray (#41957)
+  * [C++][Python] RecordBatch.filter() segfaults if passed a
+    ChunkedArray (#40971)
+  * [C++][Parquet] Timestamp conversion from Parquet to Arrow does
+    not follow compatibility guidelines for convertedType
+  * [C++] Use LargeStringArray for casting when writing tables to
+    CSV (#40271)
+  * [C++][Python] Map child Array constructed from keys and items
+    shouldn’t have offset (#40871)
+  * [C++] Fix compile warning with ‘implicitly-defined constructor
+    does not initialize’ in encoding_benchmark (#41060)
+  * [C++] Get null_bit_id according to are_cols_in_encoding_order
+    in NullUpdateColumnToRow_avx2 (#40998)
+  * [C++] Clean up unused parameter warnings (#41111)
+  * [C++][Acero] Fix asof join race (#41614)
+  * [C++] support for single threaded joins (#41125)
+  * [C++] Fix hashjoin benchmark failed at make utf8’s random
+    batches (#41195)
+  * [C++] Check to avoid copying when NullBitmapBuffer is Null
+    (#41452)
+  * [C++] Fix crash on invalid Parquet file (#41366)
+  * [C++][Parquet] More strict Parquet level checking (#41346)
+  * [C++][Gandiva] Fix gandiva cache size env var (#41330)
+  * [C++][CMake][Windows] Remove needless .dll suffix from link
+    libraries (#41341)
+  * [C++][CMake] Remove unused ARROW_NO_DEPRECATED_API (#41345)
+  * [C++][maybe_unused] with Arrow macro (#41359)
+  * [C++][Large] ListView and Map nested types for scalar_if_else’s
+    kernel functions (#41419)
+  * [C++][Gandiva] Fix ascii_utf8 function to return same result on
+    x86 and Arm (#41434)
+  * [C++] Reuse deduplication logic for direct registration
+    (#41466)
+  * [C++] Clean up more redundant move warnings (#41487)
+  * [C++][Compute] Remove redundant logic for ArrayData as
+    ExecResults in ExecScalarCaseWhen (#41380)
+  * [C++][CMake] correctly use Protobuf_PROTOC_EXECUTABLE (#41582)
+  * [C++][CMake] Fix ARROW_USE_BOOST detect condition (#41622)
+  * [C++][Python] Add optional null_bitmap to MapArray::FromArrays
+    (#41757)
+  * [C++] macros.h: Fix ARROW_FORCE_INLINE for MSVC (#41712)
+  * [C++][Acero] Remove an useless parameter for QueryContext::Init
+    called in hash_join_benchmark (#41716)
+  * [C++] Fix the issue that temp vector stack may be under sized
+    (#41746)
+  * [C++] Check that extension metadata key is present before
+    attempting to delete it (#41763)
+  * [C++] Iterator releases its resource immediately when it reads
+    all values (#41824)
+  * [C++][Flight][Benchmark] Ensure waiting server ready (#41793)
+  * [C++] Fix avx2 gather offset larger than 2GB in
+    CompareColumnsToRows (#42188)
+  * [C++][S3] Fix potential deadlock when closing output stream
+    (#41876)
+  * [CI][C++] Clear cache for mamba on AppVeyor (#41977)
+  * [CI][Python][C++] Fix utf8proc detection for wheel on Windows
+    (#42022)
+  * [C++] Support list-views on list_slice (#42067)
+  * [C++] Fix an OTel test failure and remove needless logs
+    (#42122)
+  * [C++][FS][Azure] Ensure setting BlobSasBuilder::Protocol
+    (#42108)
+  * [C++] Support list-view typed arrays in array_take and
+    array_filter (#42117)
+  * [C++] Fix some potential uninitialized variable warnings
+    (#42207)
+  * [C++] Avoid invalid accesses in parquet-encoding-benchmark
+    (#42141)
+  * [C++] Use FetchContent for bundled ORC (#43011)
+  * [C++] Fix GetRecordBatchPayload crashes for device data
+    (#42199)
+  * [C++] Use non-stale c-ares download URL (#42250)
+  * [C++][Parquet] Check for valid ciphertext length to prevent
+    segfault (#43071)
+  * [C++][Compute] Mark KeyCompare.CompareColumnsToRowsLarge as
+    large memory test (#43128)
+  * [C++] Upgrade bundled google-cloud-cpp to 2.22.0 (#43136)
+  ## New Features and Improvements
+  * [C++][Compute] Implement Grouper::Reset (#41352)
+  * [Go][C++] Implement Flight SQL Bulk Ingestion (#38385)
+  * [C++][FS][Azure] Support azure cli auth (#41976)
+  * [C++][FS][Azure] Add support for environment credential
+    (#41715)
+  * [C++] Optimize Take for fixed-size types including nested
+    fixed-size lists (#41297)
+  * [C++][Device] Add Copy/View slice functions to a CPU pointer
+    (#41477)
+  * [C++] Add support for OpenTelemetry logging (#39905)
+  * [C++] Import/Export ArrowDeviceArrayStream (#40807)
+  * [C++] move LocalFileSystem to the registry (#40356)
+  * [C++] Make flatbuffers serialization more deterministic
+    (#40392)
+  * [C++][Gandiva] add RE2::Options set_dot_nl(true) for Like
+    function (#40970)
+  * [C++] Introduce portable compiler assumptions (#41021)
+  * [C++] Add a grouper benchmark for preventing performance
+    regression (#41036)
+  * [C++] Support flatten for combining nested list related types
+    (#41092)
+  * [C++] Clean up remaining tasks related to half float casts
+    (#41084)
+  * [C++][FS][Azure] Add support for CopyFile with hierarchical
+    namespace support (#41276)
+  * [C++] Add is_validity_defined_by_bitmap() predicate (#41115)
+  * [C++] IO: enhance boundary checking in CompressedInputStream
+    (#41117)
+  * [C++][Python] Expose recursive flatten for lists on
+    list_flatten kernel function and pyarrow bindings (#41295)
+  * [C++][Parquet][Doc] Denote PARQUET:field_id in parquet.rst
+    (#41187)
+  * [C++] Extract the kernel loops used for PrimitiveTakeExec and
+    generalize to any fixed-width type (#41373)
+  * [C++][Acero] Use per-node basis temp vector stack to mitigate
+    overflow (#41335)
+  * [C++][Parquet] Optimize DelimitRecords by batch execution when
+    max_rep_level > 1 (#41362)
+  * [C++][FS][Azure][Docs] Add AzureFileSystem to Filesystems API
+    reference (#41411)
+  * [C++] Use ASAN to poison temp vector stack memory (#41695)
+  * [C++][S3] Add a new option to check existence before CreateDir
+    (#41822)
+  * [C++][Parquet] Fix
+    DeltaLengthByteArrayEncoder::EstimatedDataEncodedSize (#41546)
+  * [C++] Thirdparty: Upgrade xsimd to 13.0.0 (#41548)
+  * [C++] Improve fixed_width_test_util.h (#41575)
+  * [C++] ChunkResolver: Implement ResolveMany and add unit tests
+    (#41561)
+  * [C++] fixed_width_internal.h: Simplify docstring and support
+    bit-sized types (BOOL) (#41597)
+  * [C++][Python] Extends the add_key_value to parquet::arrow and
+    PyArrow (#41633)
+  * [C++][CMake][Windows] Don’t build needless object libraries
+    (#41658)
+  * [C++][Python] PrettyPrint non-cpu data by copying to default
+    CPU device (#42010)
+  * [C++][Parquet] Thrift: generate template method to accelerate
+    reading thrift (#41703)
+  * [C++][Parquet] Minor: moving EncodedStats by default rather
+    than copying (#41727)
+  * [C++][ORC] Ensure setting detected ORC version (#41767)
+  * [C++][Parquet] Add file metadata read/write benchmark (#41761)
+  * [C++] Make git-dependent definitions internal (#41781)
+  * [C++][S3] Remove GetBucketRegion hack for newer AWS SDK
+    versions (#41798)
+  * [C++][Parquet] normalize dictionary encoding to use
+    RLE_DICTIONARY (#41819)
+  * [C++] IPC: Minor enhance the code of writer (#41900)
+  * [C++] Fix ExecuteScalar deduce all_scalar with chunked_array
+    (#41925)
+  * [C++] Minor enhance code style for FixedShapeTensorType
+    (#41954)
+  * [C++] Follow up of adding null_bitmap to MapArray::FromArrays
+    (#41956)
+  * [C++] Misc changes making code around list-like types and
+    list-view types behave the same way (#41971)
+  * [C++] : kernel.cc: Remove defaults on switch so that compiler
+    can check full enum coverage for us (#41995)
+  * [C++][Parquet] ParquetFilePrinter::JSONPrint print length of
+    FLBA (#41981)
+  * [C++][CMake] Add preset for Valgrind (#42110)
+  * [C++] Move TakeXXX free functions into TakeMetaFunction and
+    make them private (#42127)
+  * [C++][FS][Azure] Validate
+    AzureOptions::{blob,dfs}_storage_scheme (#42135)
+  * [C++] list_parent_indices: Add support for list-view types
+    (#42236)
+  * [C++] Reduce the recursion of many-join test (#43042)
+  * [C++] Limit buffer size in BufferedInputStream::SetBufferSize
+    with raw_read_bound (#43064)
+- Require cmake lz4 for 1.10
+
+-------------------------------------------------------------------
--- /work/SRC/openSUSE:Factory/apache-arrow/python-pyarrow.changes      
2024-04-25 20:50:53.661832295 +0200
+++ /work/SRC/openSUSE:Factory/.apache-arrow.new.2698/python-pyarrow.changes    
2024-08-16 12:24:16.908607615 +0200
@@ -1,0 +2,99 @@
+Wed Aug 14 20:27:48 UTC 2024 - Ben Greiner <c...@bnavigator.de>
+
+- Update to 17.0.0
+  ## Bug Fixes
+  * [C++][Python] Fix casting to extension type with fixed size
+    list storage type (#42219)
+  * [Python] Include metadata when creating pa.schema from
+    PyCapsule (#41538)
+  * [C++][Python] RecordBatch.filter() segfaults if passed a
+    ChunkedArray (#40971)
+  * [Python] pa.array: add check for byte-swapped numpy arrays
+    inside python objects (#41549)
+  * [Python] Fix read_table for encrypted parquet (#39438)
+  * [Python] RunEndEncodedArray.from_arrays: bugfix for Array
+    arguments (#40560) (#41093)
+  * [C++][Python] Map child Array constructed from keys and items
+    shouldn’t have offset (#40871)
+  * [Python] `test_numpy_array_protocol` test failures with numpy
+    2.0.0rc1
+  * [Python] Fix StructArray.sort() for by=None (#41495)
+  * [Python] Build with Python 3.13 (#42034)
+  * [Python] remove special methods related to buffers in python
+    <2.6 (#41492)
+  * [Python] Fix reading column index with decimal values (#41503)
+  * [Docs][Python] Remove duplicate contents (#41588)
+  * [C++][Python] Add optional null_bitmap to MapArray::FromArrays
+    (#41757)
+  * [Python][Parquet] Implement to_dict method on SortingColumn
+    (#41704)
+  * [Python] CMake: ignore Parquet encryption option if Parquet
+    itself is not enabled (fix Java integration build) (#41776)
+  * [Python] Disallow direct pa.RecordBatchReader() construction to
+    avoid segfaults (#41773)
+  * [Python] Fix RecordBatchReader.cast to support casting to equal
+    schema for all types (#42098)
+  * [Python] Fix tests when using NumPy 2.0 on Windows (#42099)
+  * [CI][Python] Use pip install -e instead of setup.py build_ext
+    –inplace for installing pyarrow on verification script (#42007)
+  * [CI][Python][C++] Fix utf8proc detection for wheel on Windows
+    (#42022)
+  * [Python][CI] Update expected output for numpy 2.0.0 (#42172)
+  ## New Features and Improvements
+  * [Python] Replace pandas.util.testing.rands with vendored
+    version (#42089)
+  * [Python] begin moving static settings to pyproject.toml
+    (#41041)
+  * [Python] Implement PyCapsule interface for Device data in
+    PyArrow (#40717)
+  * [Python] Expand the Arrow PyCapsule Interface with C Device
+    Data support (#40708)
+  * [Python] Let RecordBatch.filter accept a boolean expression in
+    addition to mask array (#43043)
+  * [Python] Fix pickling of LocalFileSystem for cython 2 (#41459)
+  * [Python] Expand the C Device Interface bindings to support
+    import on CUDA device (#40385)
+  * [Python] Allow passing a mapping of column names to
+    rename_columns (#40645)
+  * [Python][Packaging] Strip unnecessary symbols when building
+    wheels (#42028)
+  * [Python][Docs] Update PyArrow installation docs for conda
+    package split (#41135)
+  * [Python] Basic bindings for Device and MemoryManager classes
+    (#41685)
+  * [C++][Python] Expose recursive flatten for lists on
+    list_flatten kernel function and pyarrow bindings (#41295)
+  * [Python][Packaging] Ensure to build with released numpy 2.0
+    (instead of RC) in the wheel building workflows (#42194)
+  * [CI][Python] Add a job on ARM64 macOS (#41313)
+  * [CI][Python] Reduce CI time on macOS (#41378)
+  * [Python] Expose byte_width and bit_width of ExtensionType in
+    terms of the storage type (#41413)
+  * [Python] Update Python development guide about components being
+    enabled by default based on Arrow C++ (#41705)
+  * [Python] Building PyArrow: enable/disable python components by
+    default based on availability in Arrow C++ (#41494)
+  * [C++][Python] Extends the add_key_value to parquet::arrow and
+    PyArrow (#41633)
+  * [Python] Ensure Buffer methods don’t crash with non-CPU data
+    (#41889)
+  * [C++][Python] PrettyPrint non-cpu data by copying to default
+    CPU device (#42010)
+  * [Python][Parquet] Update BYTE_STREAM_SPLIT description in
+    write_table() docstring (#41759)
+  * [Python] Add support for Pyodide (#37822)
+  * [Python] Fix pandas tests to follow downstream datetime64 unit
+    changes (#41979)
+  * [Python] Allow Array.filter() to take general array input
+    (#42051)
+  * [Python] Expose new FLOAT16 logical type in the pyarrow.parquet
+    bindings (#42103)
+  * [Python] Array gracefully fails on non-cpu device (#42113)
+  * [Python][Parquet] Pyarrow store decimal as integer (#42169)
+  * [Python] Add CI job for Numpy 1.X (#42189)
+  * [CI][Python] Pin openjdk=17 in python substrait integration
+    (#43051)
+- Drop pyarrow-pr41319-numpy2-tests.patch
+- Add pyarrow-pr433325-extradirs.patch gh#apache/arrow/pull/43325
+
+-------------------------------------------------------------------

Old:
----
  apache-arrow-16.0.0.tar.gz
  arrow-testing-16.0.0.tar.gz
  parquet-testing-16.0.0.tar.gz
  pyarrow-pr41319-numpy2-tests.patch

New:
----
  apache-arrow-17.0.0.tar.gz
  arrow-testing-17.0.0.tar.gz
  parquet-testing-17.0.0.tar.gz
  pyarrow-pr433325-extradirs.patch

BETA DEBUG BEGIN:
  Old:/work/SRC/openSUSE:Factory/.apache-arrow.new.2698/python-pyarrow.changes- 
   (#43051)
/work/SRC/openSUSE:Factory/.apache-arrow.new.2698/python-pyarrow.changes:- Drop 
pyarrow-pr41319-numpy2-tests.patch
/work/SRC/openSUSE:Factory/.apache-arrow.new.2698/python-pyarrow.changes-- Add 
pyarrow-pr433325-extradirs.patch gh#apache/arrow/pull/43325
BETA DEBUG END:

BETA DEBUG BEGIN:
  
New:/work/SRC/openSUSE:Factory/.apache-arrow.new.2698/python-pyarrow.changes-- 
Drop pyarrow-pr41319-numpy2-tests.patch
/work/SRC/openSUSE:Factory/.apache-arrow.new.2698/python-pyarrow.changes:- Add 
pyarrow-pr433325-extradirs.patch gh#apache/arrow/pull/43325
/work/SRC/openSUSE:Factory/.apache-arrow.new.2698/python-pyarrow.changes-
BETA DEBUG END:

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Other differences:
------------------
++++++ apache-arrow.spec ++++++
--- /var/tmp/diff_new_pack.LbwoBE/_old  2024-08-16 12:24:18.484673114 +0200
+++ /var/tmp/diff_new_pack.LbwoBE/_new  2024-08-16 12:24:18.488673281 +0200
@@ -22,13 +22,13 @@
 # Required for runtime dispatch, not yet packaged
 %bcond_with xsimd
 
-%define sonum   1600
+%define sonum   1700
 # See git submodule /testing pointing to the correct revision
-%define arrow_testing_commit 25d16511e8d42c2744a1d94d90169e3a36e92631
+%define arrow_testing_commit 735ae7128d571398dd798d7ff004adebeb342883
 # See git submodule /cpp/submodules/parquet-testing pointing to the correct 
revision
 %define parquet_testing_commit 74278bc4a1122d74945969e6dec405abd1533ec3
 Name:           apache-arrow
-Version:        16.0.0
+Version:        17.0.0
 Release:        0
 Summary:        A development platform for in-memory data
 License:        Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT
@@ -51,6 +51,7 @@
 BuildRequires:  pkgconfig
 BuildRequires:  python-rpm-macros
 BuildRequires:  python3-base
+BuildRequires:  (cmake(lz4) >= 1.10 or (pkgconfig(liblz4) >= 1.8.3 with 
pkgconfig(liblz4) < 1.10))
 BuildRequires:  cmake(Snappy) >= 1.1.7
 BuildRequires:  cmake(absl)
 BuildRequires:  cmake(double-conversion) >= 3.1.5
@@ -64,7 +65,6 @@
 BuildRequires:  pkgconfig(libbrotlienc) >= 1.0.7
 BuildRequires:  pkgconfig(libcares) >= 1.15.0
 BuildRequires:  pkgconfig(libglog) >= 0.3.5
-BuildRequires:  pkgconfig(liblz4) >= 1.8.3
 BuildRequires:  pkgconfig(libopenssl)
 BuildRequires:  pkgconfig(liburiparser) >= 0.9.3
 BuildRequires:  pkgconfig(libutf8proc)

++++++ python-pyarrow.spec ++++++
--- /var/tmp/diff_new_pack.LbwoBE/_old  2024-08-16 12:24:18.528674943 +0200
+++ /var/tmp/diff_new_pack.LbwoBE/_new  2024-08-16 12:24:18.528674943 +0200
@@ -18,17 +18,20 @@
 
 %bcond_with xsimd
 %define plainpython python
+# See git submodule /testing pointing to the correct revision
+%define arrow_testing_commit 735ae7128d571398dd798d7ff004adebeb342883
 Name:           python-pyarrow
-Version:        16.0.0
+Version:        17.0.0
 Release:        0
 Summary:        Python library for Apache Arrow
 License:        Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT
 URL:            https://arrow.apache.org/
 # SourceRepository: https://github.com/apache/arrow
 Source0:        apache-arrow-%{version}.tar.gz
+Source1:        arrow-testing-%{version}.tar.gz
 Source99:       python-pyarrow.rpmlintrc
-# PATCH-FIX-UPSTREAM pyarrow-pr41319-numpy2-tests.patch gh#apache/arrow#41319
-Patch0:         pyarrow-pr41319-numpy2-tests.patch
+# PATCH-FIX-UPSTREAM pyarrow-pr433325-extradirs.patch 
gh#apache/arrow/pull/43325
+Patch0:         pyarrow-pr433325-extradirs.patch
 BuildRequires:  %{python_module Cython >= 0.29.31}
 BuildRequires:  %{python_module devel >= 3.8}
 BuildRequires:  %{python_module numpy-devel >= 1.25}
@@ -88,7 +91,8 @@
 platlib for consuming modules using cythonization.
 
 %prep
-%autosetup -p1 -n arrow-apache-arrow-%{version}
+%setup -n arrow-apache-arrow-%{version} -a1
+%autopatch -p1
 # we disabled the jemalloc backend in apache-arrow
 sed -i 's/should_have_jemalloc = sys.platform == "linux"/should_have_jemalloc 
= False/' python/pyarrow/tests/test_memory.py
 
@@ -122,8 +126,12 @@
 popd
 
 %check
-# flaky
+export ARROW_TEST_DATA="${PWD}/arrow-testing-%{arrow_testing_commit}/data"
+# flaky tests
 donttest="test_total_bytes_allocated"
+donttest="$donttest or test_batch_lifetime"
+# worker crashes, we don't have an s3 setup in obs anyway
+donttest="$donttest or test_s3fs_limited_permissions_create_bucket"
 %ifarch %{ix86} %{arm32}
 # tests conversion to 64bit datatypes
 donttest="$donttest or test_conversion"

++++++ apache-arrow-16.0.0.tar.gz -> apache-arrow-17.0.0.tar.gz ++++++
/work/SRC/openSUSE:Factory/apache-arrow/apache-arrow-16.0.0.tar.gz 
/work/SRC/openSUSE:Factory/.apache-arrow.new.2698/apache-arrow-17.0.0.tar.gz 
differ: char 13, line 1

++++++ arrow-testing-16.0.0.tar.gz -> arrow-testing-17.0.0.tar.gz ++++++
Binary files 
old/arrow-testing-25d16511e8d42c2744a1d94d90169e3a36e92631/data/arrow-ipc-stream/clusterfuzz-testcase-minimized-arrow-ipc-stream-fuzz-5048291196731392
 and 
new/arrow-testing-735ae7128d571398dd798d7ff004adebeb342883/data/arrow-ipc-stream/clusterfuzz-testcase-minimized-arrow-ipc-stream-fuzz-5048291196731392
 differ

++++++ parquet-testing-16.0.0.tar.gz -> parquet-testing-17.0.0.tar.gz ++++++
++++ no output (probably identical)

++++++ pyarrow-pr433325-extradirs.patch ++++++
>From 888a5ae568d155d03fbff0db8849517fd24a99ff Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 18 Jul 2024 16:48:52 +0200
Subject: [PATCH 1/9] GH-43299: [Release][Packaging] Only include pyarrow and
 pyarrow.* when finding packages on setuptools

---
 python/pyproject.toml | 1 +
 1 file changed, 1 insertion(+)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d863bb3e5f0ac..d70b7fcce5903 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,6 +74,7 @@ include-package-data=true
 
 [tool.setuptools.packages.find]
 where = ["."]
+include = ["pyarrow", "pyarrow.*"]
 
 [tool.setuptools.package-data]
 pyarrow = ["*.pxd", "*.pyx", "includes/*.pxd"]

>From 46d1afc62514ae04a3815aede7722ac5a9ecce64 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 18 Jul 2024 17:33:33 +0200
Subject: [PATCH 2/9] Update include

---
 python/pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d70b7fcce5903..d1c5a799f870f 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,7 +74,7 @@ include-package-data=true
 
 [tool.setuptools.packages.find]
 where = ["."]
-include = ["pyarrow", "pyarrow.*"]
+include = ["pyarrow*"]
 
 [tool.setuptools.package-data]
 pyarrow = ["*.pxd", "*.pyx", "includes/*.pxd"]

>From d954d75432f05723fca0644842deafd941802842 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 18 Jul 2024 18:00:40 +0200
Subject: [PATCH 3/9] try again without *

---
 python/pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d1c5a799f870f..222f8d2ece681 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,7 +74,7 @@ include-package-data=true
 
 [tool.setuptools.packages.find]
 where = ["."]
-include = ["pyarrow*"]
+include = ["pyarrow"]
 
 [tool.setuptools.package-data]
 pyarrow = ["*.pxd", "*.pyx", "includes/*.pxd"]

>From 2fa434ffc03cca1a251c80c51dd6e98f63db19d1 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 18 Jul 2024 18:36:55 +0200
Subject: [PATCH 4/9] Exclude tests from wheels

---
 python/pyproject.toml | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index 222f8d2ece681..45c3b60c8aeed 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -75,6 +75,8 @@ include-package-data=true
 [tool.setuptools.packages.find]
 where = ["."]
 include = ["pyarrow"]
+exclude = ["pyarrow.tests"]
+namespaces = false
 
 [tool.setuptools.package-data]
 pyarrow = ["*.pxd", "*.pyx", "includes/*.pxd"]

>From 204a27b0534161a35e2d79241dcadd0471341c2a Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Fri, 19 Jul 2024 12:43:29 +0200
Subject: [PATCH 5/9] Try excluding pyarrow. and pyarrow/tests explicitly

---
 python/pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index 45c3b60c8aeed..d675f07a82391 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -75,7 +75,7 @@ include-package-data=true
 [tool.setuptools.packages.find]
 where = ["."]
 include = ["pyarrow"]
-exclude = ["pyarrow.tests"]
+exclude = ["pyarrow/tests", "pyarrow."]
 namespaces = false
 
 [tool.setuptools.package-data]

>From a1d73a28e3d6e57366ff43d06389a2f3fa47c7de Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Fri, 19 Jul 2024 13:40:25 +0200
Subject: [PATCH 6/9] Try removing where from packages find

---
 python/pyproject.toml | 1 -
 1 file changed, 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d675f07a82391..9a91fd76a4a20 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -73,7 +73,6 @@ zip-safe=false
 include-package-data=true
 
 [tool.setuptools.packages.find]
-where = ["."]
 include = ["pyarrow"]
 exclude = ["pyarrow/tests", "pyarrow."]
 namespaces = false

>From 346c0f1982735cac2a4b76a13efbb2a201bf158f Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Mon, 22 Jul 2024 14:46:54 +0200
Subject: [PATCH 7/9] Try with pyarrow.tests

---
 python/pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index 9a91fd76a4a20..d83cf8fe45d8c 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,7 +74,7 @@ include-package-data=true
 
 [tool.setuptools.packages.find]
 include = ["pyarrow"]
-exclude = ["pyarrow/tests", "pyarrow."]
+exclude = ["pyarrow.tests"]
 namespaces = false
 
 [tool.setuptools.package-data]

>From f6273223a1b006406bf315f41424be03a51a3b1e Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Mon, 22 Jul 2024 15:34:31 +0200
Subject: [PATCH 8/9] Remove excludes

---
 python/pyproject.toml | 1 -
 1 file changed, 1 deletion(-)

diff --git a/python/pyproject.toml b/python/pyproject.toml
index d83cf8fe45d8c..7e14795428315 100644
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
@@ -74,7 +74,6 @@ include-package-data=true
 
 [tool.setuptools.packages.find]
 include = ["pyarrow"]
-exclude = ["pyarrow.tests"]
 namespaces = false
 
 [tool.setuptools.package-data]

>From f855f0c14fbc4703123e36924f1641cf4a48396a Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Ra=C3=BAl=20Cumplido?= <raulcumpl...@gmail.com>
Date: Thu, 25 Jul 2024 11:33:51 +0200
Subject: [PATCH 9/9] Remove PYARROW_INSTALL_TESTS and packages definition on
 setup.py

---
 ci/scripts/python_wheel_macos_build.sh     |  1 -
 ci/scripts/python_wheel_manylinux_build.sh |  1 -
 ci/scripts/python_wheel_windows_build.bat  |  1 -
 docs/source/developers/python.rst          |  3 ---
 python/setup.py                            | 16 +---------------
 5 files changed, 1 insertion(+), 21 deletions(-)

diff --git a/ci/scripts/python_wheel_macos_build.sh 
b/ci/scripts/python_wheel_macos_build.sh
index 3ed9d5d8dd12f..6c314d0632f60 100755
--- a/ci/scripts/python_wheel_macos_build.sh
+++ b/ci/scripts/python_wheel_macos_build.sh
@@ -152,7 +152,6 @@ echo "=== (${PYTHON_VERSION}) Building wheel ==="
 export PYARROW_BUILD_TYPE=${CMAKE_BUILD_TYPE}
 export PYARROW_BUNDLE_ARROW_CPP=1
 export PYARROW_CMAKE_GENERATOR=${CMAKE_GENERATOR}
-export PYARROW_INSTALL_TESTS=1
 export PYARROW_WITH_ACERO=${ARROW_ACERO}
 export PYARROW_WITH_AZURE=${ARROW_AZURE}
 export PYARROW_WITH_DATASET=${ARROW_DATASET}
diff --git a/ci/scripts/python_wheel_manylinux_build.sh 
b/ci/scripts/python_wheel_manylinux_build.sh
index aa86494a9d47d..b5b45c54a800d 100755
--- a/ci/scripts/python_wheel_manylinux_build.sh
+++ b/ci/scripts/python_wheel_manylinux_build.sh
@@ -140,7 +140,6 @@ echo "=== (${PYTHON_VERSION}) Building wheel ==="
 export PYARROW_BUILD_TYPE=${CMAKE_BUILD_TYPE}
 export PYARROW_BUNDLE_ARROW_CPP=1
 export PYARROW_CMAKE_GENERATOR=${CMAKE_GENERATOR}
-export PYARROW_INSTALL_TESTS=1
 export PYARROW_WITH_ACERO=${ARROW_ACERO}
 export PYARROW_WITH_AZURE=${ARROW_AZURE}
 export PYARROW_WITH_DATASET=${ARROW_DATASET}
diff --git a/ci/scripts/python_wheel_windows_build.bat 
b/ci/scripts/python_wheel_windows_build.bat
index 54f02ec6f6ed0..1f1d5dca721d9 100644
--- a/ci/scripts/python_wheel_windows_build.bat
+++ b/ci/scripts/python_wheel_windows_build.bat
@@ -106,7 +106,6 @@ echo "=== (%PYTHON_VERSION%) Building wheel ==="
 set PYARROW_BUILD_TYPE=%CMAKE_BUILD_TYPE%
 set PYARROW_BUNDLE_ARROW_CPP=ON
 set PYARROW_CMAKE_GENERATOR=%CMAKE_GENERATOR%
-set PYARROW_INSTALL_TESTS=ON
 set PYARROW_WITH_ACERO=%ARROW_ACERO%
 set PYARROW_WITH_DATASET=%ARROW_DATASET%
 set PYARROW_WITH_FLIGHT=%ARROW_FLIGHT%
diff --git a/docs/source/developers/python.rst 
b/docs/source/developers/python.rst
index 2f3e892ce8ede..bed095b4b8d11 100644
--- a/docs/source/developers/python.rst
+++ b/docs/source/developers/python.rst
@@ -632,9 +632,6 @@ PyArrow are:
    * - ``PYARROW_BUNDLE_CYTHON_CPP``
      - Bundle the C++ files generated by Cython
      - ``0`` (``OFF``)
-   * - ``PYARROW_INSTALL_TESTS``
-     - Add the test to the python package
-     - ``1`` (``ON``)
    * - ``PYARROW_BUILD_VERBOSE``
      - Enable verbose output from Makefile builds
      - ``0`` (``OFF``)
diff --git a/python/setup.py b/python/setup.py
index 11cd7028023be..c4517d21c42f1 100755
--- a/python/setup.py
+++ b/python/setup.py
@@ -32,7 +32,7 @@
     from distutils import sysconfig
 
 import pkg_resources
-from setuptools import setup, Extension, Distribution, find_namespace_packages
+from setuptools import setup, Extension, Distribution
 
 from Cython.Distutils import build_ext as _build_ext
 import Cython
@@ -371,21 +371,7 @@ def has_ext_modules(foo):
         return True
 
 
-if strtobool(os.environ.get('PYARROW_INSTALL_TESTS', '1')):
-    packages = find_namespace_packages(include=['pyarrow*'])
-    exclude_package_data = {}
-else:
-    packages = find_namespace_packages(include=['pyarrow*'],
-                                       exclude=["pyarrow.tests*"])
-    # setuptools adds back importable packages even when excluded.
-    # https://github.com/pypa/setuptools/issues/3260
-    # https://github.com/pypa/setuptools/issues/3340#issuecomment-1219383976
-    exclude_package_data = {"pyarrow": ["tests*"]}
-
-
 setup(
-    packages=packages,
-    exclude_package_data=exclude_package_data,
     distclass=BinaryDistribution,
     # Dummy extension to trigger build_ext
     ext_modules=[Extension('__dummy__', sources=[])],

Reply via email to