[I] Avro to arrow conversion fails for nested unions [arrow]

2024-01-26 Thread via GitHub
darcysaum-toast opened a new issue, #39810: URL: https://github.com/apache/arrow/issues/39810 ### Describe the bug, including details regarding any error messages, version, and platform. Hello arrow team, I believe the Avro adapter fails on avro union types nested in records

[I] col_types of open_delim_dataset() does not work as described [arrow]

2024-01-26 Thread via GitHub
joelnitta opened a new issue, #39811: URL: https://github.com/apache/arrow/issues/39811 ### Describe the bug, including details regarding any error messages, version, and platform. (originally posted as a comment on #38903, but suggested by @thisisnic to file as its own issue)

[I] PyArrow Capsule from Nanoarrow-built Interval Arrow Yields Unexpected Values [arrow]

2024-01-26 Thread via GitHub
WillAyd opened a new issue, #39816: URL: https://github.com/apache/arrow/issues/39816 ### Describe the bug, including details regarding any error messages, version, and platform. I am trying to work with interval data passed along the new pycapsule interface. I noticed that this

[I] [Python] Add ListView and LargeListView arrays [arrow]

2024-01-26 Thread via GitHub
danepitkin opened a new issue, #39812: URL: https://github.com/apache/arrow/issues/39812 ### Describe the enhancement requested Add bindings to the C++ ListView and LargeListView arrays implemented in https://github.com/apache/arrow/pull/35345 ### Component(s) Python

[I] [Java] Checker Framework annotations for AutoCloseables in arrow-memory-core are incorrect [arrow]

2024-01-26 Thread via GitHub
lidavidm opened a new issue, #39814: URL: https://github.com/apache/arrow/issues/39814 ### Describe the enhancement requested AutoCloseables.close works with nulls and collections of nulls, but isn't annotated appropriately, making downstream usage harder. ### Component(s)

Re: [I] [C++][Parquet]: Page sizes are not always validated before truncating to int32 [arrow]

2024-01-26 Thread via GitHub
mapleFU closed issue #39527: [C++][Parquet]: Page sizes are not always validated before truncating to int32 URL: https://github.com/apache/arrow/issues/39527 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

Re: [I] [C++][Parquet] Benchmarks should present a items/s or bytes/s metric [arrow]

2024-01-26 Thread via GitHub
mapleFU closed issue #39668: [C++][Parquet] Benchmarks should present a items/s or bytes/s metric URL: https://github.com/apache/arrow/issues/39668 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [I] [C++][Acero] ASAN reports heap buffer overflow in `arrow::compute::Hashing32::ProcessStripes_avx2` [arrow]

2024-01-26 Thread via GitHub
pitrou closed issue #39778: [C++][Acero] ASAN reports heap buffer overflow in `arrow::compute::Hashing32::ProcessStripes_avx2` URL: https://github.com/apache/arrow/issues/39778 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

Re: [I] [14.0.2+] unable to detect the active compiler's information [arrow]

2024-01-26 Thread via GitHub
kou closed issue #39801: [14.0.2+] unable to detect the active compiler's information URL: https://github.com/apache/arrow/issues/39801 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[I] Dataset.to_batches accumulates memory usage and leaks [arrow]

2024-01-26 Thread via GitHub
akoumjian opened a new issue, #39808: URL: https://github.com/apache/arrow/issues/39808 ### Describe the bug, including details regarding any error messages, version, and platform. If you want to read in a large parquet file or series of parquet files, the dataset reader accumulates

Re: [I] [Java] Implement/use ServiceProvider for discovering drivers [arrow-adbc]

2024-01-26 Thread via GitHub
lidavidm closed issue #48: [Java] Implement/use ServiceProvider for discovering drivers URL: https://github.com/apache/arrow-adbc/issues/48 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[I] UTC Timezone Equality is case sensitive [arrow]

2024-01-26 Thread via GitHub
WillAyd opened a new issue, #39809: URL: https://github.com/apache/arrow/issues/39809 ### Describe the bug, including details regarding any error messages, version, and platform. Not sure if this is a bug but I found the behavior surprising ``` >>> import pyarrow as pa

[I] arrow violates the CRAN policy on downloading binary software [arrow]

2024-01-26 Thread via GitHub
rafapereirabr opened a new issue, #39806: URL: https://github.com/apache/arrow/issues/39806 ### Describe the bug, including details regarding any error messages, version, and platform. Hi all. Several package developers who maintain packages that depend on `arrow` received this

Re: [I] [C++] vendor a half precision floating point library [arrow]

2024-01-26 Thread via GitHub
pitrou closed issue #22806: [C++] vendor a half precision floating point library URL: https://github.com/apache/arrow/issues/22806 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific