[PR] chore: pin FlatBuffers to 25.2.10. Generate matching bindings. [arrow-swift]

2025-10-11 Thread via GitHub
willtemperley opened a new pull request, #93: URL: https://github.com/apache/arrow-swift/pull/93 There is a mismatch between the FlatBuffers runtime which is 25.2.10 and the bindings which were generated with 23.1.4. ## What's Changed The files have been generated by FlatBuffer

Re: [I] `arrow::compute::interleave` fails with struct arrays with no fields [arrow-rs]

2025-10-11 Thread via GitHub
alamb closed issue #8533: `arrow::compute::interleave` fails with struct arrays with no fields URL: https://github.com/apache/arrow-rs/issues/8533 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [PR] fix: Support `interleave_struct` to handle empty fields [arrow-rs]

2025-10-11 Thread via GitHub
alamb merged PR #8563: URL: https://github.com/apache/arrow-rs/pull/8563 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache

Re: [PR] Skip redundant validation checks in RecordBatch#project [arrow-rs]

2025-10-11 Thread via GitHub
pepijnve commented on code in PR #8583: URL: https://github.com/apache/arrow-rs/pull/8583#discussion_r2422831913 ## arrow-array/Cargo.toml: ## @@ -80,3 +80,7 @@ harness = false [[bench]] name = "union_array" harness = false + +[[bench]] +name = "record_batch" Review Comment:

[PR] Add RecordBatch::project microbenchmark [arrow-rs]

2025-10-11 Thread via GitHub
pepijnve opened a new pull request, #8592: URL: https://github.com/apache/arrow-rs/pull/8592 # Which issue does this PR close? - Related to #8591. # Rationale for this change Add a microbenchmark for `RecordBatch::project` to measure the performance impact of #8583

Re: [PR] Casting support for RunEndEncoded arrays [arrow-rs]

2025-10-11 Thread via GitHub
vegarsti commented on code in PR #8589: URL: https://github.com/apache/arrow-rs/pull/8589#discussion_r2423348620 ## arrow-cast/src/cast/run_array.rs: ## @@ -0,0 +1,506 @@ +use crate::cast::*; + +/// Attempts to cast a Run-End Encoded array to another type, handling both REE-to-

Re: [PR] GH-47632: [CI][C++] Add a CI job for JNI on Linux [arrow]

2025-10-11 Thread via GitHub
kou commented on PR #47746: URL: https://github.com/apache/arrow/pull/47746#issuecomment-3393255722 @raulcd Can we merge this for 22.0.0? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the speci

Re: [I] [Docs][Python] Move Conda instructions to second option in dev docs setup [arrow]

2025-10-11 Thread via GitHub
xhochy commented on issue #47735: URL: https://github.com/apache/arrow/issues/47735#issuecomment-3376712945 > Given that this has been mentioned as a tricker setup, let's make things easier for newbies and move this section down and start with the other one. As the one who got through

[PR] [Parquet] Remove unused parquet::format crate [arrow-rs]

2025-10-11 Thread via GitHub
lichuang opened a new pull request, #8593: URL: https://github.com/apache/arrow-rs/pull/8593 # Which issue does this PR close? Remove unused parquet::format crate - Closes #8572. # Rationale for this change Why are you proposing this change? If this is already expl

Re: [PR] [Draft] Implemented casting for RunEnd Encoding (pt2) [arrow-rs]

2025-10-11 Thread via GitHub
vegarsti commented on PR #8384: URL: https://github.com/apache/arrow-rs/pull/8384#issuecomment-3392934938 Opened https://github.com/apache/arrow-rs/pull/8589 cc @brancz @Rich-T-kid -- This is an automated message from the Apache Git Service. To respond to the message, please log on to Git

Re: [PR] GH-47784: [C++] Patch vendored pcg library to enable msvc arm64 intrinsics [arrow]

2025-10-11 Thread via GitHub
kou commented on PR #47779: URL: https://github.com/apache/arrow/pull/47779#issuecomment-3393238782 Hmm. It seems that https://github.com/imneme/pcg-cpp isn't maintained... Do we have alternative implementation...? https://github.com/imneme/pcg-cpp was vendored by #8879. @pitrou

Re: [PR] GH-47438: Set up wheel building for Python 3.14 [arrow]

2025-10-11 Thread via GitHub
AlenkaF commented on PR #47616: URL: https://github.com/apache/arrow/pull/47616#issuecomment-3375133545 Thank you @raulcd ! (My archery docker run is still in progress 🤷‍♀️) I will look at the free-threaded changes in the meantime and try to fix any issues. -- This is an automated mess

Re: [PR] feat: Enhance `Map` display formatting in DataType [arrow-rs]

2025-10-11 Thread via GitHub
alamb merged PR #8570: URL: https://github.com/apache/arrow-rs/pull/8570 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache

Re: [PR] Use `Arc` to reduce size of ParquetMetadata and avoid copying when `encryption` is enabled [arrow-rs]

2025-10-11 Thread via GitHub
alamb commented on PR #8470: URL: https://github.com/apache/arrow-rs/pull/8470#issuecomment-3393299690 No problems, -- I will get it ship shape! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to th

Re: [PR] GH-47588: [C++] Bump mimalloc version to 3.1.5 [arrow]

2025-10-11 Thread via GitHub
github-actions[bot] commented on PR #47589: URL: https://github.com/apache/arrow/pull/47589#issuecomment-3379984069 ``` Invalid group(s) {'linux'}. Must be one of {'verify-rc-source-linux', 'homebrew', 'r', 'fuzz', 'c-glib', 'ruby', 'example-cpp', 'verify-rc-source-macos', 'conan', 'cpp'

Re: [PR] Casting support for RunEndEncoded arrays [arrow-rs]

2025-10-11 Thread via GitHub
vegarsti commented on code in PR #8589: URL: https://github.com/apache/arrow-rs/pull/8589#discussion_r2422973686 ## arrow-cast/src/cast/run_array.rs: ## @@ -0,0 +1,506 @@ +use crate::cast::*; + +/// Attempts to cast a Run-End Encoded array to another type, handling both REE-to-

Re: [I] [EPIC] Faster performance for parquet predicate evaluation for non selective filters [arrow-rs]

2025-10-11 Thread via GitHub
XiangpengHao commented on issue #7456: URL: https://github.com/apache/arrow-rs/issues/7456#issuecomment-3393476392 @hhhizzz go for it! I’ve been quite busy lately and probably won’t be able to pick it up for a few months, so it’d be awesome if you take the lead. -- This is an automated me

Re: [PR] Skip redundant validation checks in RecordBatch#project [arrow-rs]

2025-10-11 Thread via GitHub
pepijnve commented on PR #8583: URL: https://github.com/apache/arrow-rs/pull/8583#issuecomment-3393125414 Some micro benchmark results ``` project/100x100 -> 1x100 time: [85.588 ns 85.747 ns 85.903 ns] change: [-7.3087% -7.0345

[I] Eliminate redundant validation in `RecordBatch::project` [arrow-rs]

2025-10-11 Thread via GitHub
pepijnve opened a new issue, #8591: URL: https://github.com/apache/arrow-rs/issues/8591 **Is your feature request related to a problem or challenge? Please describe what you are trying to do.** `RecordBatch::project` currently uses the validating constructor function `try_new_with_op

Re: [PR] GH-42173: [R][S3] Writing partitionned dataset with Rarrow on S3 failed if ListBucket is not allowed for the user [arrow]

2025-10-11 Thread via GitHub
simonelbaz commented on PR #47599: URL: https://github.com/apache/arrow/pull/47599#issuecomment-3388966317 Hi, @jonkeane @thisisnic thanks for any comment || review -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and us

Re: [PR] GH-46465: [C++][FlightRPC] Refactor ODBC namespaces and file structure [arrow]

2025-10-11 Thread via GitHub
lidavidm merged PR #47703: URL: https://github.com/apache/arrow/pull/47703 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apac

Re: [I] [CI][Python] ORC test_timezone_absent is not raising expected Exception [arrow]

2025-10-11 Thread via GitHub
raulcd commented on issue #47778: URL: https://github.com/apache/arrow/issues/47778#issuecomment-3390538940 I've validated locally that I can reproduce the issue with: ``` $ archery docker run -e SETUPTOOLS_SCM_PRETEND_VERSION="22.0.0.dev199" ubuntu-python ``` I've also valid

Re: [I] Troubleshooting bulk insert performance of Snowflake connector [arrow-adbc]

2025-10-11 Thread via GitHub
CurtHagenlocher commented on issue #3480: URL: https://github.com/apache/arrow-adbc/issues/3480#issuecomment-3390548348 Thanks @sfc-gh-pfus! This is very interesting. What the ADBC driver is actually doing is creating the stage with those options ``` CREATE OR REPLACE TEMPORARY STAGE A

Re: [PR] GH-46411: [C++] Implemented dataset option in Meson [arrow]

2025-10-11 Thread via GitHub
kou commented on code in PR #47669: URL: https://github.com/apache/arrow/pull/47669#discussion_r2422887917 ## cpp/meson.build: ## @@ -52,24 +52,28 @@ if git_description == '' and not meson.is_subproject() endif needs_benchmarks = get_option('benchmarks').enabled() -needs_com

Re: [PR] [thrift-remodel] Refactor Thrift encryption and store encodings as bitmask [arrow-rs]

2025-10-11 Thread via GitHub
etseidl commented on code in PR #8587: URL: https://github.com/apache/arrow-rs/pull/8587#discussion_r2423089770 ## parquet/src/basic.rs: ## @@ -724,6 +725,123 @@ impl FromStr for Encoding { } } +/// A bitmask representing the [`Encoding`]s employed while encoding a Parqu