Re: [I] [Packaging] Drop support for Ubuntu 23.04 [arrow]
raulcd closed issue #38909: [Packaging] Drop support for Ubuntu 23.04 URL: https://github.com/apache/arrow/issues/38909 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[I] [Python][Packaging] Wheels are failing for manylinux python 3.8 due to pip not being found [arrow]
raulcd opened a new issue, #38984:
URL: https://github.com/apache/arrow/issues/38984

### Describe the bug, including details regarding any error messages, version, and platform.

The wheels for manylinux have been failing for the last 6-7 days:

* [wheel-manylinux-2-28-cp38-amd64](https://github.com/ursacomputing/crossbow/actions/runs/7016111614/job/19086656171)
* [wheel-manylinux-2-28-cp38-arm64](https://github.com/ursacomputing/crossbow/actions/runs/7016110429/job/19086653225)
* [wheel-manylinux-2014-cp38-amd64](https://github.com/ursacomputing/crossbow/actions/runs/7016112297/job/19086658977)
* [wheel-manylinux-2014-cp38-arm64](https://github.com/ursacomputing/crossbow/actions/runs/7016110200/job/19086652161)

This is reproducible on the maintenance branch for 14.0.2 and has to be fixed for the release. The error is:

```
#10 98.73 npm notice
#10 98.74 + python3 -m pip install -U pip
#10 98.75 /usr/bin/python3: No module named pip
#10 ERROR: process "/bin/sh -c /setup-rhel-rebuilds.sh && rm /setup-rhel-rebuilds.sh && dnf -y clean all" did not complete successfully: exit code: 1
```

### Component(s)

Packaging, Python
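The failing step in the log is `python3 -m pip install -U pip`, run by an interpreter that has no `pip` module. The following is a minimal sketch of a pre-flight check a build script could run before that step; it is not the fix adopted for this issue. It tests whether `pip` is importable and, if not, reports the standard-library `ensurepip` bootstrap command. The function names `has_module` and `pip_bootstrap_command` are hypothetical, and the sketch assumes the interpreter still ships `ensurepip`, which some distribution packages remove.

```python
import importlib.util
import sys
from typing import List


def has_module(name: str) -> bool:
    """Return True if the top-level module `name` can be found by this interpreter."""
    return importlib.util.find_spec(name) is not None


def pip_bootstrap_command(python: str = sys.executable) -> List[str]:
    """Build the command that installs pip from the bundled ensurepip module.

    Assumption: the interpreter ships ensurepip; some distro builds strip it.
    """
    return [python, "-m", "ensurepip", "--upgrade"]


if __name__ == "__main__":
    if has_module("pip"):
        print("pip is available")
    else:
        print("pip is missing; try:", " ".join(pip_bootstrap_command()))
```

Running the check before the upgrade step would turn the opaque `No module named pip` failure into an explicit message naming the missing prerequisite.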
Re: [I] [R][Release] {docs/source/_static,r/pkgdown/assets}/versions.json have an invalid entry [arrow]
raulcd closed issue #38981: [R][Release] {docs/source/_static,r/pkgdown/assets}/versions.json have an invalid entry URL: https://github.com/apache/arrow/issues/38981
Re: [I] [C++][Gandiva] Refactor function holder to return arrow Result [arrow]
pitrou closed issue #38920: [C++][Gandiva] Refactor function holder to return arrow Result URL: https://github.com/apache/arrow/issues/38920
[I] [Go] Allow access to the underlying `MemoTable` of a dictionary builder [arrow]
ella-chao opened a new issue, #38988:
URL: https://github.com/apache/arrow/issues/38988

### Describe the enhancement requested

I have a case where knowing the size of the dictionary as values get appended to the dictionary builder will be useful. Specifically, I am indexing data where the number of unique values is unknown. As the number of unique values is more likely to be relatively small in this case, a `BinaryDictionaryBuilder` is used and only when it is detected that the dictionary will be too big do I fall back to a `LargeStringBuilder`.

The issue is that there is no easy way to figure out the size of the dictionary in a `BinaryDictionaryBuilder` today. As a workaround, after each `AppendString` to the `BinaryDictionaryBuilder` I do the following

```
lastDictIndex := bldrDictString.(*arrowarray.BinaryDictionaryBuilder).GetValueIndex(i)
if lastDictIndex+1 > cardinality {
    cardinality = lastDictIndex + 1
}
```

where `i` is the index of the value appended.

It would be more convenient and potentially less costly if the `MemoTable` or even just the size of the dictionary is exposed. Do you think this is something that you will be open to? I will be happy to open a PR if so.

### Component(s)

Go
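The request can be illustrated independently of Arrow. The following is a minimal Python sketch, not the Arrow Go API: a hypothetical `DictBuilder` keeps a memo table mapping each distinct value to its index and exposes the dictionary size directly, so the caller can decide when to fall back without tracking the largest index after every append. The class, its methods, the `build` helper, and the `MAX_DICT_SIZE` threshold are all illustrative assumptions.

```python
from typing import Dict, List, Union

MAX_DICT_SIZE = 3  # hypothetical threshold for abandoning dictionary encoding


class DictBuilder:
    """Toy dictionary encoder: each value is stored once, rows hold indices."""

    def __init__(self) -> None:
        self._memo: Dict[str, int] = {}  # value -> dictionary index
        self._indices: List[int] = []    # one dictionary index per appended row

    def append(self, value: str) -> None:
        # setdefault assigns the next free index only for unseen values.
        index = self._memo.setdefault(value, len(self._memo))
        self._indices.append(index)

    def dictionary_size(self) -> int:
        """Number of distinct values seen so far."""
        return len(self._memo)

    def decode(self) -> List[str]:
        """Materialize plain values, e.g. when switching to a plain builder."""
        values = list(self._memo)  # dicts preserve insertion order
        return [values[i] for i in self._indices]


def build(rows: List[str]) -> Union[DictBuilder, List[str]]:
    """Dictionary-encode rows, falling back to a plain list on high cardinality."""
    builder = DictBuilder()
    plain = None
    for row in rows:
        if plain is not None:
            plain.append(row)
            continue
        builder.append(row)
        if builder.dictionary_size() > MAX_DICT_SIZE:
            plain = builder.decode()  # carry over rows appended so far
    return plain if plain is not None else builder
```

With the size exposed, the fallback check is a single comparison per append rather than a lookup of the last appended value's index.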
Re: [I] [Java] NullVector field name can't be set [arrow]
danepitkin closed issue #27275: [Java] NullVector field name can't be set URL: https://github.com/apache/arrow/issues/27275
Re: [I] [Java][flight-sql-jdbc] Expose app_metadata for FlightInfo through JDBC interfaces [arrow]
lidavidm closed issue #38024: [Java][flight-sql-jdbc] Expose app_metadata for FlightInfo through JDBC interfaces URL: https://github.com/apache/arrow/issues/38024
Re: [I] [C++][Gandiva] to_date_utf8_utf8_int32 function's suppress_errors parameter causes segfault for non literal input [arrow]
kou closed issue #38879: [C++][Gandiva] to_date_utf8_utf8_int32 function's suppress_errors parameter causes segfault for non literal input URL: https://github.com/apache/arrow/issues/38879
Re: [I] [R] Remove automatic download of `cmake` when not found [arrow]
paleolimbot closed issue #38047: [R] Remove automatic download of `cmake` when not found URL: https://github.com/apache/arrow/issues/38047
Re: [I] [R] Fix failing windows build on CI [arrow]
paleolimbot closed issue #38906: [R] Fix failing windows build on CI URL: https://github.com/apache/arrow/issues/38906
Re: [I] java: update to Arrow 14 [arrow-adbc]
lidavidm closed issue #1095: java: update to Arrow 14 URL: https://github.com/apache/arrow-adbc/issues/1095
Re: [I] [Python] DB-API 2.0 execute query always call `createPreparedStatement` first [arrow]
xinyiZzz closed issue #38786: [Python] DB-API 2.0 execute query always call `createPreparedStatement` first URL: https://github.com/apache/arrow/issues/38786
Re: [I] [C++][Parquet] Using length to optimize bloom filter read [arrow]
pitrou closed issue #38860: [C++][Parquet] Using length to optimize bloom filter read URL: https://github.com/apache/arrow/issues/38860
Re: [I] csharp/src/Drivers/BigQuery: Failure when returning empty schema data [arrow-adbc]
CurtHagenlocher closed issue #1329: csharp/src/Drivers/BigQuery: Failure when returning empty schema data URL: https://github.com/apache/arrow-adbc/issues/1329
Re: [I] Add string view to the C ABI [arrow]
bkietz closed issue #36099: Add string view to the C ABI URL: https://github.com/apache/arrow/issues/36099
Re: [I] [Java] When loading NullVector, valueCount should be properly set [arrow]
pitrou closed issue #34610: [Java] When loading NullVector, valueCount should be properly set URL: https://github.com/apache/arrow/issues/34610
[I] [Java] Build memory-core and memory-unsafe as JPMS modules [arrow]
jduo opened a new issue, #38998:
URL: https://github.com/apache/arrow/issues/38998

### Describe the enhancement requested

Build memory-core and memory-unsafe as JPMS modules.

- Refactor the memory module's structures to facilitate this.
- Update arrow-vector's module-info.java file to depend on memory-core's

These modules use Unsafe and reflection and require adding --add-opens command line flags since they are now named modules.

Note that memory-netty is omitted because it requires a bigger change to the user's command-line.

### Component(s)

Java
[I] [C++][FS][Azure] Add allow_container_deletion option [arrow]
kou opened a new issue, #38999:
URL: https://github.com/apache/arrow/issues/38999

### Describe the enhancement requested

The S3 filesystem has the `allow_bucket_deletion` option.

See also the discussion at https://github.com/apache/arrow/pull/3#discussion_r1409832642 .

This is a child of https://github.com/apache/arrow/issues/18014.

### Component(s)

C++
[I] [Java] Update dependencies and plugins for JPMS module support [arrow]
jduo opened a new issue, #38996:
URL: https://github.com/apache/arrow/issues/38996

### Describe the enhancement requested

Several dependencies and plugins need to be updated to support modules:

- gRPC 1.59 has improved module support.
- CycloneDX needs to be upgraded to work with gRPC 1.59.
- Standardize projects to use the same version of maven-shade-plugin when possible.
- maven-dependency-plugin needs an update to be able to read module-info.class files.
- Jackson 2.15.1 has a bug that prevents modules from loading it correctly.
- maven-compiler-plugin needs to be updated to 3.11.0 to allow turning off use of the module-path.

### Component(s)

Java
Re: [I] [C#] Spelling errors identified by check-spelling [arrow]
CurtHagenlocher closed issue #38942: [C#] Spelling errors identified by check-spelling URL: https://github.com/apache/arrow/issues/38942
[I] [Java] Build memory-netty as a JPMS module [arrow]
jduo opened a new issue, #39000:
URL: https://github.com/apache/arrow/issues/39000

### Describe the enhancement requested

Build memory-netty as a JPMS module.

This change requires patching Netty's public module since we add classes to exposed packages and call package-private functions. This requires users to add a patch-module command to their java invocation. It is somewhat inconvenient because it needs to reference one of our delivered JAR filenames (which vary by location and version).

### Component(s)

Java
[I] go/adbc/driver/snowflake: Without the DB name the GetObjects call fails [arrow-adbc]
ryan-syed opened a new issue, #1332:
URL: https://github.com/apache/arrow-adbc/issues/1332

Currently the Snowflake Go driver needs the database name to make the `GetObjects` call. However, the ODBC-based driver requires only the server name and warehouse. To bring the experience more in line with other existing Snowflake drivers, should `GetObjects` be able to get the information without a default database being passed?
[I] [Java] Upgrade to flatc version 23.5.26 [arrow]
danepitkin opened a new issue, #38990:
URL: https://github.com/apache/arrow/issues/38990

### Describe the enhancement requested

flatc v23.5.26 is the same version now used in C++. See https://github.com/apache/arrow/pull/38650#issuecomment-180420

### Component(s)

Java
[I] [Java] Build arrow-format and arrow-vector as JPMS modules [arrow]
jduo opened a new issue, #38997:
URL: https://github.com/apache/arrow/issues/38997

### Describe the enhancement requested

Build the first set of core arrow dependencies as JPMS modules while retaining JDK 8 support.

Note that arrow-memory and related modules are not included because they require more changes on the Java command line (due to use of reflection, Unsafe, and needing to patch netty's module).

### Component(s)

Java
[I] [Java] Build remaining modules as JPMS modules [arrow]
jduo opened a new issue, #39001:
URL: https://github.com/apache/arrow/issues/39001

### Describe the enhancement requested

Build the following modules as JPMS modules:

- adapter modules
- algorithm
- c
- compression
- dataset
- flight-core
- flight-sql
- gandiva
- tools

flight-sql-jdbc-driver should not be built as a module since it's not intended to be statically linked. The performance and flight-integration-tests modules likely should not be either, since they are for testing purposes only.

### Component(s)

Java
Re: [I] [C++] xsimd is used in public headers [arrow]
pitrou closed issue #38907: [C++] xsimd is used in public headers URL: https://github.com/apache/arrow/issues/38907
[I] Function 'binary_join' has no kernel matching input types (list, string) [arrow]
rami-lv opened a new issue, #38986:
URL: https://github.com/apache/arrow/issues/38986

### Describe the bug, including details regarding any error messages, version, and platform.

The `binary_join` function from `pyarrow.compute` works for `list ` but not for `list`. I tried casting, but that also did not work.

The parquet file was produced by Apache Spark. I will try to come up with a script to reproduce the bug if necessary.

### Component(s)

Python