Re: [I] [Packaging] Drop support for Ubuntu 23.04 [arrow]

2023-11-29 Thread via GitHub


raulcd closed issue #38909: [Packaging] Drop support for Ubuntu 23.04
URL: https://github.com/apache/arrow/issues/38909


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [Python][[Packaging] Wheels are failing for manylinux python 3.8 due to pip not being found [arrow]

2023-11-29 Thread via GitHub


raulcd opened a new issue, #38984:
URL: https://github.com/apache/arrow/issues/38984

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   The wheels for manylinux have been failing for the last 6-7 days:
   * 
[wheel-manylinux-2-28-cp38-amd64](https://github.com/ursacomputing/crossbow/actions/runs/7016111614/job/19086656171)
   * 
[wheel-manylinux-2-28-cp38-arm64](https://github.com/ursacomputing/crossbow/actions/runs/7016110429/job/19086653225)
   * 
[wheel-manylinux-2014-cp38-amd64](https://github.com/ursacomputing/crossbow/actions/runs/7016112297/job/19086658977)
   * 
[wheel-manylinux-2014-cp38-arm64](https://github.com/ursacomputing/crossbow/actions/runs/7016110200/job/19086652161)
   
   This is reproducible on the maintenance branch for 14.0.2 and has to be 
fixed for the release.
   The error is:
   ```
#10 98.73 npm notice 
   #10 98.74 + python3 -m pip install -U pip
   #10 98.75 /usr/bin/python3: No module named pip
   #10 ERROR: process "/bin/sh -c /setup-rhel-rebuilds.sh && rm 
/setup-rhel-rebuilds.sh && dnf -y clean all" did not complete successfully: 
exit code: 1
   ```
   
   ### Component(s)
   
   Packaging, Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [R][Release] {docs/source/_static,r/pkgdown/assets}/versions.json have an invalid entry [arrow]

2023-11-29 Thread via GitHub


raulcd closed issue #38981: [R][Release] 
{docs/source/_static,r/pkgdown/assets}/versions.json have an invalid entry
URL: https://github.com/apache/arrow/issues/38981


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [C++][Gandiva] Refactor function holder to return arrow Result [arrow]

2023-11-29 Thread via GitHub


pitrou closed issue #38920: [C++][Gandiva] Refactor function holder to return 
arrow Result
URL: https://github.com/apache/arrow/issues/38920


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [Go] Allow access to the underlying `MemoTable` of a dictionary builder [arrow]

2023-11-29 Thread via GitHub


ella-chao opened a new issue, #38988:
URL: https://github.com/apache/arrow/issues/38988

   ### Describe the enhancement requested
   
   I have a case where knowing the size of the dictionary as values get 
appended to the dictionary builder will be useful. Specifically, I am indexing 
data where the number of unique values is unknown. As the number of unique 
values is more likely to be relatively small in this case, a 
`BinaryDictionaryBuilder` is used and only when it is detected that the 
dictionary will be too big do I fall back to a `LargeStringBuilder`.
   
   The issue is that there is no easy way to figure out the size of the 
dictionary in a `BinaryDictionaryBuilder` today. As a workaround, after each 
`AppendString` to the `BinaryDictionaryBuilder` I do the following
   ```
   lastDictIndex := 
bldrDictString.(*arrowarray.BinaryDictionaryBuilder).GetValueIndex(i)
   if lastDictIndex+1 > cardinality {
   cardinality = lastDictIndex + 1
   }
   ```
   where `i` is the index of the value appended.
   
   It would be more convenient and potentially less costly if the `MemoTable` 
or even just the size of the dictionary is exposed. Do you think this is 
something that you will be open to? I will be happy to open a PR if so.
   
   
   
   ### Component(s)
   
   Go


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [Java] NullVector field name can't be set [arrow]

2023-11-29 Thread via GitHub


danepitkin closed issue #27275: [Java] NullVector field name can't be set
URL: https://github.com/apache/arrow/issues/27275


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [Java][flight-sql-jdbc] Expose app_metadata for FlightInfo through JDBC interfaces [arrow]

2023-11-29 Thread via GitHub


lidavidm closed issue #38024: [Java][flight-sql-jdbc] Expose app_metadata for 
FlightInfo through JDBC interfaces
URL: https://github.com/apache/arrow/issues/38024


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [C++][Gandiva] to_date_utf8_utf8_int32 function's suppress_errors parameter causes segfault for non literal input [arrow]

2023-11-29 Thread via GitHub


kou closed issue #38879: [C++][Gandiva] to_date_utf8_utf8_int32 function's 
suppress_errors parameter causes segfault for non literal input
URL: https://github.com/apache/arrow/issues/38879


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [R] Remove automatic download of `cmake` when not found [arrow]

2023-11-29 Thread via GitHub


paleolimbot closed issue #38047: [R] Remove automatic download of `cmake` when 
not found
URL: https://github.com/apache/arrow/issues/38047


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [R] Fix failing windows build on CI [arrow]

2023-11-29 Thread via GitHub


paleolimbot closed issue #38906: [R] Fix failing windows build on CI
URL: https://github.com/apache/arrow/issues/38906


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] java: update to Arrow 14 [arrow-adbc]

2023-11-29 Thread via GitHub


lidavidm closed issue #1095: java: update to Arrow 14
URL: https://github.com/apache/arrow-adbc/issues/1095


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [Python] DB-API 2.0 execute query always call `createPreparedStatement` first [arrow]

2023-11-29 Thread via GitHub


xinyiZzz closed issue #38786: [Python] DB-API 2.0 execute query always call 
`createPreparedStatement` first
URL: https://github.com/apache/arrow/issues/38786


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [C++][Parquet] Using length to optimize bloom filter read [arrow]

2023-11-29 Thread via GitHub


pitrou closed issue #38860: [C++][Parquet] Using length to optimize bloom 
filter read
URL: https://github.com/apache/arrow/issues/38860


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] csharp/src/Drivers/BigQuery: Failure when returning empty schema data [arrow-adbc]

2023-11-29 Thread via GitHub


CurtHagenlocher closed issue #1329: csharp/src/Drivers/BigQuery: Failure when 
returning empty schema data
URL: https://github.com/apache/arrow-adbc/issues/1329


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] Add string view to the C ABI [arrow]

2023-11-29 Thread via GitHub


bkietz closed issue #36099: Add string view to the C ABI
URL: https://github.com/apache/arrow/issues/36099


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [Java] When loading NullVector, valueCount should be properly set [arrow]

2023-11-29 Thread via GitHub


pitrou closed issue #34610: [Java] When loading NullVector, valueCount should 
be properly set
URL: https://github.com/apache/arrow/issues/34610


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [Java] Build memory-core and memory-unsafe as JPMS modules [arrow]

2023-11-29 Thread via GitHub


jduo opened a new issue, #38998:
URL: https://github.com/apache/arrow/issues/38998

   ### Describe the enhancement requested
   
   Build memory-core and memory-unsafe as JPMS modules.
   
   - Refactor the memory module's structures to faciliate this.
   - Update arrow-vector's module-info.java file to depend on memory-core's 
   
   These modules use Unsafe and reflection and require adding --add-opens 
command line flags since they are now named modules.
   
   Note that memory-netty is omitted because it requires a bigger change to the 
user's command-line.
   
   ### Component(s)
   
   Java


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [C++][FS][Azure] Add allow_container_deletion option [arrow]

2023-11-29 Thread via GitHub


kou opened a new issue, #38999:
URL: https://github.com/apache/arrow/issues/38999

   ### Describe the enhancement requested
   
   S3 filesystem have the `allow_bucket_deletion` option.
   
   See also the discussion at 
https://github.com/apache/arrow/pull/3#discussion_r1409832642 .
   
   This is a child of https://github.com/apache/arrow/issues/18014.
   
   ### Component(s)
   
   C++


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [Java] Update dependencies and plugins for JPMS module support [arrow]

2023-11-29 Thread via GitHub


jduo opened a new issue, #38996:
URL: https://github.com/apache/arrow/issues/38996

   ### Describe the enhancement requested
   
   Several dependencies and plugins need to be updated to support modules:
   
   - gRPC 1.59 has improved module support
   - CycloneDX needs to be upgraded to work with gRPC 1.59.
   - Standardize projects to use the same version of maven-shade-plugin when 
possible.
   - maven-dependency-plugin needs an update to be able to read 
module-info.class files.
   - Jackson 2.15.1 has a bug that prevents modules from loading it correctly.
   - maven-compiler-plugin needs to be updated to 3.11.0 to allow turning off 
use of the module-path
   
   
   ### Component(s)
   
   Java


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [C#] Spelling errors identified by check-spelling [arrow]

2023-11-29 Thread via GitHub


CurtHagenlocher closed issue #38942: [C#] Spelling errors identified by 
check-spelling
URL: https://github.com/apache/arrow/issues/38942


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [Java] Build memory-netty as a JPMS module [arrow]

2023-11-29 Thread via GitHub


jduo opened a new issue, #39000:
URL: https://github.com/apache/arrow/issues/39000

   ### Describe the enhancement requested
   
   Build memory-netty as a JPMS module.
   
   This change requires patching Netty's public module since we add classes to 
exposed packages and call package-private functions.
   
   This requires users to add a patch-module command to their java invocation. 
It is somewhat inconvenient because it needs to reference one of our delivered 
JAR filenames (which vary by location and version)
   
   ### Component(s)
   
   Java


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] go/adbc/driver/snowflake: Without the DB name the GetObjects call fails [arrow-adbc]

2023-11-29 Thread via GitHub


ryan-syed opened a new issue, #1332:
URL: https://github.com/apache/arrow-adbc/issues/1332

   Currently the Snowflake Go driver needs the database name to make the 
`GetObjects` call. However, ODBC based driver just requires the server name and 
warehouse. To make the experience more inline with other existing snowflake 
drivers, should the `GetObjects` be able get the information without needing a 
database being passed as a default.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [Java] Upgrade to flatc version 23.5.26 [arrow]

2023-11-29 Thread via GitHub


danepitkin opened a new issue, #38990:
URL: https://github.com/apache/arrow/issues/38990

   ### Describe the enhancement requested
   
   flatc v23.5.26 is the same version now used in C++. See 
https://github.com/apache/arrow/pull/38650#issuecomment-180420
   
   ### Component(s)
   
   Java


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [Java] Build arrow-format and arrow-vector as JPMS modules [arrow]

2023-11-29 Thread via GitHub


jduo opened a new issue, #38997:
URL: https://github.com/apache/arrow/issues/38997

   ### Describe the enhancement requested
   
   Bulid the first set of core arrow dependencies as JPMS modules while 
retaining JDK 8 support.
   
   Note that arrow-memory and related modules are not included because they 
require more changes on the Java command line (due to use of reflection, 
Unsafe, and needing to patch netty's module).
   
   ### Component(s)
   
   Java


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] [Java] Build remaining modules as JPMS modules [arrow]

2023-11-29 Thread via GitHub


jduo opened a new issue, #39001:
URL: https://github.com/apache/arrow/issues/39001

   ### Describe the enhancement requested
   
   Build the following modules as JPMS module:
   - adapter modules
   - algorithm
   - c
   - compression
   - dataset
   - flight-core
   - flight-sql
   - gandiva
   - tools
   
   flight-sql-jdbc-driver should not be built as a module since it's not 
intended to be statically linked.
   Likely performance and flight-integration-tests should not either since they 
are for testing purposes only.
   
   
   ### Component(s)
   
   Java


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [C++] xsimd is used in public headers [arrow]

2023-11-29 Thread via GitHub


pitrou closed issue #38907: [C++] xsimd is used in public headers
URL: https://github.com/apache/arrow/issues/38907


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[I] Function 'binary_join' has no kernel matching input types (list, string) [arrow]

2023-11-29 Thread via GitHub


rami-lv opened a new issue, #38986:
URL: https://github.com/apache/arrow/issues/38986

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   `binary_join` function from `pyarrow.compute` works for `list ` but not for `list` 
   I tried casting but that also did not work.
   
   The parquet file was produced by Apache Spark, I will try to come up with a 
script to reproduce the bug if necessary. 
   
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org