[I] [C#] Add initial GObject based Dataset library [arrow]

2024-05-29 Thread via GitHub
adamreeve opened a new issue, #41885: URL: https://github.com/apache/arrow/issues/41885 ### Describe the enhancement requested As discussed in [this mailing list thread](https://lists.apache.org/thread/5jfk0fcqy90cl8w6v45ny50pwrgfpp1y), I think it would be beneficial to use the

[I] RecordBatchReader cast() errors introduced in pyarrow 16 [arrow]

2024-05-29 Thread via GitHub
davlee1972 opened a new issue, #41884: URL: https://github.com/apache/arrow/issues/41884 ### Describe the bug, including details regarding any error messages, version, and platform. RecordBatchReader.cast() was added in pyarrow 16.. Error: pyarrow.lib.ArrowTypeError: Field 0

[I] [DISCUSS] Seattle Arrow Meetup [arrow]

2024-05-29 Thread via GitHub
westonpace opened a new issue, #41881: URL: https://github.com/apache/arrow/issues/41881 ### Describe the enhancement requested I would like to offer to organize a face to face meetup for the Arrow community. ### Proposed dates: Date: Sunday August 11, 2024 or Thursday

Re: [I] [C++] ASOF join and sortedmerge join don't work in single threaded mode [arrow]

2024-05-29 Thread via GitHub
westonpace closed issue #41190: [C++] ASOF join and sortedmerge join don't work in single threaded mode URL: https://github.com/apache/arrow/issues/41190 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [I] [R] Improve error handling in the dplyr NSE code [arrow]

2024-05-29 Thread via GitHub
nealrichardson closed issue #41834: [R] Improve error handling in the dplyr NSE code URL: https://github.com/apache/arrow/issues/41834 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[I] Asof-joins inefficiently copy the left hand side [arrow]

2024-05-29 Thread via GitHub
JerAguilon opened a new issue, #41873: URL: https://github.com/apache/arrow/issues/41873 ### Describe the enhancement requested The asof-join has a big inefficiency when emitting columns from the left hand side of the join. Let me explain it visually, using a simple example from

[I] Pyarrow import fails when Arrow library is compiled with -fvisibility=hidden [arrow]

2024-05-29 Thread via GitHub
WillAyd opened a new issue, #41871: URL: https://github.com/apache/arrow/issues/41871 ### Describe the bug, including details regarding any error messages, version, and platform. I noticed when compiling the main Arrow library with -fvisibility=hidden you end up with two linker

Re: [I] [Packaging][MATLAB] Add crossbow job to package MATLAB interface on `macos-14` [arrow]

2024-05-29 Thread via GitHub
sgilmore10 closed issue #41675: [Packaging][MATLAB] Add crossbow job to package MATLAB interface on `macos-14` URL: https://github.com/apache/arrow/issues/41675 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[I] Connect timeout for Flight RPC client in Java imolementation [arrow]

2024-05-29 Thread via GitHub
contrapost opened a new issue, #41869: URL: https://github.com/apache/arrow/issues/41869 ### Describe the usage question you have. Please include as many useful details as possible. I didn't manage to find a way to configure connect timeout when setting up Flight client.

Re: [I] [Packaging][deb] libarrow1600 shoud be libarrow1601 for 16.1.0 [arrow]

2024-05-28 Thread via GitHub
kou closed issue #41679: [Packaging][deb] libarrow1600 shoud be libarrow1601 for 16.1.0 URL: https://github.com/apache/arrow/issues/41679 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [I] [Swift] Implement FFI support [arrow]

2024-05-28 Thread via GitHub
kou closed issue #37938: [Swift] Implement FFI support URL: https://github.com/apache/arrow/issues/37938 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe,

[I] [JAVA] Parallel to build JNI C++ code [arrow]

2024-05-28 Thread via GitHub
jinchengchenghh opened a new issue, #41867: URL: https://github.com/apache/arrow/issues/41867 ### Describe the enhancement requested Now we use ``` cmake --build cpp-jni --target install --config Release ``` The CI use `export CMAKE_BUILD_PARALLEL_LEVEL=NUM_THREAD`

Re: [I] [R][CI]: Remove more defunct rhub containers [arrow]

2024-05-28 Thread via GitHub
jonkeane closed issue #41841: [R][CI]: Remove more defunct rhub containers URL: https://github.com/apache/arrow/issues/41841 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [I] [C++] Consuming or closing a RecordBatchReader created from a Dataset Scanner does not close underlying files [arrow]

2024-05-28 Thread via GitHub
bkietz closed issue #41771: [C++] Consuming or closing a RecordBatchReader created from a Dataset Scanner does not close underlying files URL: https://github.com/apache/arrow/issues/41771 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

Re: [I] [MATLAB] Add C Data Interface format import/export functionality for `arrow.tabular.RecordBatch` [arrow]

2024-05-28 Thread via GitHub
sgilmore10 closed issue #41803: [MATLAB] Add C Data Interface format import/export functionality for `arrow.tabular.RecordBatch` URL: https://github.com/apache/arrow/issues/41803 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

Re: [I] [Java] Implement a function to load field buffers from external buffers for StringView [arrow]

2024-05-28 Thread via GitHub
vibhatha closed issue #40931: [Java] Implement a function to load field buffers from external buffers for StringView URL: https://github.com/apache/arrow/issues/40931 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [I] [Java] Implement a strategy to return variable width buffer count for StringView in TypeLayout [arrow]

2024-05-28 Thread via GitHub
vibhatha closed issue #40935: [Java] Implement a strategy to return variable width buffer count for StringView in TypeLayout URL: https://github.com/apache/arrow/issues/40935 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

Re: [I] [Java] TypeLayout enhancement to support StringView [arrow]

2024-05-28 Thread via GitHub
vibhatha closed issue #40934: [Java] TypeLayout enhancement to support StringView URL: https://github.com/apache/arrow/issues/40934 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[I] Support LZ4_RAW for parquet writing [arrow]

2024-05-28 Thread via GitHub
douglas-raillard-arm opened a new issue, #41863: URL: https://github.com/apache/arrow/issues/41863 ### Describe the enhancement requested `pyarrow.dataset.write_dataset(compression='lz4_raw')` currently fails with: ``` Traceback (most recent call last): File

[I] Thread deadlock in ObjectOutputStream [arrow]

2024-05-28 Thread via GitHub
icexelloss opened a new issue, #41862: URL: https://github.com/apache/arrow/issues/41862 ### Describe the bug, including details regarding any error messages, version, and platform. I am seeing a deadlock when destructing an ObjectOutputStream. I have attached the stack trace.

[I] How to concatenate multiple tables in one parquet? [arrow]

2024-05-27 Thread via GitHub
zliucd opened a new issue, #41858: URL: https://github.com/apache/arrow/issues/41858 ### Describe the usage question you have. Please include as many useful details as possible. Hi, It's possible to write multiple tables in a single parquet by appending each rows from

Re: [I] [Packaging][RPM] Mismatch between package version and library version in naming [arrow]

2024-05-27 Thread via GitHub
kou closed issue #41784: [Packaging][RPM] Mismatch between package version and library version in naming URL: https://github.com/apache/arrow/issues/41784 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[I] Error repeating df.to_parquet in pytest: "pyarrow.lib.ArrowKeyError: A type extension with name pandas.period already defined" [arrow]

2024-05-27 Thread via GitHub
bjfar opened a new issue, #41857: URL: https://github.com/apache/arrow/issues/41857 ### Describe the bug, including details regarding any error messages, version, and platform. Python version: 3.10.14 pyarrow version: 16.1.0 pandas version: 2.2.2 pytest version: 8.2.1

Re: [I] arrow flight sql jdbc drive with Lz4Compression [arrow]

2024-05-27 Thread via GitHub
kou closed issue #41456: arrow flight sql jdbc drive with Lz4Compression URL: https://github.com/apache/arrow/issues/41456 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[I] [CI][Packaging] Fix conda arrow-nightlies channel [arrow]

2024-05-27 Thread via GitHub
amoeba opened a new issue, #41856: URL: https://github.com/apache/arrow/issues/41856 ### Describe the bug, including details regarding any error messages, version, and platform. The Conda [arrow-nightlies channel is

[I] [R][CI]: Remove more defunct rhub containers [arrow]

2024-05-27 Thread via GitHub
jonkeane opened a new issue, #41841: URL: https://github.com/apache/arrow/issues/41841 ### Describe the enhancement requested In debugging a CRAN submission, found another location where we are using the stale rhub containers. ### Component(s) Continuous Integration, R

[I] [Format][FlightRPC] Flight SQL evolution [arrow]

2024-05-27 Thread via GitHub
lidavidm opened a new issue, #41840: URL: https://github.com/apache/arrow/issues/41840 ### Describe the enhancement requested From https://github.com/apache/arrow-rs/issues/5731#issuecomment-2133104504 Originally Flight RPC was implemented as a framework wrapping gRPC. This was

Re: [I] [C++] take into account orc's capabilities for finding tzdb [arrow]

2024-05-27 Thread via GitHub
kou closed issue #41755: [C++] take into account orc's capabilities for finding tzdb URL: https://github.com/apache/arrow/issues/41755 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[I] Add support for FileIO [arrow-julia]

2024-05-27 Thread via GitHub
Beforerr opened a new issue, #507: URL: https://github.com/apache/arrow-julia/issues/507 It is registered in FileIO however neither `load` nor `fileio_load` is defined. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[I] Issue using open_dataset() in r4.4.0 [arrow]

2024-05-26 Thread via GitHub
SHEvElynP opened a new issue, #41835: URL: https://github.com/apache/arrow/issues/41835 ### Describe the usage question you have. Please include as many useful details as possible. Hello My workplace has recently moved from R4.3.2 to R4.4.0. I used to be able to do

[I] Fields within a null struct are not initialized with null values [arrow]

2024-05-26 Thread via GitHub
timsaucer opened a new issue, #41833: URL: https://github.com/apache/arrow/issues/41833 ### Describe the bug, including details regarding any error messages, version, and platform. When creating an array from a python dict, field entries of a null struct are initialized with default

Re: [I] [GLib] Allow getting a RecordBatchReader from a Dataset or Dataset Scanner [arrow]

2024-05-25 Thread via GitHub
kou closed issue #41749: [GLib] Allow getting a RecordBatchReader from a Dataset or Dataset Scanner URL: https://github.com/apache/arrow/issues/41749 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[I] [R] Update relative URLs in README to absolute paths to prevent CRAN check failures [arrow]

2024-05-25 Thread via GitHub
thisisnic opened a new issue, #41829: URL: https://github.com/apache/arrow/issues/41829 ### Describe the bug, including details regarding any error messages, version, and platform. In #40148, we updated the README, but there were some URLs in there which pointed to relative links;

Re: [I] [R] Update NEWS.md for 16.0.0 [arrow]

2024-05-25 Thread via GitHub
thisisnic closed issue #41420: [R] Update NEWS.md for 16.0.0 URL: https://github.com/apache/arrow/issues/41420 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe,

[I] [C++][Parquet][Benchmark] Adding benchmarking for reading Statistics [arrow]

2024-05-25 Thread via GitHub
mapleFU opened a new issue, #41826: URL: https://github.com/apache/arrow/issues/41826 ### Describe the enhancement requested This pr ( https://github.com/apache/arrow/pull/41761 ) does a basics for benchmarking metadata. We'd like to add more benchmarks on Statistics

Re: [I] [CI][GLib] Suppress "`unlink': Permission denied" warnings in tests on Windows [arrow]

2024-05-24 Thread via GitHub
kou closed issue #41770: [CI][GLib] Suppress "`unlink': Permission denied" warnings in tests on Windows URL: https://github.com/apache/arrow/issues/41770 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [I] python/adbc_driver_postgresql ingest NOT_IMPLEMENTED when running adbc_ingest with json array [arrow-adbc]

2024-05-24 Thread via GitHub
lidavidm closed issue #1868: python/adbc_driver_postgresql ingest NOT_IMPLEMENTED when running adbc_ingest with json array URL: https://github.com/apache/arrow-adbc/issues/1868 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

Re: [I] [Java] Enhance the `copyFrom*` functionality in StringView [arrow]

2024-05-24 Thread via GitHub
lidavidm closed issue #40933: [Java] Enhance the `copyFrom*` functionality in StringView URL: https://github.com/apache/arrow/issues/40933 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[I] [C++][Parquet] Unify normalize dictionary encoding handling [arrow]

2024-05-24 Thread via GitHub
mapleFU opened a new issue, #41818: URL: https://github.com/apache/arrow/issues/41818 ### Describe the enhancement requested This is mentioned here: https://github.com/apache/arrow/pull/40957#discussion_r1562703901 There're some points: 1.

[I] Create Meson WrapDB Entry for Arrow [arrow]

2024-05-24 Thread via GitHub
WillAyd opened a new issue, #41816: URL: https://github.com/apache/arrow/issues/41816 ### Describe the enhancement requested Meson has a rather nice collection of projects in its WrapDB, which makes it rather easy to add dependencies to your project:

[I] `pyarrow.write_feather` can't be used in `atexit` contexts to write a `pandas.DataFrame` [arrow]

2024-05-24 Thread via GitHub
pjh40 opened a new issue, #41815: URL: https://github.com/apache/arrow/issues/41815 ### Describe the bug, including details regarding any error messages, version, and platform. When `pyarrow.write_feather()` is given a `pandas.DataFrame`, `write_feather()` unconditionally calls

Re: [I] [C++] Clean up Assorted Warnings to get a clean nanoarrow build [arrow]

2024-05-24 Thread via GitHub
bkietz closed issue #41478: [C++] Clean up Assorted Warnings to get a clean nanoarrow build URL: https://github.com/apache/arrow/issues/41478 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[I] Segfault when collecting parquet dataset query results [arrow]

2024-05-24 Thread via GitHub
mrd0ll4r opened a new issue, #41813: URL: https://github.com/apache/arrow/issues/41813 ### Describe the bug, including details regarding any error messages, version, and platform. Hello! I've been using arrow with R for a while now to great success. Recently, I've re-opened an

[I] Table.from_arrow can't import nan values into a non-null float column [arrow]

2024-05-24 Thread via GitHub
lord opened a new issue, #41812: URL: https://github.com/apache/arrow/issues/41812 ### Describe the bug, including details regarding any error messages, version, and platform. This small examples fails with `ValueError: Field pyarrow.Field was non-nullable but pandas column had 1

Re: [I] [C++] Importing an extension type without `ARROW:extension:metadata` crashes [arrow]

2024-05-24 Thread via GitHub
paleolimbot closed issue #41741: [C++] Importing an extension type without `ARROW:extension:metadata` crashes URL: https://github.com/apache/arrow/issues/41741 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[I] [C++] Add Compute Kernel for Casting from union to string [arrow]

2024-05-24 Thread via GitHub
llama90 opened a new issue, #41810: URL: https://github.com/apache/arrow/issues/41810 ### Describe the enhancement requested This is a sub-issue of the issue mentioned below. - #35560 This issue is aiming to address #39182. A pull request

[I] [C++] Add Compute Kernel for Casting from map to string [arrow]

2024-05-24 Thread via GitHub
llama90 opened a new issue, #41809: URL: https://github.com/apache/arrow/issues/41809 ### Describe the enhancement requested This is a sub-issue of the issue mentioned below. - #35560 This issue is aiming to address #39182. A pull request

[I] [JAVA] Jni mvn generate-resources failed because not generate arrow-bom [arrow]

2024-05-24 Thread via GitHub
jinchengchenghh opened a new issue, #41808: URL: https://github.com/apache/arrow/issues/41808 ### Describe the bug, including details regarding any error messages, version, and platform. arrow_ep/src/arrow_ep/java# mvn generate-resources -P generate-libs-cdata-all-os

Re: [I] [Java] Adding `variadicBufferCounts` to `RecordBatch` [arrow]

2024-05-23 Thread via GitHub
lidavidm closed issue #41730: [Java] Adding `variadicBufferCounts` to `RecordBatch` URL: https://github.com/apache/arrow/issues/41730 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [I] [Java] Nullability of struct child vectors not preserved in TransferPair [arrow]

2024-05-23 Thread via GitHub
lidavidm closed issue #41686: [Java] Nullability of struct child vectors not preserved in TransferPair URL: https://github.com/apache/arrow/issues/41686 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [I] [Java] Transition from gradle-enterprise-maven-extension to develocity-maven-extension [arrow]

2024-05-23 Thread via GitHub
lidavidm closed issue #41799: [Java] Transition from gradle-enterprise-maven-extension to develocity-maven-extension URL: https://github.com/apache/arrow/issues/41799 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [I] [Java] Implement a function to retrieve reference buffers in StringView [arrow]

2024-05-23 Thread via GitHub
lidavidm closed issue #40930: [Java] Implement a function to retrieve reference buffers in StringView URL: https://github.com/apache/arrow/issues/40930 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[I] [GLib][CI] Use vcpkg for C++ dependencies when building GLib libraries with MSVC [arrow]

2024-05-23 Thread via GitHub
adamreeve opened a new issue, #41806: URL: https://github.com/apache/arrow/issues/41806 ### Describe the enhancement requested This is a follow up to #41134 and should hopefully allow building more of the GLib libraries with MSVC. Context:

Re: [I] [Java] Use immutables value-annotations instead of value artifact [arrow]

2024-05-23 Thread via GitHub
lidavidm closed issue #41789: [Java] Use immutables value-annotations instead of value artifact URL: https://github.com/apache/arrow/issues/41789 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [I] [GLib] Add support for MSVC with vcpkg [arrow]

2024-05-23 Thread via GitHub
kou closed issue #41134: [GLib] Add support for MSVC with vcpkg URL: https://github.com/apache/arrow/issues/41134 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [I] [C++] Thirdparty: Bump xsimd to 13.0.0 [arrow]

2024-05-23 Thread via GitHub
kou closed issue #41547: [C++] Thirdparty: Bump xsimd to 13.0.0 URL: https://github.com/apache/arrow/issues/41547 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [I] [C++][Flight] Flight benchmark doesn't work anymore [arrow]

2024-05-23 Thread via GitHub
kou closed issue #41780: [C++][Flight] Flight benchmark doesn't work anymore URL: https://github.com/apache/arrow/issues/41780 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[I] [Swift] Add Struct (Nested) types [arrow]

2024-05-23 Thread via GitHub
abandy opened a new issue, #41804: URL: https://github.com/apache/arrow/issues/41804 ### Describe the enhancement requested Struct (Nested) types are currently not implemented in Swift. Adding Nested types as this is required to implement other arrow features. ###

[I] [MATLAB] Add C Data Interface format import/export functionality for `arrow.tabular.RecordBatch` [arrow]

2024-05-23 Thread via GitHub
sgilmore10 opened a new issue, #41803: URL: https://github.com/apache/arrow/issues/41803 ### Describe the enhancement requested Now that #41656 has been closed, we should add MATLAB APIs for importing/exporting `arrow.tabular.RecordBatch`es using the C Data Interface format.

Re: [I] [Java] Java Cookbook fails on 16.0.0-SNAPSHOT [arrow-cookbook]

2024-05-23 Thread via GitHub
amoeba closed issue #347: [Java] Java Cookbook fails on 16.0.0-SNAPSHOT URL: https://github.com/apache/arrow-cookbook/issues/347 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [I] [C++][Parquet][Doc] Denote PARQUET:field_id in parquet.rst [arrow]

2024-05-23 Thread via GitHub
pitrou closed issue #41186: [C++][Parquet][Doc] Denote PARQUET:field_id in parquet.rst URL: https://github.com/apache/arrow/issues/41186 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [I] [C++][Acero] An useless parameter for QueryContext::Init called in hash_join_benchmark [arrow]

2024-05-23 Thread via GitHub
pitrou closed issue #41720: [C++][Acero] An useless parameter for QueryContext::Init called in hash_join_benchmark URL: https://github.com/apache/arrow/issues/41720 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

Re: [I] [C++] Add functionality to MemoryManager for copying a slice of a buffer [arrow]

2024-05-23 Thread via GitHub
pitrou closed issue #39858: [C++] Add functionality to MemoryManager for copying a slice of a buffer URL: https://github.com/apache/arrow/issues/39858 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[I] [C++][S3] Remove GetBucketRegion hack for never AWS SDK versions [arrow]

2024-05-23 Thread via GitHub
pitrou opened a new issue, #41797: URL: https://github.com/apache/arrow/issues/41797 ### Describe the enhancement requested In https://github.com/aws/aws-sdk-cpp/issues/1885#issuecomment-2118124214 it was pointed out that the "x-amz-bucket-region" header of successful HeadBucket

[I] Deserialization as Vector{SubArray} breaks `push!` on DataFrame [arrow-julia]

2024-05-23 Thread via GitHub
maleadt opened a new issue, #506: URL: https://github.com/apache/arrow-julia/issues/506 I'm using Arrow v2.7.2 with DataFrames v1.6.1 on Julia 1.10, and am running into an issue that seems to stem from Arrow.jl deserializing my `Vector{Vector{T}}` columns as `Vector{SubArray{...}}`:

[I] [C++] [Python] Add functionality of `STSProfileCredentialsProvider` to default credentials chain for `S3FileSystem` [arrow]

2024-05-23 Thread via GitHub
fjetter opened a new issue, #41794: URL: https://github.com/apache/arrow/issues/41794 ### Describe the enhancement requested Given a typical AWS credentials setup that defines IAM roles like the following ``` # ~/.aws/config [default] region=us-east-2

Re: [I] Can get tables info with schema contain custom field type [arrow]

2024-05-23 Thread via GitHub
Curricane closed issue #41722: Can get tables info with schema contain custom field type URL: https://github.com/apache/arrow/issues/41722 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [I] [Java] fmpp-maven-plugin generates files directly under target/generated-sources [arrow]

2024-05-22 Thread via GitHub
lidavidm closed issue #41787: [Java] fmpp-maven-plugin generates files directly under target/generated-sources URL: https://github.com/apache/arrow/issues/41787 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[I] [CI][Integration][Release] RC verification script failed [arrow]

2024-05-22 Thread via GitHub
kou opened a new issue, #41792: URL: https://github.com/apache/arrow/issues/41792 ### Describe the bug, including details regarding any error messages, version, and platform. verify-rc-source-integration-linux-almalinux-8-amd64:

[I] [CI][Conda] The CondaEnvironment@1 (Conda environment) task has been deprecated since February 13, 2019 and will soon be retired [arrow]

2024-05-22 Thread via GitHub
kou opened a new issue, #41791: URL: https://github.com/apache/arrow/issues/41791 ### Describe the bug, including details regarding any error messages, version, and platform. conda-linux-aarch64-cpu-py3:

Re: [I] [C++] Make git-dependent preprocessor definitions internal [arrow]

2024-05-22 Thread via GitHub
kou closed issue #41783: [C++] Make git-dependent preprocessor definitions internal URL: https://github.com/apache/arrow/issues/41783 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[I] [Java] fmpp-maven-plugin generates files directly under target/generated-sources [arrow]

2024-05-22 Thread via GitHub
laurentgo opened a new issue, #41787: URL: https://github.com/apache/arrow/issues/41787 ### Describe the bug, including details regarding any error messages, version, and platform. `fmpp-plugin-maven` is used in `arrow-vector` module to generate source files before the compilation

Re: [I] [CI][Python] AMD64 Conda Java C Data Interface Integration Failure building PyArrow trying to use PYARROW_PARQUET [arrow]

2024-05-22 Thread via GitHub
kou closed issue #41725: [CI][Python] AMD64 Conda Java C Data Interface Integration Failure building PyArrow trying to use PYARROW_PARQUET URL: https://github.com/apache/arrow/issues/41725 -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[I] Add option to disable exact matches optional in join_asof [arrow]

2024-05-22 Thread via GitHub
0x26res opened a new issue, #41786: URL: https://github.com/apache/arrow/issues/41786 ### Describe the enhancement requested I would like to do a `join_asof` that would exclude exact matches. This is supported in pandas

[I] Mismatch between package version and library version in naming [arrow]

2024-05-22 Thread via GitHub
daeden opened a new issue, #41784: URL: https://github.com/apache/arrow/issues/41784 ### Describe the bug, including details regarding any error messages, version, and platform. **Version**: 16.1.0 **Platform**: Details about the operating system or environment where the bug

Re: [I] [C++][Parquet] Thrift: generate template method to accelerate reading thrift [arrow]

2024-05-22 Thread via GitHub
pitrou closed issue #41702: [C++][Parquet] Thrift: generate template method to accelerate reading thrift URL: https://github.com/apache/arrow/issues/41702 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[I] ADBC Python Postgres - Stuck connections to the database [arrow-adbc]

2024-05-22 Thread via GitHub
gaspardc-met opened a new issue, #1881: URL: https://github.com/apache/arrow-adbc/issues/1881 ### What happened? Context before the bug (working): - Postgres database on Kubernetes with several tables - 4 services (webapp, machine learning inference, and FastAPI backend APIs)

Re: [I] [C++][Parquet] Minor: moving EncodedStats by default rather than copying [arrow]

2024-05-22 Thread via GitHub
mapleFU closed issue #41726: [C++][Parquet] Minor: moving EncodedStats by default rather than copying URL: https://github.com/apache/arrow/issues/41726 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[I] [C++] Make git-dependent preprocessor definitions internal [arrow]

2024-05-22 Thread via GitHub
pitrou opened a new issue, #41783: URL: https://github.com/apache/arrow/issues/41783 ### Describe the enhancement requested The `ARROW_GIT_ID` and `ARROW_GIT_DESCRIPTION` preprocessor variables are currently exposed in `arrow/util/config.h`, and included from `arrow/config.h`. This

Re: [I] ParquetDataset object fails with a .read() method due to hive partition schema columns. [arrow]

2024-05-22 Thread via GitHub
j0bekt01 closed issue #41779: ParquetDataset object fails with a .read() method due to hive partition schema columns. URL: https://github.com/apache/arrow/issues/41779 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[I] [Java][Flight] Flight SQL tests are flaky [arrow]

2024-05-22 Thread via GitHub
laurentgo opened a new issue, #41782: URL: https://github.com/apache/arrow/issues/41782 ### Describe the bug, including details regarding any error messages, version, and platform. Several test failures in `flight-sql` module have been observed in multiple job executions: -

[I] [C++][Flight] Flight benchmark doesn't work anymore [arrow]

2024-05-22 Thread via GitHub
pitrou opened a new issue, #41780: URL: https://github.com/apache/arrow/issues/41780 ### Describe the bug, including details regarding any error messages, version, and platform. On my local build: ```console $ /build/build-release/relwithdebinfo/arrow-flight-benchmark

[I] ParquetDataset object fails with a .read() method due to hive partition schema columns. [arrow]

2024-05-22 Thread via GitHub
j0bekt01 opened a new issue, #41779: URL: https://github.com/apache/arrow/issues/41779 ### Describe the bug, including details regarding any error messages, version, and platform. I'm trying to read parquet files from S3 that have a Hive partition

Re: [I] [C++][Parquet] Add file metadata read/write benchmark [arrow]

2024-05-22 Thread via GitHub
pitrou closed issue #41760: [C++][Parquet] Add file metadata read/write benchmark URL: https://github.com/apache/arrow/issues/41760 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[I] pyarrow.fs.HadoopFileSystem Usage Problems [arrow]

2024-05-22 Thread via GitHub
deep826 opened a new issue, #41777: URL: https://github.com/apache/arrow/issues/41777 ### Describe the usage question you have. Please include as many useful details as possible. hi, I use pyarrow.fs.HadoopFileSystem client to interact with hdfs. And I write some bytes to a

[I] Wrong Length value in the example of ListView in Columnar specification document [arrow]

2024-05-22 Thread via GitHub
Jagdish-Motwani opened a new issue, #41774: URL: https://github.com/apache/arrow/issues/41774 ### Describe the bug, including details regarding any error messages, version, and platform. In the example Layout: ``ListView`` Array with 5 elements, the length is specified as 4.

Re: [I] [Python][Parquet] Documentation to parquet.write_table should be updated for new byte_stream_split encoding options [arrow]

2024-05-22 Thread via GitHub
jorisvandenbossche closed issue #41748: [Python][Parquet] Documentation to parquet.write_table should be updated for new byte_stream_split encoding options URL: https://github.com/apache/arrow/issues/41748 -- This is an automated message from the Apache Git Service. To respond to the

[I] [C++] Consuming or closing a RecordBatchReader created from a Dataset Scanner does not close underlying files [arrow]

2024-05-21 Thread via GitHub
adamreeve opened a new issue, #41771: URL: https://github.com/apache/arrow/issues/41771 ### Describe the bug, including details regarding any error messages, version, and platform. Code to reproduce as a unit test that I added to `cpp/src/arrow/dataset/dataset_test.cc`, which logs

[I] [CI][GLib] Suppress "`unlink': Permission denied" warnings in tests on Windows [arrow]

2024-05-21 Thread via GitHub
kou opened a new issue, #41770: URL: https://github.com/apache/arrow/issues/41770 ### Describe the enhancement requested https://github.com/apache/arrow/actions/runs/9183539981/job/25254413025#step:12:83 ```text test/run-test.rb: warning: Exception in finalizer #>

[I] [Java] Rework how Java cookbooks are developed and built [arrow-cookbook]

2024-05-21 Thread via GitHub
amoeba opened a new issue, #351: URL: https://github.com/apache/arrow-cookbook/issues/351 In https://github.com/apache/arrow-cookbook/pull/350#issuecomment-2121850653 it was pointed out that the way the Java cookbooks work could be improved quite a bit. We might consider two more recent

Re: [I] [GLib] Separate version macros for each GLib library [arrow]

2024-05-21 Thread via GitHub
kou closed issue #41681: [GLib] Separate version macros for each GLib library URL: https://github.com/apache/arrow/issues/41681 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[I] [C++][Parquet] `parquet::arrow::FileWriter` does not propagate schema-level metadata when `ArrowWriterProperties::store_schema` is false [arrow]

2024-05-21 Thread via GitHub
TheNeuralBit opened a new issue, #41766: URL: https://github.com/apache/arrow/issues/41766 ### Describe the bug, including details regarding any error messages, version, and platform. When `store_schema` is true the `FileWriter` first copies any existing metadata before storing the

[I] [Parquet][C++] Behaviour of unknown logical type when encountered in Parquet reader [arrow]

2024-05-21 Thread via GitHub
paleolimbot opened a new issue, #41764: URL: https://github.com/apache/arrow/issues/41764 ### Describe the enhancement requested In https://github.com/apache/parquet-format/pull/240 there is concern regarding the ability to add a new logical type (in this case GEOMETRY) in a

Re: [I] [C++] Import/Export ArrowDeviceArrayStream [arrow]

2024-05-21 Thread via GitHub
zeroshade closed issue #40078: [C++] Import/Export ArrowDeviceArrayStream URL: https://github.com/apache/arrow/issues/40078 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [I] [R][CI] CRAN-style openssl not being picked up [arrow]

2024-05-21 Thread via GitHub
assignUser closed issue #41426: [R][CI] CRAN-style openssl not being picked up URL: https://github.com/apache/arrow/issues/41426 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [I] [C++] Add a benchmark for grouper for preventing performance regression [arrow]

2024-05-21 Thread via GitHub
pitrou closed issue #41035: [C++] Add a benchmark for grouper for preventing performance regression URL: https://github.com/apache/arrow/issues/41035 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [I] adbc_ingest() is dropping rows in Snowflake [arrow-adbc]

2024-05-21 Thread via GitHub
zeroshade closed issue #1847: adbc_ingest() is dropping rows in Snowflake URL: https://github.com/apache/arrow-adbc/issues/1847 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[I] [C++][Parquet] Add file metadata read/write benchmark [arrow]

2024-05-21 Thread via GitHub
pitrou opened a new issue, #41760: URL: https://github.com/apache/arrow/issues/41760 ### Describe the enhancement requested Following the discussions on the Parquet ML (see [this thread](https://lists.apache.org/thread/5jyhzkwyrjk9z52g0b49g31ygnz73gxo) and [this

[I] [C++][Python] Segfault when reading a RecordBatchReader constructed from an Arrow Table [arrow]

2024-05-21 Thread via GitHub
Mytherin opened a new issue, #41758: URL: https://github.com/apache/arrow/issues/41758 ### Describe the bug, including details regarding any error messages, version, and platform. The following code snippet crashes for me when running PyArrow 16.1 in Python 3.12: ```py

Re: [I] [Python] Expose bit_width and byte_width on Python Extension types with underlying fixed type [arrow]

2024-05-21 Thread via GitHub
jorisvandenbossche closed issue #41389: [Python] Expose bit_width and byte_width on Python Extension types with underlying fixed type URL: https://github.com/apache/arrow/issues/41389 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

  1   2   3   4   5   6   7   8   9   10   >