issues
Thread
Date
Earlier messages
Messages by Thread
Re: [I] pyarrow's GcsFileSystem fails with "SSL peer certificate or SSH remote key was not OK" [arrow]
via GitHub
[I] [Python][Packaging] Manylinux image does not support Python 3.13 free-threaded anymore as it was experimental [arrow]
via GitHub
[I] [Parquet] Cannot write LARGE_LIST [arrow-go]
via GitHub
Re: [I] [Discussion][C++][Statistics] Should ApproximateEquals or Equals be used to handle floating-point precision errors in arrow::ArrayStatistics? [arrow]
via GitHub
[I] [C++] Access mimalloc through dynamically-resolved symbols [arrow]
via GitHub
Re: [I] [Python] missing non-free-threaded manylinux nightly wheels [arrow]
via GitHub
Re: [I] [C++] Use C++17's nested namespaces [arrow]
via GitHub
[I] [C++][Compute] Remove redundant cast kernels [arrow]
via GitHub
[I] [C++][IPC] Possible int64 overflow in ReadSparseCSXIndex buffer size validation [arrow]
via GitHub
[I] [C++][ORC] Signed integer overflow when converting far-future timestamps [arrow]
via GitHub
[I] [C++][Gandiva] Buffer overrun in to_hex(int32) and to_hex(int64) [arrow]
via GitHub
[I] AzureFileSystem can't use GetProperties with User Delegation SAS tokens [arrow]
via GitHub
Re: [I] [C++] Reduce number of internal APIs that define default_memory_pool() as default argument value [arrow]
via GitHub
Re: [I] [C++] Add option to not create parent directory with S3 `delete_file` [arrow]
via GitHub
[I] [Python] Add tests for regular replace_with_mask kernel usage [arrow]
via GitHub
Re: [I] [C++][Acero] If `initial_task` in Acero plan throws an exception the plan hangs indefinitely [arrow]
via GitHub
Re: [I] docs: Is ADBC support in DuckDB still in progress? [arrow-adbc]
via GitHub
[I] AWS event loop thread pool size is out of control again [arrow]
via GitHub
Re: [I] [Python] AWS event loop thread pool size is out of control again [arrow]
via GitHub
[I] [C++] Out-of-bounds buffer slice reading a Feather V1 file with inconsistent column metadata [arrow]
via GitHub
Re: [I] [Python] Support joinining tables with null columns [arrow]
via GitHub
[I] [Packaging] Debian forky dropped libboost-system-dev causing apt install error [arrow]
via GitHub
Re: [I] [Packaging] Debian forky dropped libboost-system-dev causing apt install error [arrow]
via GitHub
[I] [C++] Tensor::Make skips buffer-size check for row-major (implicit strides) tensors [arrow]
via GitHub
Re: [I] [C++] Tensor::Make skips buffer-size check for row-major (implicit strides) tensors [arrow]
via GitHub
[I] All data is null for one column in one row group for parquet, arrow will encode with dictionary, while parquet-java use encode plain [arrow]
via GitHub
[I] [R] Implement bindings for stringr's combining strings functions [arrow]
via GitHub
Re: [I] [C++] Add XSimd Support to Meson Configuration [arrow]
via GitHub
Re: [I] [C++][Python] `replace_with_mask` crashes when null type inputs are used [arrow]
via GitHub
[I] [C++] Signed integer overflow in Decimal FromString exponent handling [arrow]
via GitHub
[I] [C++][IPC] ReadSparseCOOIndex does not validate the size of the indices buffer [arrow]
via GitHub
[I] [CI][C++] Bump vcpkg version [arrow]
via GitHub
Re: [I] [CI][C++] Bump vcpkg version [arrow]
via GitHub
[I] Audit WKB parsing size computations for 32-bit overflow-before-bounds-check patterns [arrow]
via GitHub
Re: [I] [Python] Make pyarrow installable in Jupyterlite [arrow]
via GitHub
[I] [CI][C++] Improve GitHub Actions ccache reuse by saving compiler caches only from main [arrow]
via GitHub
Re: [I] [C++] Add Compute Module Support to Meson Configuration [arrow]
via GitHub
[I] Table.group_by(...).aggregate([("flag", "any")])` returns incorrect `True` on a sliced Boolean array with nulls [arrow]
via GitHub
[I] [Python][CI] test_string_to_tzinfo_pytz_fallback fails on verify-rc-source-windows [arrow]
via GitHub
Re: [I] [Python][CI] test_string_to_tzinfo_pytz_fallback fails on verify-rc-source-windows [arrow]
via GitHub
[I] arrow/scalar: *scalar.Extension does not implement Release()/Retain(), leaking storage through compute.ScalarDatum.Release() [arrow-go]
via GitHub
[I] [Python][CI] test_table_uses_memory_pool flaky on macOS 14 job [arrow]
via GitHub
Re: [I] [Python][Parquet] read_schema drops extension types (UUID returned as fixed_size_binary[16]) [arrow]
via GitHub
Re: [I] [C++] Allow scanner to assert an ordering and/or support implicit ordering [arrow]
via GitHub
Re: [I] [Parquet][Python] API to decrypt parquet file using one DEK and no metadata [arrow]
via GitHub
[I] [Python] Expose Expression.field_refs() to enumerate referenced fields [arrow]
via GitHub
[I] Add `arrow.range` canonical extension type for bounded ranges [arrow]
via GitHub
Re: [I] [C++][Acero] record_batch_reader_source does not support `select * limit 3` [arrow]
via GitHub
Re: [I] r/adbcsnowflake: Snowflake driver logs at info level causing CMD check fail [arrow-adbc]
via GitHub
[I] parquet/file: WriteBatchSpaced panics escape the API and silently discards commit-write errors [arrow-go]
via GitHub
[I] [C++][Parquet] SIMD-accelerate the SBBF probe in BlockSplitBloomFilter::FindHash [arrow]
via GitHub
Re: [I] [Docs][CI] Enable version switcher during local and PR preview builds [arrow]
via GitHub
Re: [I] [C++] Pivot Support in Acero [arrow]
via GitHub
[I] [Dev] Enable auto GitHub Copilot review [arrow]
via GitHub
Re: [I] [Dev] Enable auto GitHub Copilot review [arrow]
via GitHub
Re: [I] GeoArrow and GeoParquet [arrow-go]
via GitHub
Re: [I] go/adbc/driver/snowflake: NUMBER(n, 0) values get truncated [arrow-adbc]
via GitHub
[I] [CI][C++]: Resolve the macos-cran nightly failures [arrow]
via GitHub
[I] The "copylocks" warning is present in the file "numeric_generic.go". [arrow-go]
via GitHub
Re: [I] The "copylocks" warning is present in the file "numeric_generic.go". [arrow-go]
via GitHub
[I] [CI][Release] Windows verification jobs to set conda environment [arrow]
via GitHub
Re: [I] [CI][Release] Windows verification jobs to set conda environment [arrow]
via GitHub
Re: [I] go/adbc/driver/snowflake: missing xdbc_column_size for binary columns in GetObjects [arrow-adbc]
via GitHub
Re: [I] Support arrow list and large_list dtypes when ingesting to snowflake [arrow-adbc]
via GitHub
[I] [R] Import of S3 methods from bit64 [arrow]
via GitHub
Re: [I] [R] Import of S3 methods from bit64 [arrow]
via GitHub
Re: [I] [C++] Allow more Flatbuffers versions to compile Arrow [arrow]
via GitHub
Re: [I] [C++] Error linking the util/cancel.h [arrow]
via GitHub
[I] [Python] Table.from_pylist on ExtensionType column with list_ storage crashes when values exceed int32 offsets [arrow]
via GitHub
Re: [I] [Python] Table.from_pylist on ExtensionType column with list_ storage crashes when values exceed int32 offsets [arrow]
via GitHub
[I] [C++][Parquet] Undefined behavior in `TypedColumnWriterImpl::UpdateLevelHistogram` [arrow]
via GitHub
Re: [I] [C++][Parquet] Undefined behavior in `TypedColumnWriterImpl::UpdateLevelHistogram` [arrow]
via GitHub
[I] R - FinalizeS3 segfault [arrow]
via GitHub
[I] [C++][Parquet] Add bloom filter folding to automatically size SBBF filters [arrow]
via GitHub
Re: [I] [C++][FlightRPC] ODBC macOS `.pkg` Installer [arrow]
via GitHub
[I] [C++] Use FetchContent for RapidJSON [arrow]
via GitHub
Re: [I] [C++] Use FetchContent for RapidJSON [arrow]
via GitHub
Re: [I] [C++] Address "Compatibility with CMake < 3.5 has been removed" error [arrow]
via GitHub
Re: [I] Python Snowflake Driver has incorrect documentation of adbc.rpc.result_queue_size [arrow-adbc]
via GitHub
[I] [CI][Python] Revert pinning miniforge once mamba solver issue is resolved [arrow]
via GitHub
Re: [I] snowflake: `adbc_ingest` will fail with "double free" segmentation fault if record batch schema is incorrect [arrow-adbc]
via GitHub
[I] [C++] Some builds fail to build due to gRPC failures [arrow]
via GitHub
Re: [I] [C++] Some builds fail to build due to gRPC failures [arrow]
via GitHub
[I] [C++][FlightRPC] <grpcpp/version_info.h> not found [arrow]
via GitHub
Re: [I] [C++][FlightRPC] <grpcpp/version_info.h> not found [arrow]
via GitHub
[I] [R] Support for Tensor class [arrow]
via GitHub
Re: [I] [R] Support for Tensor class [arrow]
via GitHub
[I] [R] open_dataset with root directory inaccessible? [arrow]
via GitHub
Re: [I] [R] open_dataset with root directory inaccessible? [arrow]
via GitHub
[I] [C++] Implement HTTP and FTP file systems [arrow]
via GitHub
Re: [I] [C++] Implement HTTP and FTP file systems [arrow]
via GitHub
Re: [I] [C++] Implement HTTP and FTP file systems [arrow]
via GitHub
Re: [I] [C++] Implement HTTP and FTP file systems [arrow]
via GitHub
[I] csharp: driver manager incorrectly loads and validates manifests [arrow-adbc]
via GitHub
Re: [I] csharp: driver manager incorrectly loads and validates manifests [arrow-adbc]
via GitHub
[I] [CI][Python] AMD64 Conda Python 3.10 Pandas 1.3.4 job consistently timing out [arrow]
via GitHub
Re: [I] [CI][Python] AMD64 Conda Python 3.10 Pandas 1.3.4 job consistently timing out [arrow]
via GitHub
[I] csharp: literal strings not supported by driver manager toml parser [arrow-adbc]
via GitHub
Re: [I] csharp: literal strings not supported by driver manager toml parser [arrow-adbc]
via GitHub
[I] csharp: driver manager missing mac search path [arrow-adbc]
via GitHub
Re: [I] csharp: driver manager missing mac search path [arrow-adbc]
via GitHub
[I] [R][Wasm] Fix Error: thread constructor failed: Not supported under Wasm [arrow]
via GitHub
[I] [CI] Drop obsolete
[email protected]
brew uninstall from cpp.yml and python.yml [arrow]
via GitHub
Re: [I] [CI] Drop obsolete
[email protected]
brew uninstall from cpp.yml and python.yml [arrow]
via GitHub
Re: [I] [C#] BitUtility.cs performance improvement [arrow]
via GitHub
[I] BitUtility.cs performance enhancement [arrow-dotnet]
via GitHub
[I] Fix Unity build ordering issue [arrow]
via GitHub
Re: [I] [C++][FlightRPC] Fix Unity build ordering issue [arrow]
via GitHub
[I] parquet/file: NewParquetWriter panics on transient sink.Write errors during file initialization [arrow-go]
via GitHub
Re: [I] parquet/file: NewParquetWriter panics on transient sink.Write errors during file initialization [arrow-go]
via GitHub
Re: [I] pyarrow tranport Tensor type data to java arrow flight server [arrow]
via GitHub
Re: [I] pyarrow tranport Tensor type data to java arrow flight server [arrow]
via GitHub
[I] [Release][Packaging] Add Reproducible Builds for RPM based packages [arrow]
via GitHub
[I] [Release][Packaging] Add Reproducible build for Debian Packages [arrow]
via GitHub
Re: [I] [Release][Packaging] Add Reproducible build for Debian Packages [arrow]
via GitHub
[I] [CI][Packaging] Use random build directory path for Debian Packages instead of fixed one [arrow]
via GitHub
[I] [Java] DictionaryEncoder doesn't crash when decoding index outside of Dictionary [arrow-java]
via GitHub
[I] Integration tests failing with Rust producing and .NET consuming binary views [arrow-dotnet]
via GitHub
Re: [I] Integration tests failing with Rust producing and .NET consuming binary views [arrow-dotnet]
via GitHub
[I] [C++][Gandiva] Duplicate function aliases with same parameters [arrow]
via GitHub
[I] [postgresql] have a way of telling the driver to avoid `ROLLBACK AND CHAIN` [arrow-adbc]
via GitHub
Re: [I] unsupported cast to string_view from utf8 in v18 [arrow-go]
via GitHub
[I] c/driver/postgresql: adbc_ingest silently misaligns list/large_list/fixed_size_list rows when the source Arrow array is sliced (parent.offset > 0) [arrow-adbc]
via GitHub
Re: [I] c/driver/postgresql: adbc_ingest silently misaligns list/large_list/fixed_size_list rows when the source Arrow array is sliced (parent.offset > 0) [arrow-adbc]
via GitHub
[I] [Docs][C++][Parquet] Add API reference [arrow]
via GitHub
[I] [R][Packaging] Support building the R package under Emscripten [arrow]
via GitHub
Re: [I] [R][Packaging] Support building the R package under Emscripten [arrow]
via GitHub
[I] docker-amd64-ubuntu-memcheck verify job is failing [arrow-nanoarrow]
via GitHub
[I] New warning on gcc16 [arrow-nanoarrow]
via GitHub
Re: [I] New warning on gcc16 [arrow-nanoarrow]
via GitHub
Re: [I] [C++] Output batch size control in ExecPlan [arrow]
via GitHub
[I] [C++][Gandiva] Add 2 arg REGEXP_EXTRACT function [arrow]
via GitHub
[I] Support `expr.IntervalYearToMonthLiteral` in `literalToDatum` [arrow-go]
via GitHub
Re: [I] Support `expr.IntervalYearToMonthLiteral` in `literalToDatum` [arrow-go]
via GitHub
Re: [I] [C++] Improve error handling for hash table merges [arrow]
via GitHub
[I] [Java] Improve VectorSchemaRoot.getVector(String name) lookup performance [arrow-java]
via GitHub
[I] Explicitly providing CMAKE_LIBTOOL does not work on MacOS [arrow]
via GitHub
Re: [I] [C++] Explicitly providing CMAKE_LIBTOOL does not work on MacOS [arrow]
via GitHub
Re: [I] [C++] DictionaryArray::dictionary() is not thread safe [arrow]
via GitHub
Re: [I] [Document] Why int32() offset type is used for DenseUnionArray? [arrow]
via GitHub
Re: [I] [C++] CSV reader: Ability to not infer column types. [arrow]
via GitHub
[I] Fix remaining overflow and negative length handling issues in Gandiva string functions [arrow]
via GitHub
[I] Azure with SAS Keys [arrow]
via GitHub
[I] [GLib] Enable tests for custom extension data type [arrow]
via GitHub
[I] [Python][CI] Raise oldest NumPy wheel-test requirement to a patched release [arrow]
via GitHub
Re: [I] [Python][CI] Raise oldest NumPy wheel-test requirement to a patched release [arrow]
via GitHub
[I] [C++] IPC file fuzzer fails when footer schema has differing endianness [arrow]
via GitHub
[I] Question regarding Parquet Page Index: Why enable it during write if it's not utilized during read? [arrow-go]
via GitHub
Re: [I] [C++][Acero] Window Functions add helper classes for window aggregates and distinct aggregates [arrow]
via GitHub
Re: [I] [C++][Acero] Window Functions add helper classes for quantiles [arrow]
via GitHub
Re: [I] [C++][Acero] Add Window Functions exec node [arrow]
via GitHub
Re: [I] [C++/Python] Add support for S3 Bucket Versioning [arrow]
via GitHub
[I] [Avro] hamba/avro is abandoned [arrow-go]
via GitHub
[I] [Python] Improve Extension Types Support in PyArrow (umbrella issue) [arrow]
via GitHub
Re: [I] [Python] Subclassing the PyExtensionType and getting it's bit_width attribute returns Non-fixed width type ValueError [arrow]
via GitHub
[I] The annotation is incorrect. It should be 1M. [arrow-go]
via GitHub
Re: [I] The annotation is incorrect. It should be 1M. [arrow-go]
via GitHub
[I] [C++][Parquet] Avoid unbounded temp alloc in BYTE_STREAM_SPLIT decoder [arrow]
via GitHub
Re: [I] [C++] Support optional arguments in aggregation function mapping in the Substrait consumer. [arrow]
via GitHub
Re: [I] [R] Differing results in log bindings [arrow]
via GitHub
Re: [I] [Python][Dev] Document the process to run numpydoc checks [arrow]
via GitHub
Re: [I] [R] Implement asof join [arrow]
via GitHub
Re: [I] Clean up how the CSV reader handles the first buffer [arrow]
via GitHub
Re: [I] [R] Tidy up the pkgdown articles site index [arrow]
via GitHub
Re: [I] [R] arrow_eval: do we need both nse_funcs and .cache$functions? [arrow]
via GitHub
Re: [I] [C++] [Python] Major performance improvements to CSV reading from S3 [arrow]
via GitHub
Re: [I] [R] Table viewer for knitr/notebooks [arrow]
via GitHub
[I] [C++][Dataset] std::bad_weak_ptr in multi-threaded writer tests on MinGW gcc-16 [arrow]
via GitHub
Re: [I] [C++][CI] MinGW GCC 16.1 regression - shared_ptr corruption in multi-threaded tests [arrow]
via GitHub
[I] Managing ownership in VectorSchemaRoot#addVector, recent changes miss the main fault. [arrow-java]
via GitHub
Re: [I] [R] [Docs] Improve (or really actually document) our Python bridge documentation [arrow]
via GitHub
Re: [I] [C++] Fetch Node Substrait Integration [arrow]
via GitHub
[I] [C++][Parquet] Reading dictionary encoded boolean throws NYI [arrow]
via GitHub
Re: [I] [C++] Substarit End-To-End Tests for Relations [arrow]
via GitHub
Re: [I] [R] Allow unrecognized R expressions to be callable as compute::Functions [arrow]
via GitHub
Re: [I] [R] Add vignette on ExecPlans and how they work [arrow]
via GitHub
Re: [I] [Python] Memory kept after del and pool.released_unused() [arrow]
via GitHub
Re: [I] Does arrow support access S3 based on 'path-style'? [arrow]
via GitHub
Re: [I] [C++] RecordBatch Make() with Arrow Arrays could infer length [arrow]
via GitHub
Re: [I] [C++][Parquet] Support nested data conversions for chunked array [arrow]
via GitHub
[I] [GLib] garrow_data_type_new_raw segfaults on arrow::extension::OpaqueType and any non-GLib ExtensionType (ADBC PostgreSQL NUMERIC) [arrow]
via GitHub
Re: [I] [GLib] garrow_data_type_new_raw segfaults on arrow::extension::OpaqueType and any non-GLib ExtensionType (ADBC PostgreSQL NUMERIC) [arrow]
via GitHub
[I] [C++] Uncontrolled Memory Allocation (OOM) in Parquet Delta decoders [arrow]
via GitHub
[I] [C++][Gandiva] Use timegm in date_time_test utilities to avoid DST-dependent behavior [arrow]
via GitHub
Re: [I] [Python] `compute.count_distinct` not implemented for `extension<arrow.uuid>` and `extension<arrow.json>` [arrow]
via GitHub
Re: [I] [Python] `compute.min_max` is not implemented for `extension<arrow.json>` [arrow]
via GitHub
[I] [Bug] NewIntXStatistics factories unconditionally set hasDistinctCount=true, causing distinct_count=0 to always appear in Parquet output [arrow-go]
via GitHub
Re: [I] [Bug] NewIntXStatistics factories unconditionally set hasDistinctCount=true, causing distinct_count=0 to always appear in Parquet output [arrow-go]
via GitHub
[I] [C++] HeadBucket called in S3FS breaking IAM scoped prefixes [arrow]
via GitHub
Re: [I] [R] Implement typeof() in Arrow dplyr queries [arrow]
via GitHub
Re: [I] [R] Implement as.integer and as.numeric for timestamp types etc. in Arrow dplyr queries [arrow]
via GitHub
Re: [I] [R]: Lack of `assume_timezone` binding [arrow]
via GitHub
Re: [I] [C++] Move Parquet APIs to use Result instead of Status [arrow]
via GitHub
Re: [I] [C++][Python][Doc] Document that order is not preserved when writing dataset with use_threads=True [arrow]
via GitHub
Re: [I] [C++][Python] SEGFAULT when casting FixedSizeTensorArray to storage type then back to FixedSizeTensorArray [arrow]
via GitHub
Re: [I] [Python] ParquetWriter use_compliant_nested_type=True does not preserve ExtensionArray when reading back [arrow]
via GitHub
Re: [I] [Python] `pyarrow.Table.to_pandas` creates Index instead of PeriodIndex [arrow]
via GitHub
[I] [C++][CI] gcc-16 MinGW failures - remaining fixes (follow-up to #49930) [arrow]
via GitHub
Re: [I] [C++][CI] gcc-16 MinGW failures - remaining fixes (follow-up to #49930) [arrow]
via GitHub
[I] [Format] Better document IPC file and stream equivalence [arrow]
via GitHub
Earlier messages