github
Thread
Date
Earlier messages
Later messages
Messages by Date
2026/02/18
Re: [I] [Python] Feature to append row groups to existing parquet file [arrow]
via GitHub
2026/02/18
Re: [I] [Python] For extension types, compute kernels should default to storage types? [arrow]
via GitHub
2026/02/18
Re: [I] [R] Allow setting field metadata [arrow]
via GitHub
2026/02/18
Re: [I] [R] User experience improvements [arrow]
via GitHub
2026/02/18
Re: [I] [C++][R][Python] Use ISO 8601 in character representations of timestamps? [arrow]
via GitHub
2026/02/18
Re: [I] [R] GCS/S3 Improvements [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Docs] Add examples of Parquet TypedColumnWriter to user guide [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Use saved pandas metadata to determine default timestamp_as_object in to_pandas() [arrow]
via GitHub
2026/02/18
Re: [I] [C++][CI] Add Substrait integration testing to CI [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Compute] Support KEEP_NULL option for compute::Filter [arrow]
via GitHub
2026/02/18
Re: [I] [R] Make it more obvious how to read in a Parquet file with a different schema to the inferred one [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Populate Substrait producer version from cmake config variables [arrow]
via GitHub
2026/02/18
Re: [I] [Python] registering new data formats [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Provide more informative error when (CSV/JSON) parsing fails [arrow]
via GitHub
2026/02/18
Re: [I] [R] Implement functionality to read fixed-width files [arrow]
via GitHub
2026/02/18
Re: [I] [Python][Packaging] Simplify Numpy resolution on python/requirements-wheel-test.txt [arrow]
via GitHub
2026/02/18
Re: [I] [Docs][Release] Update verification information for CentOS7 [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Vector kernel for "intersecting" two arrays (all common elements) [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Acero buffer alignment [arrow]
via GitHub
2026/02/18
Re: [I] Dictionary Style array for Keywords or Tags [arrow]
via GitHub
2026/02/18
Re: [I] Remove ad-hoc substrait version after substrait#342 [arrow]
via GitHub
2026/02/18
Re: [I] [Dev][CI] Make nightly group as an alias of nightly-* [arrow]
via GitHub
2026/02/18
Re: [I] Allow ConvertOptions.timestamp_parsers for date types [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add a "list_contains" kernel [arrow]
via GitHub
2026/02/18
Re: [I] Check for broken links on generated sites [arrow]
via GitHub
2026/02/18
Re: [I] Change the way how arrow reads IPC buffered files [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Python] Custom streaming data providers in {{run_query}} [arrow]
via GitHub
2026/02/18
Re: [I] [Archery][CI] Refactor git dependencies used on archery to be more consistent [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Substrait consumer should reject plans containing options that it doesn't recognize [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Use BUILD_TESTING=OFF for abseil-cpp [arrow]
via GitHub
2026/02/18
Re: [I] [Format] archery lint for cmake should show error details [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add validation to ExecBatch [arrow]
via GitHub
2026/02/18
Re: [I] [Python][C++] Add ability for python to specify sink node when running Substrait [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Provide a way to specify the type of a subset of columns for from_pandas [arrow]
via GitHub
2026/02/18
Re: [I] [R] native type checking in where() [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Always use optimization flags for SIMD related codes [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Implement casting to dictionary type (dictionary_encode as a cast) [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add read/write optimization for pyarrow.fs.S3FileSystem [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Dataset] Optimize Parquet column projection for subset of nested field [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Change the base directory for PyArrow CPP header files [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Use ExtensionScalar.as_py() as fallback in ExtensionArray to_pandas? [arrow]
via GitHub
2026/02/18
Re: [I] [R] arrow_eval user-defined generic functions [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Allow disabling more components [arrow]
via GitHub
2026/02/18
Re: [I] Add Intel®-IAA/QPL-based Parquet RLE Decode [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Consider dictionary arrays for special fragment fields [arrow]
via GitHub
2026/02/18
Re: [I] Built-in GRPC health checks in FlightServerBase [arrow]
via GitHub
2026/02/18
Re: [I] Writing Arrow Files using C#. [arrow]
via GitHub
2026/02/18
Re: [I] [Packaging][Conan] Add back ARROW_GCS to conanfile.py [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Python] Allow an ExtensionType to register or implement custom casts [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Stabilize Parquet ArrowReaderProperties [arrow]
via GitHub
2026/02/18
Re: [I] [Python] ExtensionArray.__getitem__ is not called if called from StructArray [arrow]
via GitHub
2026/02/18
Re: [I] [c++][compute]Is there any other way to use Join besides Acero? [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Docs] Improve C++ Cookbook [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] Improve parquet reading performance for String/Binary type based on Buffer operations instead of BinaryArrayBuilder [arrow]
via GitHub
2026/02/18
Re: [I] Implement zip() [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add ordering information to exec batches [arrow]
via GitHub
2026/02/18
Re: [I] [R] Simultaneous read-write operations causing file corruption. [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Implement arithmetic kernels on List(number) [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Improve error message when all values in a column are null in a parquet partition [arrow]
via GitHub
2026/02/18
Re: [I] [Website] Add Zulip details to the Communication page [arrow]
via GitHub
2026/02/18
Re: [I] [C++] AsofJoinNode 128-bit hashing [arrow]
via GitHub
2026/02/18
Re: [I] [R] Add link to cookbook from README (getting started vignette) [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Gandiva] Support int64 seed for udf random. [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Expose jemalloc statistics for logging [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Gandiva] Add parser frontend for Gandiva [arrow]
via GitHub
2026/02/18
Re: [I] [R] Pre-render vignettes [arrow]
via GitHub
2026/02/18
Re: [I] [Dev][CI] Add overview of all tasks (including passing) on crossbow dashboard [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Acero] Window Functions add helper classes for frame calculation [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add ScanOptions to support projection and filter in ToProto Read [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Allow Bazel to pass custom __DATE__, __TIME__, and __TIMESTAMP__ flags to Arrow's toolchain [arrow]
via GitHub
2026/02/18
Re: [I] [R] Streamline some C++ calls [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Order-aware non-sink Fetch Node [arrow]
via GitHub
2026/02/18
Re: [I] Relax / extend type checking for pyarrow array creation [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Acero] Window Functions add helper classes for ranking [arrow]
via GitHub
2026/02/18
Re: [I] [R] Add binding for random() function [arrow]
via GitHub
2026/02/18
Re: [I] [C++] ReadRangeCache should not retain data after read [arrow]
via GitHub
2026/02/18
Re: [I] [R] Refactor build_expr and eval_array_expression to remove special casing [arrow]
via GitHub
2026/02/18
Re: [I] [Dev] Refactor custom cmake functions into proper modules [arrow]
via GitHub
2026/02/18
Re: [I] [C++] FieldRef::FindAll/FindOne(DataType) improve error [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] Optimize DELTA_BINARY_PACKED encoding and decoding [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Scanner slicing large row groups leads to inefficient RAM usage [arrow]
via GitHub
2026/02/18
Re: [I] [Gandiva][Dev] Check version of OpenSSL for Gandiva [arrow]
via GitHub
2026/02/18
Re: [I] [R] Rename read_ipc_file to read_arrow_file & highlight arrow over feather [arrow]
via GitHub
2026/02/18
Re: [I] [R][Docs] Add docs on what dplyr + tidyverse functionality we support [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add opaque device id identification to InputStream [arrow]
via GitHub
2026/02/18
Re: [I] [CI][Python][Conda] Can't load Gandiva on macOS [arrow]
via GitHub
2026/02/18
Re: [I] [R] Allow all cast options to be specified [arrow]
via GitHub
2026/02/18
Re: [I] [R] Feature request: add support for saving row names [arrow]
via GitHub
2026/02/18
Re: [I] [R] [Docs] [CI] Investigate if we can auto generate Rd files in CI [arrow]
via GitHub
2026/02/18
Re: [I] [Python][Packaging] Wrong ARROW_SIMD_LEVEL=SSE4_2 on arm64 macOS wheels [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] Support Decimal from Int32/Int64 in StatisticsAsScalars [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] Speed up Parquet Writing? [arrow]
via GitHub
2026/02/18
Re: [I] [GLib][Dataset] Add GADatasetFilenamePartitioning [arrow]
via GitHub
2026/02/18
Re: [I] [R][CI] Add GitHub PAT to jobs that are reaching limit [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Raise IndexError when pa.Schema.get_field_index fails [arrow]
via GitHub
2026/02/18
Re: [I] [Parquet] Support for writing binary column in stream writer in Parquet [arrow]
via GitHub
2026/02/18
Re: [I] [Parquet][C++] More elaborate dictionary fallback for Parquet 2.0 [arrow]
via GitHub
2026/02/18
Re: [I] [Documentation] Provide guidance to contributors on getting reviews [arrow]
via GitHub
2026/02/18
Re: [I] Provide a `BinaryBuilder::AppendValues(const std::vector<std::vector<uint8_t>>&)` overload [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Add pyarrow.TableGroupBy.groups method [arrow]
via GitHub
2026/02/18
Re: [I] Would it be possible to include cmake export targets in pyarrow wheel file? [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add an option for the order by node to be stable [arrow]
via GitHub
2026/02/18
Re: [I] [CI] Create suggestions comments in lint job [arrow]
via GitHub
2026/02/18
Re: [I] [CI][conda] don't build pyarrow in r-jobs [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Document asof join [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Add pa.tuple_ DataType [arrow]
via GitHub
2026/02/18
Re: [I] [Release] Use GitHub API token in download_rc_binaries.py where available [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Weighted stat aggregations in arrow-compute [arrow]
via GitHub
2026/02/18
Re: [I] [C++][CMake] Use cpp/src/arrow/util/config.h.cmake instead of add_defintions() for ARROW_WITH_${COMPRESESION} [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Add a pyarrow.Table.aggregate function to compute aggregates against the whole table [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Simplify ExecNode contract by removing the concept of "node finished" [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Enable Substrait ReadRel Projection in Acero [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Refactor string matching kernel to be usable outside of compute [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add optional "always_transfer" parameter to MakeTransferredGenerator [arrow]
via GitHub
2026/02/18
Re: [I] [Dev] Zulip streams for GitHub issues [arrow]
via GitHub
2026/02/18
Re: [I] UnionArray.type_codes is never null [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Enable integrity check on S3 upload [arrow]
via GitHub
2026/02/18
Re: [I] [Dev] PR title check linter checkes "[COMPONENT]" existence [arrow]
via GitHub
2026/02/18
Re: [I] [Dev] Add tests for GitHub issues on merge script [arrow]
via GitHub
2026/02/18
Re: [I] [Archery] Use gh cli for artifact management [arrow]
via GitHub
2026/02/18
Re: [I] [Dev] Add dependabot configuration to remove duplicated labels [arrow]
via GitHub
2026/02/18
Re: [I] [R] pkgdown site improvements [arrow]
via GitHub
2026/02/18
[I] Add GenericByteViewArray::total_bytes_len() to sum all string lengths [arrow-rs]
via GitHub
2026/02/18
Re: [I] [Benchmarking][R] conbench is failing [arrow]
via GitHub
2026/02/18
Re: [PR] Avoid allocating a `Vec` in `StructBuilder` [arrow-rs]
via GitHub
2026/02/18
Re: [I] Avoid allocating a `Vec` in `StructBuilder` [arrow-rs]
via GitHub
2026/02/18
Re: [I] [C++] Segmentation fault in pyarrow.compute.if_else: arguments have string type but the result only fits in large_string type [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Native result set adapter for PostgreSQL / libpq [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Native client interface to SQL Server / TDS protocol [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Disable ASAN when building io-hdfs-test.cc [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Appending to streamable table file format doesn't seem to work [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Native client interface to Clickhouse [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Update the documentation about Schema & Metadata usage [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Support LTO for R [arrow]
via GitHub
2026/02/18
Re: [PR] fix: fixed trait functions clash get_date_time_part_extract_fn (#8221) [arrow-rs]
via GitHub
2026/02/18
Re: [I] [C++] Enable copy free conversion for dictionary encoded string column in ORC adapter [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Native database client for MariaDB / MySQL client protocol [arrow]
via GitHub
2026/02/18
Re: [I] [C++] parquet::arrow::FileReader::GetRecordBatchReader may not iterate through chunked columns completely [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Reading Parquet file crashes on windows - python3.8 [arrow]
via GitHub
2026/02/18
Re: [I] [C++][ORC] Enable copy free conversion for primitive types [arrow]
via GitHub
2026/02/18
Re: [I] [Python][C++] MemoryPool is destructed before deallocating its buffers leads to segfault [arrow]
via GitHub
2026/02/18
Re: [I] [Python] pyarrow deserialize return datetime.datetime [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Lose access to indices & dictionary roundtripping DictionaryArray to parquet file [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Manual dataset with timestamp partition type error [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Python] Behavior of parquet.read_table with filter and parquets containing null [arrow]
via GitHub
2026/02/18
Re: [I] [R] If pkg-config finds arrow on default search path, we don't know if it was built with ARROW_S3 [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Python] Python compute kernel tests assume C++ is built with utf8proc [arrow]
via GitHub
2026/02/18
Re: [I] [Python] pyarrow2.0.0 flight test crash on macOS [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Micro-optimize integer parsing [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Getting reference not found with ORC enabled pyarrow [arrow]
via GitHub
2026/02/18
Re: [I] [Python][Packaging] Fix Homebrew Install Python 3 NumPy not found failure [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] Timestamp ColumnDescriptor (from logical type) incorrectly showing ConvertedType as NONE [arrow]
via GitHub
2026/02/18
Re: [I] [Integration] Enable Arrow to read Parquet files from Spark 2.x with illegal nulls [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Parquet reader cannot read large strings [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Compute] Overhaul CanCast() helper function [arrow]
via GitHub
2026/02/18
Re: [I] Out-of-heap memory leaks in FlightClient.getStream [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Compilation failure in arrow/scalar.cc on Xcode 8.3.3 [arrow]
via GitHub
2026/02/18
Re: [I] [Website] Write blog post about C++ endianness compatibility [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Dict index type ALWAYS gets coerced to int32 when saving to parquet [arrow]
via GitHub
2026/02/18
Re: [I] PyArrow unable to read file with large string values [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Initial table.take(...) call takes much longer [arrow]
via GitHub
2026/02/18
Re: [I] [C++] CSV streaming reader doesn't handle cancellation correctly [arrow]
via GitHub
2026/02/18
Re: [I] [C++] arrow-threading-utility-test takes a long time [arrow]
via GitHub
2026/02/18
Re: [I] [R] Build fails if dataset enabled but parquet is not [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] StatisticsAsScalars doesn't support Decimal conversion for int primitives [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] Root message of parquet may contain repetition [arrow]
via GitHub
2026/02/18
Re: [I] [C++] [Parquet] Primitive types have defined num_children [arrow]
via GitHub
2026/02/18
Re: [I] [C++] [Python] Python tests fail if compiled with glog [arrow]
via GitHub
2026/02/18
Re: [I] [C++] C++ IPC reading looks like it doesn't support uncompressed buffer convention for compressed buffers [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Gandiva] Performance issue for TreeExprBuilder::MakeIf when nested plenty times. [arrow]
via GitHub
2026/02/18
Re: [I] [Doc] Update crossbow docs for archery [arrow]
via GitHub
2026/02/18
Re: [I] [R] Writing to Parquet from tibble Consumes Large Amount of Memory [arrow]
via GitHub
2026/02/18
Re: [I] [Python][C++] S3FileSystem with proxy_options is very slow on Windows [arrow]
via GitHub
2026/02/18
Re: [I] [Python][C++] pa.total_allocated_bytes incorrect after switching the default allocator [arrow]
via GitHub
2026/02/18
Re: [I] [Python] TypeError when accessing length of an invalid ListScalar [arrow]
via GitHub
2026/02/18
Re: [I] RecordBatchBuilder with uint dictionary creates signed int Batch [arrow]
via GitHub
2026/02/18
Re: [I] OSError: Invalid IPC stream: negative continuation token [arrow]
via GitHub
2026/02/18
Re: [I] [Python] StructScalar Timestamp using .to_pandas() loses/converts type [arrow]
via GitHub
2026/02/18
Re: [I] [Documentation] SEO tags confused for some pages [arrow]
via GitHub
2026/02/18
Re: [I] [Python] bool value of scalars depends on data type [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Warning when compiling on ubunut 21.04 [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Non-nullable schema fields not checked in Table.from_pydict [arrow]
via GitHub
2026/02/18
Re: [I] [C++] ArrowLog with FATAL level is not robust if running in the service [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Thread pool leaks memory when forking (and could maybe deadlock) if threads exist at the time of fork [arrow]
via GitHub
2026/02/18
Re: [I] [Python] HadoopFileSystem crash when called twice and Java was misconfigured [arrow]
via GitHub
2026/02/18
Re: [I] [C++] StructArray ToString method doesn't print field names [arrow]
via GitHub
2026/02/18
Re: [I] [CI] [C++] TestToDateHolder test error [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Bump AWS SDK versions in ThirdpartyToolchain to build on GCC11 [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] StreamReader.SkipColumns slow [arrow]
via GitHub
2026/02/18
Re: [I] [Dev] r_valgrind image doesn't use full parallelism [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Breaking API change in FSSpecHandler, requires metadata argument [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Add DataType.to_numpy_dtype (equivalent of to_pandas_dtype, but for numpy) [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Parquet] Reading dict pages is not reading all values? [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Tests maybe uninitialized compiler warnings [arrow]
via GitHub
2026/02/18
Re: [I] [C++] [Python] Dictionary equality not correct? [arrow]
via GitHub
2026/02/18
Re: [I] Shared libraries linker error when using clang, C++ 20, and ld [arrow]
via GitHub
2026/02/18
Re: [I] [C++] Add async version of the ORC Dataset scanner [arrow]
via GitHub
2026/02/18
Re: [I] [Python] Inconsistent handling of integer-valued partitions in dataset filters API [arrow]
via GitHub
2026/02/18
Re: [I] [C++] ThreadIndexer occasionally fails in CI with "Check failed: (thread_index) < (Capacity())" [arrow]
via GitHub
2026/02/18
Re: [I] [C++][Python] Generated argument description for compute meta-functions not accurate [arrow]
via GitHub
Earlier messages
Later messages