issues
Messages by Thread
[I] [C++][FlightSQL][ODBC] MSVC CI caching issues [arrow]
via GitHub
Re: [I] [C++][Parquet] Develop external predicate pushdown API for column readers [arrow]
via GitHub
Re: [I] Website returns 404 for docs version 16.0 [arrow]
via GitHub
[I] [C++] C++20: use standard calendar / timezone APIs [arrow]
via GitHub
Re: [I] [Python] Fastest way to handle csv file w/ column mismatch [arrow]
via GitHub
[I] [C++] C++20: use starts_with/ends_with methods [arrow]
via GitHub
[I] [C++] C++20: use standard bit utils [arrow]
via GitHub
[I] [C++] C++20: add/use concepts [arrow]
via GitHub
[I] [C++] C++20: use std::bit_cast [arrow]
via GitHub
[I] [C++] C++20: use std::span [arrow]
via GitHub
[I] [C++] C++20 modernization [arrow]
via GitHub
Re: [I] [C++][Docs] Update minimum GCC to 8 and C++ standard to C++20 [arrow]
via GitHub
Re: [I] [Developer][Documentation] Organize source and binary dependency licenses into directories [arrow]
via GitHub
Re: [I] [C++][Python] Set up testing for backwards compatibility of the parquet reader [arrow]
via GitHub
Re: [I] [C++] Remove compute pointer aliases [arrow]
via GitHub
Re: [I] [C++] Add Result<T> to the Visitor pattern [arrow]
via GitHub
Re: [I] [C++][Parquet] Implement non-vectorized array reconstruction logic. [arrow]
via GitHub
Re: [I] [C++][Parquet] Key rotation tool [arrow]
via GitHub
Re: [I] [C++][Parquet] Large decimal values don't roundtrip correctly [arrow]
via GitHub
Re: [I] [C++] Use feature enum [arrow]
via GitHub
Re: [I] [C++][Parquet] Create randomized nested data generation round trip read/write unit tests [arrow]
via GitHub
Re: [I] [C++][Gandiva] Add CMake support for compiling LLVM's IR into a library [arrow]
via GitHub
Re: [I] [Archery] Comment bot should report any errors happening during crossbow submit [arrow]
via GitHub
Re: [I] [Python] Serialising numpy array yields `pyarrow.lib.ArrowNotImplementedError: list<item: float>` [arrow]
via GitHub
Re: [I] [Python] Conversion of numpy array to pyarrow.Tensor: Negative ndarray strides not supported [arrow]
via GitHub
Re: [I] [C++] -Dzlib_SOURCE=BUNDLED on Windows does not produce arrow.dll with zlib statically linked [arrow]
via GitHub
Re: [I] [Dataset][C++] RecordBatchProjector is not thread safe [arrow]
via GitHub
Re: [I] [C++][Dataset] Implement ScalarAsStatisctics for non-primitive types [arrow]
via GitHub
Re: [I] [C++] clang-tidy diagnostics not emitted for most headers [arrow]
via GitHub
Re: [I] [C++] Gandiva exposes LLVM symbols [arrow]
via GitHub
Re: [I] [Python] Writing partitions with NaNs silently drops data [arrow]
via GitHub
Re: [I] [C++][Dataset] Give more informative error message for mismatching schemas for FileSystemSources [arrow]
via GitHub
Re: [I] [Python] Expose dataset PartitioningFactory.inspect ? [arrow]
via GitHub
Re: [I] [Python] csv.ConvertOptions Do Not Pass Through/Retain Nullability from Schema [arrow]
via GitHub
Re: [I] [Python] csv.ConvertOptions Documentation Is Unclear Around Disabling Type Inference [arrow]
via GitHub
Re: [I] [C++][Parquet] 1.4.0+ reader ignore stats created by 1.3.* writer [arrow]
via GitHub
Re: [I] [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection [arrow]
via GitHub
Re: [I] [FlightRPC][C++] DoPutPayloadWriter doesn't always expose server error message [arrow]
via GitHub
Re: [I] [Developer][C++] IWYU fails on include-cycle in uriparser/Uri.h [arrow]
via GitHub
Re: [I] [C++] gcc6 warning re: arrow::internal::ArgSort [arrow]
via GitHub
Re: [I] [C++][CSV] Issue building CSV component under GCC 6.1.0 [arrow]
via GitHub
Re: [I] [Docs] Integration testing instructions for base docker image are incorrect [arrow]
via GitHub
Re: [I] [Python] hdfs fails to connect to for HDFS 3.x cluster [arrow]
via GitHub
Re: [I] [Python] Empty table creation from schema with nested dictionary type [arrow]
via GitHub
Re: [I] [C++][Python] Make reading functions to return consistent exceptions [arrow]
via GitHub
Re: [I] [C++][Compute] Provide a kernel property testing API [arrow]
via GitHub
Re: [I] [C++] Default display for multi-choice define_option_string is misleading [arrow]
via GitHub
Re: [I] [C++] Unable to load libjvm on ppc64le architecture for hdfs.connect() [arrow]
via GitHub
Re: [I] [C++] Writing IPC messages with 64-byte buffer alignment vs. 8-byte default [arrow]
via GitHub
Re: [I] [Python] read_csv() case of user specified column_names AND include_columns [arrow]
via GitHub
Re: [I] [C++] Column type inference in read_csv vs. open_csv. CSV conversion error to null [arrow]
via GitHub
Re: [I] [C++] overloaded virtual function "arrow::io::Writable::Write" is only partially overridden in class [arrow]
via GitHub
Re: [I] [Python] pandas index information gets lost when partition_cols are used [arrow]
via GitHub
Re: [I] [Python] Specifying columns in a dataset drops the index (pandas) metadata. [arrow]
via GitHub
Re: [I] [Python] Column names of type CategoricalIndex fails to convert back to pandas [arrow]
via GitHub
Re: [I] [C++] jemalloc_set_decay_ms precedence [arrow]
via GitHub
Re: [I] [Python] parquet.read_table causes crashes on Windows Server 2016 w/ Xeon Processor [arrow]
via GitHub
[I] [Format][C++] Add tensor and sparse tensor supports in File metadata [arrow]
via GitHub
Re: [I] [Format][C++] Add tensor and sparse tensor supports in File metadata [arrow]
via GitHub
Re: [I] [C++/Python] S3FileSystem.create_dir should raise for a nested directory with recursive keyword set to False [arrow]
via GitHub
Re: [I] [C++] Raw data equality in arrays vs. semantic value equality [arrow]
via GitHub
Re: [I] [Website] Transition to new .asf.yaml machinery for website publishing [arrow]
via GitHub
Re: [I] [C++] Extending STL API to support row-wise conversion [arrow]
via GitHub
Re: [I] [Python] Add Array ctor microbenchmarks [arrow]
via GitHub
Re: [I] [C++] CSV reader accept schema [arrow]
via GitHub
Re: [I] [Crossbow] Unify the version numbers generated by crossbow and rake [arrow]
via GitHub
Re: [I] [C++] More extensive attributes usage could improve debugging [arrow]
via GitHub
Re: [I] [Python] Consider adding some user-friendly conveniences to Filesystem API [arrow]
via GitHub
Re: [I] [C++] Create "ARROW_LIBRARIES" argument to pass list of desired components to build [arrow]
via GitHub
Re: [I] [C++] In CMake output, list each enabled thirdparty toolchain dependency and the reason for its being enabled [arrow]
via GitHub
Re: [I] [Python] Define PyObjectBuffer with Py_XDECREF logic in destructor for object array memory [arrow]
via GitHub
Re: [I] [Packaging][Crossbow] Always upload binary artifacts regardless of the test result [arrow]
via GitHub
Re: [I] [CI] Turn off unnecessary features in the integration tests (spark/turbodbc/dask/hdfs) [arrow]
via GitHub
Re: [I] [C++][Dataset] Handle DictType index mismatch better [arrow]
via GitHub
Re: [I] [C++][CMake] Automatically set ARROW_GANDIVA_PC_CXX_FLAGS for conda and OSX sdk [arrow]
via GitHub
Re: [I] [Archery] Create a wrapper script in archery for docker compose in order to run the containers with the host's user and group [arrow]
via GitHub
Re: [I] [C++][Dataset] Ability to restrict Hive partitioning to certain fields [arrow]
via GitHub
Re: [I] [C++] Add a facility to create a Bitmap buffer from an data pointer with a specified sentinel [arrow]
via GitHub
Re: [I] [C++][Documentation] Link to generated Doxygen docs from main Sphinx TOC tree [arrow]
via GitHub
Re: [I] [C++] Deduplicate schema equivalence checks [arrow]
via GitHub
Re: [I] [Python] Define API for user-defined conversions of array cell values in pyarrow.array [arrow]
via GitHub
Re: [I] [Developer][Integration] Consolidate example JSON and test/validate uniformly [arrow]
via GitHub
Re: [I] [Release] Ensure that the JIRAs belonging the release's commits have the proper version number [arrow]
via GitHub
Re: [I] [C++] Create CMake utility to streamline creating ADD_$COMPONENT_TEST helper functions [arrow]
via GitHub
Re: [I] [C++] Add "ON only if system dependencies available" build mode for certain optional Arrow components [arrow]
via GitHub
Re: [I] [C++] Consider implementing BufferOuputStream using BufferBuilder internally [arrow]
via GitHub
Re: [I] [Python] arrow_to_pandas.cc has separate code paths for populating list<T> values into an object array [arrow]
via GitHub
Re: [I] [C++] Support dictionary unification on dictionaries having nulls [arrow]
via GitHub
Re: [I] [Release] Document environment configuration to run release verification on macOS [arrow]
via GitHub
Re: [I] [C++] Provide API for IPC roundtrip of RecordBatches not using the encapsulated message format [arrow]
via GitHub
Re: [I] [R] Explore roxygen2 R6 class documentation [arrow]
via GitHub
Re: [I] [Archery] Cleanup integration module to use companion classes [arrow]
via GitHub
Re: [I] [C++/Python] retrieve fd of open memory mapped file and Open() memory mapped file by fd [arrow]
via GitHub
Re: [I] [R] Add col_select argument to read_ipc_stream [arrow]
via GitHub
Re: [I] [Archery] Benchmark diff should provide a TUI friendly output [arrow]
via GitHub
Re: [I] [Developer] Add Windows utility script to use Dependencies.exe to dump DLL dependencies for diagnostic purposes [arrow]
via GitHub
Re: [I] [C++][CI] Hiveserver2 instegration test fails to connect to impala container [arrow]
via GitHub
Re: [I] [R] Add option to preserve dictionary logical type rather than coerce to factor [arrow]
via GitHub
Re: [I] [Python] Allow HDFS FileSystem to be created without Hadoop present [arrow]
via GitHub
Re: [I] [C++] Simplify build-support/run-test.sh [arrow]
via GitHub
Re: [I] [C++][Documentation] Document how to set installed location for individual toolchain components [arrow]
via GitHub
Re: [I] [C++][Parquet] Optional parallel processing when writing Parquet files [arrow]
via GitHub
Re: [I] [C++][Parquet] Examine Arrow-decoding perf regressions introduced by PARQUET-1797 [arrow]
via GitHub
Re: [I] [C++] Add field to IpcReadOptions to include padding in Buffer metadata accounting [arrow]
via GitHub
Re: [I] [C++] Add short representation string to common classes [arrow]
via GitHub
Re: [I] [Python] Refactor context_choices in test_cuda_numba_interop to be a module level fixture [arrow]
via GitHub
Re: [I] [C++] Sanitize hdfs host when creating HadoopFileSystem from endpoint [arrow]
via GitHub
Re: [I] [Python] Externalize option whether to bundle zlib DLL in Python packages [arrow]
via GitHub
Re: [I] [C++] Simplify IPC tests by using BufferOutputStreams [arrow]
via GitHub
Re: [I] [C++] Add "random access" / slice read API to RecordBatchFileReader [arrow]
via GitHub
Re: [I] [C++][Parquet] Add benchmarks for rep/def level decoding at multiple levels [arrow]
via GitHub
Re: [I] [C++] Implement "round robin" scheduler interface to fixed-size ThreadPool [arrow]
via GitHub
Re: [I] [C++] Add support for gflags version detection [arrow]
via GitHub
Re: [I] [C++][Gandiva] Reduce number of files and headers [arrow]
via GitHub
Re: [I] [Doc] General introduction to archery [arrow]
via GitHub
Re: [I] [C++][Parquet] Require error message when using ParquetException::EofException [arrow]
via GitHub
Re: [I] [C++] Add benchmarks for arrow/util/rle_encoder.h for non-dictionary encodings. [arrow]
via GitHub
Re: [I] [Python][Dataset] Consider adding Cast like operation [arrow]
via GitHub
Re: [I] [C++] Selective compression on the wire [arrow]
via GitHub
Re: [I] [Python] Get Access to the type_to_type_id dictionary [arrow]
via GitHub
Re: [I] [Gandiva][UDF] Solutions to register new UDFs dynamically without checking it into arrow repo. [arrow]
via GitHub
Re: [I] [Python][Dataset] Infer the filesystem from the first path if multiple paths are passed to dataset() [arrow]
via GitHub
Re: [I] [C++][Parquet] Expose an API that surface RLE information for rep/def levels when reading parquet files [arrow]
via GitHub
Re: [I] [C++][Parquet] Expose an API that allows direct writing of RLE information for rep/def levels when writing parquet files [arrow]
via GitHub
Re: [I] [C++] Listing files with S3FileSystem is slow [arrow]
via GitHub
Re: [I] [Gandiva][UDF] Add a udf for gandiva to extract all named groups. [arrow]
via GitHub
Re: [I] [Python][R] Expose incremental write API for Feather files [arrow]
via GitHub
Re: [I] [Gandiva][UDF] Support complex datatype for UDF return type. [arrow]
via GitHub
Re: [I] [C++] C++ array kernels framework and execution buildout (umbrella issue) [arrow]
via GitHub
Re: [I] [Python] Create tools to enable optional components (like Gandiva, Flight) to be built and deployed as separate Python packages [arrow]
via GitHub
Re: [I] [Python] Allow fast writing of Decimal column to parquet [arrow]
via GitHub
Re: [I] [C++] Rearrange code in bit-util.h/.cc for AppendWord [arrow]
via GitHub
Re: [I] [C++][Dataset] Add test case to check if all essential properties are preserved after ScannerBuilder::Project is called [arrow]
via GitHub
Re: [I] [C++]Expose API for pushing down rep/def level comparison down to decoder [arrow]
via GitHub
Re: [I] [Python] Test error message when discovering dataset with invalid files [arrow]
via GitHub
Re: [I] [C++] Add multi-consumer Scheduler API to sit one layer above ThreadPool [arrow]
via GitHub
Re: [I] [Format] Create reference implementations of IPC RecordBatch body compression from ARROW-300 [arrow]
via GitHub
Re: [I] [Python][Documentation] Add column limit recommendations Parquet page [arrow]
via GitHub
Re: [I] [C++] Implement Array to JSON function [arrow]
via GitHub
Re: [I] [C++] Add "TypeResolver" class interface to replace current OutputType::Resolver pattern [arrow]
via GitHub
Re: [I] [Dev] Use --password-stdin for docker login from archery [arrow]
via GitHub
Re: [I] [Python] supporting pandas sparse series in pyarrow [arrow]
via GitHub
Re: [I] [C++] Don't re-initialize Minio in every s3fs benchmark [arrow]
via GitHub
Re: [I] [C++] Make ThreadPool task ordering configurable [arrow]
via GitHub
Re: [I] [C++] Determine desirable maximum length for ExecBatch in pipelined and parallel execution of kernels [arrow]
via GitHub
Re: [I] [C++] Determine strategy for propagating failures in initializing built-in function registry in arrow/compute [arrow]
via GitHub
Re: [I] [R] Implementing tidyr interface [arrow]
via GitHub
Re: [I] [Release] Website release notes count not strictly release associated patches [arrow]
via GitHub
Re: [I] [C++] Arrow-native C++ Data Frame-style programming interface for analytics (umbrella issue) [arrow]
via GitHub
Re: [I] [C++] Parallelize execution of arrow::compute::ScalarFunction [arrow]
via GitHub
Re: [I] [Python] Add necessary plumbing to enable Numba-generated functions to be registered as functions in the global C++ function/kernels registry [arrow]
via GitHub
Re: [I] [Python] An independent Cython package for Cython-based projects that want to program against the C data interface [arrow]
via GitHub
Re: [I] [C++] Scalar formatting code used in array/diff.cc should be reusable [arrow]
via GitHub
Re: [I] [C++] Benchmark hash table against thirdparty options, possibly vendor a thirdparty hash table library [arrow]
via GitHub
Re: [I] [C++] Add VectorFunction wrapping arrow::Concatenate [arrow]
via GitHub
Re: [I] [C++] Deprecate or remove Scalar::Parse and Scalar::CastTo [arrow]
via GitHub
Re: [I] [Python] Add tests to verify that one can build a C++ extension against the manylinux1 wheels [arrow]
via GitHub
Re: [I] [C++][Gandiva][MinGW] Enable crashed tests [arrow]
via GitHub
Re: [I] [Format] Add forward compatibility checks for Decimal::bitWidth to reference libraries [arrow]
via GitHub
Re: [I] [C++][Compute] Formalize Op functor concept [arrow]
via GitHub
Re: [I] [FlightRPC][C++][Python] Allow updating TLS certificate at runtime [arrow]
via GitHub
Re: [I] [Python][Dataset] Write a custom field to _metadata caching file size [arrow]
via GitHub
Re: [I] [C++][Developer] Implement tool to compile and run C++ benchmarks from master branch against older codebase revisions [arrow]
via GitHub
Re: [I] [C++] Add vectorized "IntegersMultipleOf" to arrow/util/int_util.h [arrow]
via GitHub
Re: [I] [C++] Add crossbow job to capture build setup [arrow]
via GitHub
Re: [I] [Python][Packaging] Enable S3 support in Windows wheels [arrow]
via GitHub
Re: [I] [C++] Implement PrettyPrint for Scalars [arrow]
via GitHub
Re: [I] [C++] Detect unauthorized memory allocations in function kernels [arrow]
via GitHub
Re: [I] [C++][Dataset][Python] ParquetDataset typecast on read [arrow]
via GitHub
Re: [I] [R] Add chunk_size to Table$create() [arrow]
via GitHub
Re: [I] [C++] Replace usages of TestBase::MakeRandomArray in testing/gtest_util.h with RandomArrayGenerator [arrow]
via GitHub
Re: [I] [FlightRPC][Integration] Add support for setting metadata version for integration tests [arrow]
via GitHub
Re: [I] [Python] Expose CpuInfo for informational / debugging purposes [arrow]
via GitHub
Re: [I] [C++/Python] Support necessary functionality to have an Arrow-string type in pandas [arrow]
via GitHub
Re: [I] [C++] SchemaFromJSON for testing deeply nested schemas [arrow]
via GitHub
Re: [I] [C++/Python] Add option to Take kernel to interpret negative indices as NULL [arrow]
via GitHub
Re: [I] [C++/Python] Add option to Take kernel to interpret negative indices as indexing from the right [arrow]
via GitHub
Re: [I] [Python] pyarrow pyarrow.lib.ArrowTypeError, how do I construct with schema? [arrow]
via GitHub
Re: [I] [Python]: Arrow Flight SQL server communication issue with JDBC Arrow FlightSQL driver [arrow]
via GitHub
Re: [I] Python API - Do I need to optimize filters for querying a dataset? [arrow]
via GitHub
Re: [I] Token parameter error - flight-sql-jdbc-driver-13.0.0 [arrow]
via GitHub
Re: [I] How to share a table in memory between 2 python programs running separately on the same machine / Usage of foreign_buffer() method in python. [arrow]
via GitHub
Re: [I] How to force kerberos authentification for pyarrow 'open_input_file' [arrow]
via GitHub
Re: [I] Streaming Arrow Buffer Over Http API [arrow]
via GitHub
Re: [I] [Python] Dictionary values are not round-tripping properly from and to pandas [arrow]
via GitHub
Re: [I] [Python] Options for S3FileSystem to support Requester Pays enabled S3 buckets [arrow]
via GitHub
Re: [I] [Python] Non zero-copy of pa.table.to_pandas() for simple case [arrow]
via GitHub
Re: [I] pyarrow flight error: Could not finish writing before closing [arrow]
via GitHub
Re: [I] Does/Can arrow compute functions use BLAS libs? [arrow]
via GitHub
Re: [I] [Python ] Support array objects (beyond numpy) in python->arrow conversion [arrow]
via GitHub
Re: [I] [Python] Pyarrow Group_by ChunkedArray and most frequent value [arrow]
via GitHub
Re: [I] [Python] How to create pa.Table from multiple pa.Table's by making each of them column of struct type. [arrow]
via GitHub
Re: [I] Support for Nested Schema of Lists and Structs [arrow]
via GitHub
Re: [I] [Python] duration[arrow] support in pandas [arrow]
via GitHub
Re: [I] segmentation fault from opening large single column csv with small blockszie pyarrow.csv.open_csv() [arrow]
via GitHub
Re: [I] [C++][Parquet] Would `ReadNewPage` conflict with data_page_filter? [arrow]
via GitHub
Re: [I] [C++] How to use CallFunction() when arg is a ExtensionScalar(ExtensionType) [arrow]
via GitHub
Re: [I] use pyarrow.parquet.read_schema on parquet file in cloud storage [arrow]
via GitHub
Re: [I] What is the best way to consume a stream of record batches from another process? [arrow]
via GitHub
Re: [I] [Python] Using unify_schema() during schema evolution fails [arrow]
via GitHub