Messages by Thread
-
[PR] [WIP] chore: Deterministic protoc version for linux [datafusion-ballista]
via GitHub
-
[I] Add documentation/examples on how to use substrait plans with Ballista [datafusion-ballista]
via GitHub
-
[PR] fix: [WIP] [iceberg] Remove IcebergFileStream and use iceberg-rust's parallelization [datafusion-comet]
via GitHub
-
[PR] fix: [iceberg] reduce granularity of metrics updates in IcebergFileStream [datafusion-comet]
via GitHub
-
Re: [I] Avoid extra copies in `CoalesceBatchesExec` to improve performance [datafusion]
via GitHub
-
Re: [I] Add `CoalesceBatchesExec` to `NestedLoopJoinExec` [datafusion]
via GitHub
-
Re: [I] Idea: Avoid planning CoalesceBatches in front of blocking operators. [datafusion]
via GitHub
-
Re: [I] `CoalesceBatches` physical optimizer rule says it should be last but it isn't [datafusion]
via GitHub
-
[PR] perf: Improve criterion benchmarks for cast string to int [datafusion-comet]
via GitHub
-
Re: [PR] Add `Field` to `Expr::Cast` -- allow logical expressions to express a cast to an extension type [datafusion]
via GitHub
-
[PR] minor: small improvements in cast from string to int [WIP] [datafusion-comet]
via GitHub
-
Re: [PR] Fix internal error "Physical input schema should be the same as the one converted from logical input schema." [datafusion]
via GitHub
-
Re: [PR] Row group limit pruning for row groups that entirely match predicates [datafusion]
via GitHub
-
Re: [I] [EPIC] Support limit pruning [datafusion]
via GitHub
-
[PR] Migrate SchemaAdapter to PhysicalExprAdapter [datafusion-comet]
via GitHub
-
[I] DataFusion 52 migration [datafusion-comet]
via GitHub
-
[PR] datafusion 46 (do not merge) [datafusion]
via GitHub
-
[PR] chore: Add checks to microbenchmarks for plan running natively in Comet [datafusion-comet]
via GitHub
-
[PR] fix(functions-aggregate): drain CORR state vectors for streaming aggregation [datafusion]
via GitHub
-
[PR] Experimental: Native CSV files read [datafusion-comet]
via GitHub
-
[PR] refactor(repartition): split BatchPartitioner::try_new into hash and round-robin constructors [datafusion]
via GitHub
-
[PR] Update a bunch of dependencies [datafusion]
via GitHub
-
[PR] Remove dependency on rust_decimal [datafusion]
via GitHub
-
[PR] Cherry-pick https://github.com/apache/datafusion/commit/a5cc0313499d7… [datafusion]
via GitHub
-
Re: [I] Postgres partitioning "CREATE TABLE ... PARTITION OF" unparseable [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] GenericDialect: support colon operator for JsonAccess [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] feat: Add arrow flight proxy to scheduler [datafusion-ballista]
via GitHub
-
[I] Provide support for SSL/TLS connections [datafusion-ballista]
via GitHub
-
[I] Split `BatchPartitioner::try_new(..)` into `BatchPartitioner::try_new_hash` and `BatchPartitioner::try_new_round_robin` [datafusion]
via GitHub
-
[PR] Add support for additional numeric types in to_timestamp functions [datafusion]
via GitHub
-
[I] GitHub Actions covering python submodule improvements [datafusion-ballista]
via GitHub
-
[I] Support group by all in datafusion [datafusion]
via GitHub
-
[PR] [branch-52] Prepare 52.0.0 release version number and changelog [datafusion]
via GitHub
-
Re: [I] [EPIC] Shuffle file execs improvement [datafusion-ballista]
via GitHub
-
[PR] Add a protection to release candidate branch 52 [datafusion]
via GitHub
-
[PR] fix: DynamicFilterPhysicalExpr violates Hash/Eq contract [datafusion]
via GitHub
-
Re: [I] Support ALL operator [datafusion]
via GitHub
-
[I] Track `elapsed_compute` for `AsyncFuncExec` [datafusion]
via GitHub
-
Re: [PR] feat: Use PartialSortExec when input data is sorted on prefix columns [datafusion]
via GitHub
-
Re: [PR] Sort with limit for single arrays [datafusion]
via GitHub
-
[D] Support Spark 3.3/JDK8 [datafusion-comet]
via GitHub
-
Re: [I] `DateAdd` / `DateSub` do not perform overflow checks in release builds [datafusion-comet]
via GitHub
-
Re: [I] SparkDateAdd should use wrapping addition/subtraction [datafusion]
via GitHub
-
Re: [I] SparkDateAdd does not check for overflow [datafusion]
via GitHub
-
Re: [I] Upgrade guide doesn't match api doc [datafusion]
via GitHub
-
[I] Snowflake: does not support statement: Insert(Multi-table) [datafusion-sqlparser-rs]
via GitHub
-
[PR] Downgrade aws-smithy-runtime to avoid rustsec [datafusion]
via GitHub
-
Re: [PR] Downgrade aws-smithy-runtime, update `rust_decimal` to avoid rustsec errors [datafusion]
via GitHub
-
Re: [PR] Downgrade aws-smithy-runtime, update `rust_decimal` to avoid rustsec errors [datafusion]
via GitHub
-
Re: [PR] Downgrade aws-smithy-runtime, update `rust_decimal` to avoid rustsec errors [datafusion]
via GitHub
-
Re: [PR] Downgrade aws-smithy-runtime, update `rust_decimal` to avoid rustsec errors [datafusion]
via GitHub
-
Re: [PR] Downgrade aws-smithy-runtime, update `rust_decimal` to avoid rustsec errors [datafusion]
via GitHub
-
Re: [PR] Downgrade aws-smithy-runtime, update `rust_decimal`, ignore RUSTSEC-2026-0001 to get clean CI [datafusion]
via GitHub
-
Re: [PR] Downgrade aws-smithy-runtime, update `rust_decimal`, ignore RUSTSEC-2026-0001 to get clean CI [datafusion]
via GitHub
-
Re: [PR] Downgrade aws-smithy-runtime, update `rust_decimal`, ignore RUSTSEC-2026-0001 to get clean CI [datafusion]
via GitHub
-
Re: [I] Make to_timestamp aware of execution timezone [datafusion]
via GitHub
-
[I] cargo audit failing on main [datafusion]
via GitHub
-
[I] Use pipeline aggregation when data is implicitly sorted by group-by keys [datafusion]
via GitHub
-
[PR] Comet Writer should respect object store settings [datafusion-comet]
via GitHub
-
[I] Comet writer should support dynamicPartitionOverwrite mode [datafusion-comet]
via GitHub
-
[I] Offset parquet pushdown [datafusion]
via GitHub
-
Re: [PR] feat: allow native Iceberg scans with non-identity transform residuals [datafusion-comet]
via GitHub
-
[PR] feat: Bump rust to `rust:1.92-trixie` [datafusion-ballista]
via GitHub
-
[I] `auto` scan mode should select `native_datafusion` for supported use cases [datafusion-comet]
via GitHub
-
[PR] feat: add missing public API documentation/comments [datafusion-ballista]
via GitHub
-
[PR] feat: support `SELECT DISTINCT id FROM t ORDER BY id LIMIT n` query use GroupedTopKAggregateStream [datafusion]
via GitHub
-
[PR] Add support for DuckDB `LAMBDA` keyword syntax [datafusion-sqlparser-rs]
via GitHub
-
Re: [I] Inconsistencies between `RecordBatch` and `DataFrame` schemas cause `to_arrow_table` to fail [datafusion-python]
via GitHub
-
Re: [PR] fix: Inconsistent schemas when converting to pyarrow [datafusion-python]
via GitHub
-
Re: [PR] Add blog post on extending SQL in DataFusion [datafusion-site]
via GitHub
-
Re: [PR] feat: add list_files_cache table function for `datafusion-cli` [datafusion]
via GitHub
-
[I] Andrew Lamb Weekly-ish Open Source plan - 2026-01-05 [datafusion]
via GitHub
-
Re: [I] Andrew Lamb Weekly-ish Open Source plan - 2025-12-08 [datafusion]
via GitHub
-
Re: [PR] Do not convert pyarrow scalar values to plain python types when passing as `lit` [datafusion-python]
via GitHub
-
Re: [PR] fix: use coalesce instead of drop_duplicate_keys for join [datafusion-python]
via GitHub
-
[PR] fix: Percent Encoding of paths for hive style partitioning [datafusion]
via GitHub
-
[I] Partition values are not URL-decoded when extracted from Hive-style paths [datafusion]
via GitHub
-
[PR] chore(deps): bump clap from 4.5.53 to 4.5.54 [datafusion-sandbox]
via GitHub
-
Re: [PR] chore(deps): bump clap from 4.5.50 to 4.5.51 [datafusion-sandbox]
via GitHub
-
Re: [PR] chore(deps): bump tokio-util from 0.7.16 to 0.7.17 [datafusion-sandbox]
via GitHub
-
[PR] chore(deps): bump tokio-util from 0.7.17 to 0.7.18 [datafusion-sandbox]
via GitHub
-
[PR] chore(deps): bump aws-config from 1.8.11 to 1.8.12 [datafusion-sandbox]
via GitHub
-
Re: [PR] chore(deps): bump aws-config from 1.8.7 to 1.8.10 [datafusion-sandbox]
via GitHub