github
Thread
Date
Earlier messages
Later messages
Messages by Thread
Re: [I] Replace shuffle implementation with version based on Comet [datafusion-ballista]
via GitHub
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
[PR] feat: Utf8View and BinaryView support [datafusion-comet]
via GitHub
[PR] feat(aqe): support executor failure in AdaptiveExecutionGraph [datafusion-ballista]
via GitHub
Re: [I] Improve documentation for configs related to explain/fallback [datafusion-comet]
via GitHub
[PR] feat: rewrite standalone shuffle_bench to drive real Parquet input [datafusion-ballista]
via GitHub
Re: [PR] feat: Add standalone shuffle writer benchmark that shuffles real Parquet input [datafusion-ballista]
via GitHub
Re: [PR] feat: Add standalone shuffle writer benchmark that shuffles real Parquet input [datafusion-ballista]
via GitHub
Re: [PR] feat: Add standalone shuffle writer benchmark that shuffles real Parquet input [datafusion-ballista]
via GitHub
Re: [PR] feat: Add standalone shuffle writer benchmark that shuffles real Parquet input [datafusion-ballista]
via GitHub
Re: [PR] feat: Add standalone shuffle writer benchmark that shuffles real Parquet input [datafusion-ballista]
via GitHub
[I] Configuration guide should be generated from code [datafusion-ballista]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
[PR] feat: defer sort-shuffle materialization with interleave_record_batch [datafusion-ballista]
via GitHub
Re: [PR] feat: defer sort-shuffle materialization with interleave_record_batch [datafusion-ballista]
via GitHub
Re: [D] DISCUSSION: Seattle DataFusion Meetup (April 23, 2026) [datafusion]
via GitHub
Re: [D] DISCUSSION: Seattle DataFusion Meetup (April 23, 2026) [datafusion]
via GitHub
Re: [D] DISCUSSION: Seattle DataFusion Meetup (April 23, 2026) [datafusion]
via GitHub
Re: [D] DISCUSSION: Seattle DataFusion Meetup (April 23, 2026) [datafusion]
via GitHub
[PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
[PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
Re: [I] Migrate 4.0.1 support to 4.0.2 [datafusion-comet]
via GitHub
Re: [I] Add vector distance, array math, and array aggregate functions [datafusion]
via GitHub
Re: [I] Add vector distance, array math, and array aggregate functions [datafusion]
via GitHub
Re: [PR] feat: change Expr OuterReferenceColumn and Alias to Box type for reducing expr struct size [datafusion]
via GitHub
[I] Unable to use Ballista at scale (e.g. TPC-H @ 1TB) [datafusion-ballista]
via GitHub
Re: [I] Remove `generational-arena` from project [datafusion]
via GitHub
Re: [PR] fix: prevent hash join deadlock when dynamic filtering is enabled [datafusion]
via GitHub
Re: [I] [Feature] Support external Remote Shuffle Service (e.g., Apache Celeborn / Apache Uniffle) [datafusion-ballista]
via GitHub
Re: [I] [Feature] Support external Remote Shuffle Service (e.g., Apache Celeborn / Apache Uniffle) [datafusion-ballista]
via GitHub
Re: [I] Use interleave_record_batch to avoid tiny batches in sort-based shuffle [datafusion-ballista]
via GitHub
Re: [I] Use interleave_record_batch to avoid tiny batches in sort-based shuffle [datafusion-ballista]
via GitHub
Re: [I] Use interleave_record_batch to avoid tiny batches in sort-based shuffle [datafusion-ballista]
via GitHub
Re: [PR] perf: optimise `first_value`, `last_value` aggregate function [datafusion]
via GitHub
Re: [PR] perf: optimise `first_value`, `last_value` aggregate function [datafusion]
via GitHub
Re: [PR] perf: optimise `first_value`, `last_value` aggregate function [datafusion]
via GitHub
Re: [PR] perf: optimise `first_value`, `last_value` aggregate function [datafusion]
via GitHub
Re: [PR] perf: optimise `first_value`, `last_value` aggregate function [datafusion]
via GitHub
[PR] chore: Bump Spark 4.0.1 to 4.0.2 [datafusion-comet]
via GitHub
Re: [PR] chore: Bump Spark 4.0.1 to 4.0.2 [datafusion-comet]
via GitHub
[PR] docs: document hash-based and sort-based shuffle implementations [datafusion-ballista]
via GitHub
Re: [PR] docs: document hash-based and sort-based shuffle implementations [datafusion-ballista]
via GitHub
Re: [PR] docs: document hash-based and sort-based shuffle implementations [datafusion-ballista]
via GitHub
Re: [PR] docs: document hash-based and sort-based shuffle implementations [datafusion-ballista]
via GitHub
[PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
Re: [I] Implement physical execution of uncorrelated scalar subqueries [datafusion]
via GitHub
Re: [I] Improve performance of `array_has` [datafusion]
via GitHub
[I] Comet should fallback to Spark for streaming queries [datafusion-comet]
via GitHub
Re: [I] Comet should fallback to Spark for streaming queries [datafusion-comet]
via GitHub
Re: [PR] ci: add Spark 4.0 / JDK 21 profile [datafusion-comet]
via GitHub
Re: [PR] ci: add Spark 4.0 / JDK 21 profile [datafusion-comet]
via GitHub
Re: [PR] ci: add Spark 4.0 / JDK 21 profile [datafusion-comet]
via GitHub
Re: [PR] ci: add Spark 4.0 / JDK 21 profile [datafusion-comet]
via GitHub
Re: [PR] ci: add Spark 4.0 / JDK 21 profile [datafusion-comet]
via GitHub
[I] Add support for Spark 4.2.0-preview4 [datafusion-comet]
via GitHub
Re: [I] Epic: CometNativeScan improvements (per-partition serde, cleanup, DPP, AQE DPP, V2 operator) [datafusion-comet]
via GitHub
Re: [I] Epic: CometNativeScan improvements (per-partition serde, cleanup, DPP, AQE DPP, V2 operator) [datafusion-comet]
via GitHub
[PR] feat: AQE DPP for native Parquet scans with broadcast reuse [datafusion-comet]
via GitHub
Re: [PR] feat: AQE DPP for native Parquet scans with broadcast reuse [datafusion-comet]
via GitHub
Re: [PR] feat: AQE DPP for native Parquet scans with broadcast reuse [datafusion-comet]
via GitHub
Re: [PR] feat: AQE DPP for native Parquet scans with broadcast reuse [datafusion-comet]
via GitHub
Re: [I] Remove `__unnest_placeholder` from result projection on queries with struct unnest. [datafusion]
via GitHub
Re: [PR] feat: remove `__unnest_placeholder` from struct unnest projection [datafusion]
via GitHub
Re: [I] Fix issues found when documenting current expression incompatibilities [datafusion-comet]
via GitHub
[PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
Re: [I] Add Java 21 to the supported version matrix (particularly for Spark 4.0) [datafusion-comet]
via GitHub
Re: [I] Add Java 21 to the supported version matrix (particularly for Spark 4.0) [datafusion-comet]
via GitHub
Re: [PR] feat: add initial support for `array_exists` with lambda expression support [datafusion-comet]
via GitHub
[PR] WIP: test Spark `array_exists` with lambdas [datafusion]
via GitHub
Re: [PR] WIP: test Spark `array_exists` with lambdas [datafusion]
via GitHub
Re: [PR] feat(unparser): Keep inner join `Filter → TableScan` predicates to `WHERE` instead of moving to `JOIN ON` [datafusion]
via GitHub
Re: [PR] feat(unparser): Keep inner join `Filter → TableScan` predicates to `WHERE` instead of moving to `JOIN ON` [datafusion]
via GitHub
[I] Use `NullBuffer::union_many` when appropriate [datafusion]
via GitHub
Re: [I] Use `NullBuffer::union_many` when appropriate [datafusion]
via GitHub
Re: [I] Use `NullBuffer::union_many` when appropriate [datafusion]
via GitHub
Re: [I] Use `NullBuffer::union_many` when appropriate [datafusion]
via GitHub
Re: [PR] feat: support TimestampType join keys in SortMergeJoin [datafusion-comet]
via GitHub
[I] Bug triage results: 2026-04-27 [datafusion-comet]
via GitHub
Re: [I] Bug triage results: 2026-04-27 [datafusion-comet]
via GitHub
[PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
Re: [I] chore: DataFusion 54.0.0 [datafusion-comet]
via GitHub
Re: [PR] feat: support regular BuildRight+LeftAnti hash join [datafusion-comet]
via GitHub
Re: [PR] feat: support regular BuildRight+LeftAnti hash join [datafusion-comet]
via GitHub
Re: [PR] feat: support regular BuildRight+LeftAnti hash join [datafusion-comet]
via GitHub
Re: [PR] feat: support regular BuildRight+LeftAnti hash join [datafusion-comet]
via GitHub
Re: [PR] feat: support regular BuildRight+LeftAnti hash join [datafusion-comet]
via GitHub
Re: [PR] feat: support regular BuildRight+LeftAnti hash join [datafusion-comet]
via GitHub
Re: [PR] feat: support regular BuildRight+LeftAnti hash join [datafusion-comet]
via GitHub
Re: [PR] feat: support regular BuildRight+LeftAnti hash join [datafusion-comet]
via GitHub
Re: [I] chore: Create CI action that validates links in md files [datafusion]
via GitHub
[PR] Update documentation for PhysicalExpr::evaluate_bounds [datafusion]
via GitHub
Re: [PR] Update documentation for PhysicalExpr::evaluate_bounds [datafusion]
via GitHub
Re: [PR] Update documentation for PhysicalExpr::evaluate_bounds [datafusion]
via GitHub
Re: [PR] Update documentation for PhysicalExpr::evaluate_bounds [datafusion]
via GitHub
Re: [PR] Update documentation for PhysicalExpr::evaluate_bounds [datafusion]
via GitHub
[PR] docs: rename SQL File Tests to Comet SQL Tests [datafusion-comet]
via GitHub
Re: [PR] docs: rename SQL File Tests to Comet SQL Tests [datafusion-comet]
via GitHub
Re: [PR] feat: AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
Re: [PR] feat: AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
Re: [PR] feat: AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
[I] Optimize Hash Aggregation for Multiple Dictionary-Encoded Group Keys [datafusion]
via GitHub
[I] Refactor Spark 4.0 and 4.1 shims [datafusion-comet]
via GitHub
Re: [I] Refactor Spark 4.0 and 4.1 shims [datafusion-comet]
via GitHub
[PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [PR] Fix clippy 1.95 lint errors [datafusion-sqlparser-rs]
via GitHub
[I] Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [I] Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [I] Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
Re: [PR] Resolve MIN/MAX from Parquet metadata for Single-mode aggregates and CAST projections [datafusion]
via GitHub
Re: [PR] Resolve MIN/MAX from Parquet metadata for Single-mode aggregates and CAST projections [datafusion]
via GitHub
Re: [PR] Resolve MIN/MAX from Parquet metadata for Single-mode aggregates and CAST projections [datafusion]
via GitHub
Re: [PR] Resolve MIN/MAX from Parquet metadata for Single-mode aggregates and CAST projections [datafusion]
via GitHub
Re: [PR] feat(clickhouse): support PARTITION BY after ORDER BY and ARRAY JOIN [datafusion-sqlparser-rs]
via GitHub
Re: [I] Blog about DataFusion correlated subquery support [datafusion]
via GitHub
Re: [I] Blog about DataFusion correlated subquery support [datafusion]
via GitHub
Re: [PR] Blog: Row-Level DML in DataFusion [datafusion-site]
via GitHub
Re: [PR] Introduce dependent join `LogicalPlan` to support complex subquery decorrelation [datafusion]
via GitHub
Re: [PR] feat(parquet): row-group morselization for sibling FileStream stealing [datafusion]
via GitHub
Re: [I] Blog post about 1000 distinct committers / history of the project [datafusion]
via GitHub
Re: [I] Blog post about 1000 distinct committers / history of the project [datafusion]
via GitHub
Re: [I] Blog post about 1000 distinct committers / history of the project [datafusion]
via GitHub
Re: [I] [DISCUSSION] DataFusion Road Map: Q1 2026 [datafusion]
via GitHub
Re: [I] [DISCUSSION] DataFusion Road Map: Q1 2026 [datafusion]
via GitHub
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
Re: [PR] feat: LogicalPlanningPipeline [datafusion]
via GitHub
Re: [PR] feat: LogicalPlanningPipeline [datafusion]
via GitHub
Re: [PR] feat: LogicalPlanningPipeline [datafusion]
via GitHub
Re: [PR] chore(deps): bump sha2 from 0.10.9 to 0.11.0 [datafusion]
via GitHub
Re: [PR] chore(deps): bump sha2 from 0.10.9 to 0.11.0 [datafusion]
via GitHub
Re: [PR] chore(deps): bump md-5 from 0.10.6 to 0.11.0 [datafusion]
via GitHub
Re: [PR] chore(deps): bump md-5 from 0.10.6 to 0.11.0 [datafusion]
via GitHub
Re: [I] DataFusion CLI is not working on Windows [datafusion]
via GitHub
[PR] ci: enable Spark 4.1 PR test matrix [datafusion-comet]
via GitHub
Re: [PR] ci: enable Spark 4.1 PR test matrix [datafusion-comet]
via GitHub
Re: [PR] ci: enable Spark 4.1 PR test matrix [datafusion-comet]
via GitHub
Re: [PR] ci: enable Spark 4.1 PR test matrix [datafusion-comet]
via GitHub
Re: [PR] Refactor InListExpr into static-filter modules [datafusion]
via GitHub
Re: [PR] Refactor InListExpr into static-filter modules [datafusion]
via GitHub
Earlier messages
Later messages