github
Thread
Date
Earlier messages
Later messages
Messages by Date
2026/05/01
Re: [PR] [TUI] Show job's stages and their tasks [datafusion-ballista]
via GitHub
2026/05/01
Re: [I] [EPIC] Improve awslabs published results for Comet w/ TPC-DS [datafusion-comet]
via GitHub
2026/05/01
Re: [I] Explore implementing some Comet accelerated expressions in Scala [datafusion-comet]
via GitHub
2026/05/01
Re: [I] [TUI] Render Executor's system & process metrics [datafusion-ballista]
via GitHub
2026/05/01
Re: [I] Explore implementing some Comet accelerated expressions in Scala [datafusion-comet]
via GitHub
2026/05/01
Re: [I] [TUI] Render Executor's system & process metrics [datafusion-ballista]
via GitHub
2026/05/01
[I] [TUI] Render Executor's system & process metrics [datafusion-ballista]
via GitHub
2026/05/01
[I] Make session setup an extension point for the CLI [datafusion]
via GitHub
2026/05/01
Re: [PR] perf: early termination for right semi/anti hash joins [datafusion]
via GitHub
2026/05/01
Re: [PR] perf: optimize scalar aggregate joins as semi joins [datafusion]
via GitHub
2026/05/01
Re: [PR] PushdownFilter optimizations [datafusion]
via GitHub
2026/05/01
Re: [PR] perf: optimize scalar aggregate joins as semi joins [datafusion]
via GitHub
2026/05/01
Re: [PR] perf: optimize scalar aggregate joins as semi joins [datafusion]
via GitHub
2026/05/01
[PR] feat(bench): show TPC-H query timings in seconds and add total time [datafusion-ballista]
via GitHub
2026/05/01
[PR] docs: refresh Gluten comparison with ANSI, Spark 4, and Iceberg coverage [datafusion-comet]
via GitHub
2026/05/01
Re: [PR] perf: optimize scalar aggregate joins as semi joins [datafusion]
via GitHub
2026/05/01
Re: [PR] perf: optimize scalar aggregate joins as semi joins [datafusion]
via GitHub
2026/05/01
Re: [PR] perf: optimize scalar aggregate joins as semi joins [datafusion]
via GitHub
2026/05/01
Re: [PR] perf: optimize scalar aggregate joins as semi joins [datafusion]
via GitHub
2026/05/01
[PR] perf: optimize scalar aggregate joins as semi joins [datafusion]
via GitHub
2026/05/01
Re: [I] Conversion from FileDecryptionProperties to ConfigFileDecryptionProperties should be fallible [datafusion]
via GitHub
2026/05/01
Re: [PR] fix: Make conversion from FileDecryptionProperties to ConfigFileDecryptionProperties fallible [datafusion]
via GitHub
2026/05/01
Re: [PR] fix: Make conversion from FileDecryptionProperties to ConfigFileDecryptionProperties fallible [datafusion]
via GitHub
2026/05/01
Re: [PR] fix: Make conversion from FileDecryptionProperties to ConfigFileDecryptionProperties fallible [datafusion]
via GitHub
2026/05/01
Re: [PR] fix(sort-shuffle): bound writer memory with per-task spill threshold [datafusion-ballista]
via GitHub
2026/05/01
Re: [PR] fix: Make conversion from FileDecryptionProperties to ConfigFileDecryptionProperties fallible [datafusion]
via GitHub
2026/05/01
Re: [PR] fix: Make conversion from FileDecryptionProperties to ConfigFileDecryptionProperties fallible [datafusion]
via GitHub
2026/05/01
[PR] chore(deps): bump cc from 1.2.60 to 1.2.61 in /native in the all-other-cargo-deps group [datafusion-comet]
via GitHub
2026/05/01
Re: [I] feat: make it user configurable to display plans as tree renderer [datafusion-ballista]
via GitHub
2026/05/01
Re: [I] Implement group join [datafusion]
via GitHub
2026/05/01
Re: [I] EXPLAIN ANALYZE row counts and elapsed_compute inflated by partition count [datafusion-ballista]
via GitHub
2026/05/01
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/01
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/01
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/01
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/01
Re: [PR] fix: add parentheses in binary expr human_display to reflect precedence [datafusion]
via GitHub
2026/05/01
Re: [PR] perf: simplify HashJoinExec dynamic filter, drop CASE routing [datafusion]
via GitHub
2026/05/01
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/01
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/01
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/01
Re: [PR] feat: Cache ballista clients on executor [datafusion-ballista]
via GitHub
2026/05/01
Re: [PR] fix(sort-shuffle): bound writer memory with per-task spill threshold [datafusion-ballista]
via GitHub
2026/05/01
Re: [PR] feat(aqe): Support sort-based shuffle writer in AQE [datafusion-ballista]
via GitHub
2026/05/01
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/01
Re: [PR] feat: Support RIGHT/FULL joins in NLJ memory-limited execution [datafusion]
via GitHub
2026/05/01
Re: [PR] feat: Support RIGHT/FULL joins in NLJ memory-limited execution [datafusion]
via GitHub
2026/05/01
Re: [PR] feat: Support RIGHT/FULL joins in NLJ memory-limited execution [datafusion]
via GitHub
2026/05/01
Re: [PR] feat: Support RIGHT/FULL joins in NLJ memory-limited execution [datafusion]
via GitHub
2026/05/01
Re: [PR] fix(sort-shuffle): bound writer memory with per-task spill threshold [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] feat(aqe): Support sort-based shuffle writer in AQE [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] EXPLAIN ANALYZE row counts and elapsed_compute inflated by partition count [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] Adaptive query planner does not support sort-based shuffle [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/30
[PR] Add wide-schema benchmark suite for measuring per-file metadata overhead [datafusion]
via GitHub
2026/04/30
[PR] feat(aqe): Support sort-based shuffle writer in AQE [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] fix(spark): align parse_url empty FILE path [datafusion]
via GitHub
2026/04/30
Re: [I] FIRST/LAST returns wrong result with `PartialMerge` [datafusion-comet]
via GitHub
2026/04/30
Re: [I] EXPLAIN ANALYZE row counts and elapsed_compute inflated by partition count [datafusion-ballista]
via GitHub
2026/04/30
[PR] fix(spark): align parse_url empty FILE path [datafusion]
via GitHub
2026/04/30
Re: [PR] PushdownFilter optimizations [datafusion]
via GitHub
2026/04/30
Re: [PR] chore(deps): bump ctor from 0.11.1 to 0.12.0 [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/30
Re: [I] `FilterExec` empty projection is changed to full projection after serde [datafusion]
via GitHub
2026/04/30
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/30
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
2026/04/30
Re: [I] Add benchmarks for queries against wide schema parquet files [datafusion]
via GitHub
2026/04/30
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
2026/04/30
Re: [I] Add tests for spill file sizes [datafusion]
via GitHub
2026/04/30
Re: [I] Add tests for spill file sizes [datafusion]
via GitHub
2026/04/30
[PR] fix: honor strictFloatingPoint in RangePartitioning [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] feat[expr-common]: support REE in coalesce [datafusion]
via GitHub
2026/04/30
Re: [PR] perf: optimize retract_batch for `median` and `percentile_cont` [datafusion]
via GitHub
2026/04/30
Re: [I] Regex (~) and LIKE on RunEndEncoded<Dictionary(...)> columns fail to plan [datafusion]
via GitHub
2026/04/30
Re: [PR] feat[expr-common]: support regex and LIKE coercion on REE and Dict value types that require an extra coercion step [datafusion]
via GitHub
2026/04/30
Re: [I] coalesce on RunEndEncoded column fails to plan [datafusion]
via GitHub
2026/04/30
Re: [PR] feat[expr-common]: support REE in coalesce [datafusion]
via GitHub
2026/04/30
Re: [PR] perf: optimize retract_batch for `median` and `percentile_cont` [datafusion]
via GitHub
2026/04/30
Re: [PR] feat[expr-common]: support regex and LIKE coercion on REE and Dict value types that require an extra coercion step [datafusion]
via GitHub
2026/04/30
Re: [PR] feat: Cache ballista clients on executor [datafusion-ballista]
via GitHub
2026/04/30
[PR] chore(deps): bump ctor from 0.11.1 to 0.12.0 [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] Release sqlparser-rs version `0.62.0` around 2026-04-01 [datafusion-sqlparser-rs]
via GitHub
2026/04/30
Re: [I] CLI (non criterion) interface for sql runner [datafusion]
via GitHub
2026/04/30
[PR] feat(sort-shuffle): accept Option<Partitioning> [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] chore: Update `spark_expressions_support.md` doc [datafusion-comet]
via GitHub
2026/04/30
Re: [I] tracking tpc-h sf=100 benchmark results [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] tracking tpc-h sf=100 benchmark results [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] feat(shuffle_bench): measure end-to-end write + read times [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] fix(sort-shuffle): bound writer memory with per-task spill threshold [datafusion-ballista]
via GitHub
2026/04/30
[I] Add benchmarks for queries against wide schema parquet files [datafusion]
via GitHub
2026/04/30
[PR] feat(executor): default memory pool to concurrent_tasks * 1 GiB [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] feat(sort-shuffle): disable sort-based shuffle by default [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/30
Re: [PR] feat(sort-shuffle): disable sort-based shuffle by default [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] fix: error on CREATE EXTERNAL TABLE with no files and no explicit schema [datafusion]
via GitHub
2026/04/30
[PR] fix(sort-shuffle): bound writer memory with per-task spill threshold [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] plan_to_sql drops window expressions for Window(Aggregate) plans without Projection [datafusion]
via GitHub
2026/04/30
Re: [PR] fix: fold Limit/Sort into outer SELECT when Projection claims Aggregate through them [datafusion]
via GitHub
2026/04/30
Re: [PR] feat: Native Delta Lake scan via delta-kernel-rs [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] feat: add Spark-compatible xxhash64 function [datafusion]
via GitHub
2026/04/30
[I] EXPLAIN ANALYZE row counts and elapsed_compute inflated by partition count [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] feat: make it user configurable to display plans as tree renderer [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] docs: add implement-comet-expression Claude skill [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] feat: add Spark-compatible xxhash64 function [datafusion]
via GitHub
2026/04/30
Re: [PR] chore: Update `spark_expressions_support.md` doc [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] chore: Update `spark_expressions_support.md` doc [datafusion-comet]
via GitHub
2026/04/30
[PR] feat: add Spark-compatible xxhash64 function [datafusion]
via GitHub
2026/04/30
Re: [PR] fix: error on CREATE EXTERNAL TABLE with no files and no explicit schema [datafusion]
via GitHub
2026/04/30
Re: [PR] feat: add config to gate converting Spark shuffle to Comet shuffle when child is non-Comet plan [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] feat: add config to gate converting Spark shuffle to Comet shuffle when child is non-Comet plan [datafusion-comet]
via GitHub
2026/04/30
[PR] feat(shuffle_bench): measure end-to-end write + read times [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] feat: add PySpark validation script for datafusion-spark .slt tests [datafusion]
via GitHub
2026/04/30
Re: [PR] feat: Add Spark-compatible `xxhash64` and `murmur3` hash functions [datafusion]
via GitHub
2026/04/30
Re: [PR] feat: Add Spark-compatible `xxhash64` and `murmur3` hash functions [datafusion]
via GitHub
2026/04/30
[PR] feat: format-agnostic serde hook for PhysicalExpr (Column + NotExpr) [datafusion]
via GitHub
2026/04/30
Re: [I] Split proto serialization to encapsulate private state [datafusion]
via GitHub
2026/04/30
[PR] fix: error on CREATE EXTERNAL TABLE with no files and no explicit schema [datafusion]
via GitHub
2026/04/30
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/04/30
Re: [I] tracking tpc-h sf=100 benchmark results [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] fix: fold Limit/Sort into outer SELECT when Projection claims Aggregate through them [datafusion]
via GitHub
2026/04/30
[PR] feat(sort-shuffle): disable sort-based shuffle by default [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] tracking tpc-h sf=100 benchmark results [datafusion-ballista]
via GitHub
2026/04/30
[I] tracking tpc-h sf=100 benchmark results [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] Replace true_count() and false_count() with has_true() and has_false() [datafusion]
via GitHub
2026/04/30
Re: [PR] Adding Use of arrow's has_true() / has_false() [datafusion]
via GitHub
2026/04/30
Re: [PR] Adding Use of arrow's has_true() / has_false() [datafusion]
via GitHub
2026/04/30
[PR] docs: add benchmarking guide for contributors [datafusion-ballista]
via GitHub
2026/04/30
[I] Update benchmark results in README [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] investigate slowdown in sort-based shuffle [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] investigate slowdown in sort-based shuffle [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] perf(sort-shuffle): fix performance regression caused by datafusion upgrade [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] perf(sort-shuffle): fix performance regression caused by datafusion upgrade [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] Split proto serialization to encapsulate private state [datafusion]
via GitHub
2026/04/30
Re: [PR] [DRAFT] Add support for lambda column capture [datafusion]
via GitHub
2026/04/30
Re: [PR] build: Enable Spark SQL tests for Spark 4.1.1 [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] test: support fallback chain in CometPlanStabilitySuite, dedupe existing goldens [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] proto: serialize and dedupe dynamic filters [datafusion]
via GitHub
2026/04/30
Re: [PR] proto: serialize and dedupe dynamic filters [datafusion]
via GitHub
2026/04/30
[PR] feat: add config to gate converting Spark shuffle to Comet shuffle when child is non-Comet plan [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] perf: simplify HashJoinExec dynamic filter, drop CASE routing [datafusion]
via GitHub
2026/04/30
Re: [PR] Fix no-op Transformed flags [datafusion]
via GitHub
2026/04/30
Re: [I] investigate slowdown in sort-based shuffle [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] perf: simplify HashJoinExec dynamic filter, drop CASE routing [datafusion]
via GitHub
2026/04/30
[I] investigate slowdown in sort-based shuffle [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] [DRAFT] Add support for lambda column capture [datafusion]
via GitHub
2026/04/30
Re: [I] feat: make it user configurable to display plans as tree renderer [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] feat: make it user configurable to display plans as tree renderer [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] feat: make it user configurable to display plans as tree renderer [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] Skip unnecessary plan rebuild in adjust_input_keys_ordering for non-join plans [datafusion]
via GitHub
2026/04/30
Re: [PR] chore: Update `spark_expressions_support.md` doc [datafusion-comet]
via GitHub
2026/04/30
Re: [I] feat: make it user configurable to display plans as tree renderer [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] Fix no-op Transformed flags [datafusion]
via GitHub
2026/04/30
Re: [I] feat: make it user configurable to display plans as tree renderer [datafusion-ballista]
via GitHub
2026/04/30
Re: [PR] Adding Use of arrow's has_true() / has_false() [datafusion]
via GitHub
2026/04/30
Re: [PR] Fix no-op Transformed flags [datafusion]
via GitHub
2026/04/30
[PR] Fix no-op Transformed flags [datafusion]
via GitHub
2026/04/30
Re: [PR] Adding Use of arrow's has_true() / has_false() [datafusion]
via GitHub
2026/04/30
Re: [PR] fix(rest): remove unwrap and return 404 if executor does not exist [datafusion-ballista]
via GitHub
2026/04/30
[PR] Update `spark_expressions_support.md` doc [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] fix(rest): remove unwrap and return 404 if executor does not exist [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] EnforceDistribution: adjust_input_keys_ordering returns Transformed::yes unconditionally for non-join plans [datafusion]
via GitHub
2026/04/30
Re: [PR] Skip unnecessary plan rebuild in adjust_input_keys_ordering for non-join plans [datafusion]
via GitHub
2026/04/30
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] fix(rest): remove unwrap and return 404 if executor does not exist [datafusion-ballista]
via GitHub
2026/04/30
Re: [I] feat: make it user configurable to display plans as tree renderer [datafusion-ballista]
via GitHub
2026/04/30
[PR] feat: improve shuffle size estimation [experimental!] [datafusion-comet]
via GitHub
2026/04/30
[PR] fix: broadcast exchange bypasses AQE partition coalescing [WIP] [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] Adding Use of arrow's has_true() / has_false() [datafusion]
via GitHub
2026/04/30
Re: [PR] test(sqllogictest): stabilize parquet output_rows_skew with WITH ORDER [datafusion]
via GitHub
2026/04/30
Re: [PR] Adding Use of arrow's has_true() / has_false() [datafusion]
via GitHub
2026/04/30
Re: [PR] test(sqllogictest): stabilize parquet output_rows_skew with WITH ORDER [datafusion]
via GitHub
2026/04/30
Re: [PR] test(sqllogictest): stabilize parquet output_rows_skew with WITH ORDER [datafusion]
via GitHub
2026/04/30
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
2026/04/30
Re: [PR] Fix fully matched row groups with null counts [datafusion]
via GitHub
2026/04/30
Re: [PR] Fix fully matched row groups with null counts [datafusion]
via GitHub
2026/04/30
Re: [PR] PushdownFilter optimizations [datafusion]
via GitHub
2026/04/30
[PR] fix(rest): remove unwrap and return 404 if executor does not exist [datafusion-ballista]
via GitHub
2026/04/30
[PR] docs: explain Java vs Rust regexp engine differences in compatibility guide [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] fix: track join_arrays memory in reservation after SMJ spill [datafusion]
via GitHub
2026/04/30
Re: [I] Release sqlparser-rs version `0.62.0` around 2026-04-01 [datafusion-sqlparser-rs]
via GitHub
2026/04/30
Re: [I] Create new Comet logo that is consistent with other DataFusion Projects [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] Add StatisticsContext parameter to partition_statistics [datafusion]
via GitHub
2026/04/30
Re: [I] Create new Comet logo that is consistent with other DataFusion Projects [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] add any_match higher-order function [datafusion]
via GitHub
2026/04/30
Re: [I] Create new Comet logo that is consistent with other DataFusion Projects [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] Add ClickBench URL pushdown benchmark [datafusion]
via GitHub
2026/04/30
Re: [PR] chore: fix `datafusion-spark` substring [datafusion]
via GitHub
2026/04/30
Re: [PR] Add ClickBench URL pushdown benchmark [datafusion]
via GitHub
2026/04/30
Re: [I] Create new Comet logo that is consistent with other DataFusion Projects [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] proto: serialize and dedupe dynamic filters v2 [datafusion]
via GitHub
2026/04/30
Re: [I] Some `datafusion-spark` expressions are missing the physical implementation [datafusion]
via GitHub
2026/04/30
Re: [PR] chore: fix `datafusion-spark` substring [datafusion]
via GitHub
2026/04/30
Re: [I] Create new Comet logo that is consistent with other DataFusion Projects [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] add any_match higher-order function [datafusion]
via GitHub
2026/04/30
Re: [PR] PushdownFilter optimizations [datafusion]
via GitHub
2026/04/30
Re: [I] Create new Comet logo that is consistent with other DataFusion Projects [datafusion-comet]
via GitHub
2026/04/30
Re: [PR] PushdownFilter optimizations [datafusion]
via GitHub
2026/04/30
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/30
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/30
[PR] chore: use Datafusion `substring` [datafusion-comet]
via GitHub
2026/04/30
Re: [D] DISCUSSION: Seattle DataFusion Meetup (April 23, 2026) [datafusion]
via GitHub
Earlier messages
Later messages