github
Thread
Date
Earlier messages
Later messages
Messages by Date
2026/06/27
Re: [PR] feat: support nullary aggregate UDFs [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: support nullary aggregate UDFs [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: support nullary aggregate UDFs [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: support nullary aggregate UDFs [datafusion]
via GitHub
2026/06/27
[I] Execute uncorrelated scalar subqueries first and substitute the value, instead of rewriting to a join [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] chore(deps): upgrade to DataFusion 54 [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] Improve constant folding for associative operations [datafusion]
via GitHub
2026/06/27
Re: [PR] Improve constant folding for associative operations [datafusion]
via GitHub
2026/06/27
Re: [PR] Improve constant folding for associative operations [datafusion]
via GitHub
2026/06/27
Re: [I] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/27
Re: [I] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/27
Re: [PR] Coalesce single-column sort runs during spill [datafusion]
via GitHub
2026/06/27
Re: [PR] chore(deps): bump itertools from 0.14.0 to 0.15.0 [datafusion]
via GitHub
2026/06/27
[PR] Coalesce single-column sort runs during spill [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: Support Decimal type in `approx_distinct` [datafusion]
via GitHub
2026/06/27
Re: [PR] fix(spark): return error from ELT coerce_types when fewer than 2 args [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: add array_avg scalar function [datafusion]
via GitHub
2026/06/27
Re: [PR] feat(functions-aggregate): support sum(interval) [datafusion]
via GitHub
2026/06/27
Re: [PR] chore(deps): bump pbjson-types from 0.8.0 to 0.9.0 in the proto group [datafusion]
via GitHub
2026/06/27
Re: [PR] chore(deps): bump pbjson-types from 0.8.0 to 0.9.0 in the proto group [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: Support Decimal type in `approx_distinct` [datafusion]
via GitHub
2026/06/27
Re: [PR] fix(spark): return error from ELT coerce_types when fewer than 2 args [datafusion]
via GitHub
2026/06/27
Re: [PR] feat(functions-aggregate): support sum(interval) [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: add array_avg scalar function [datafusion]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: cancel running stages and tasks on job failure or cancellation [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] feat: cancel running stages and tasks on job failure or cancellation [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
Re: [PR] feat(parquet): row-group morselization for sibling FileStream stealing [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: cancel running stages and tasks on job failure or cancellation [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
Re: [PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
[PR] perf: sort multi-column runs via row format and coalesce them [datafusion]
via GitHub
2026/06/27
[I] DataFusion 54: uncorrelated scalar subqueries (q11/q15/q22) fail — ScalarSubqueryExpr cannot be deserialized in a split stage plan [datafusion-ballista]
via GitHub
2026/06/27
Re: [I] DataFusion 54: DataSourceExec shared work-queue makes each Ballista scan task read the whole table (wrong results + ~Nx slowdown) [datafusion-ballista]
via GitHub
2026/06/27
[I] Scheduler marks executor dead on a deterministic task launch/decode failure, hanging the job [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] Support co-partitioned range inner equi joins [datafusion]
via GitHub
2026/06/27
[PR] build(deps): bump log from 0.4.32 to 0.4.33 [datafusion-python]
via GitHub
2026/06/27
Re: [PR] Support co-partitioned range inner equi joins [datafusion]
via GitHub
2026/06/27
[PR] build(deps): bump uuid from 1.23.3 to 1.23.4 [datafusion-python]
via GitHub
2026/06/27
Re: [PR] Support co-partitioned range inner equi joins [datafusion]
via GitHub
2026/06/27
Re: [PR] Support co-partitioned range inner equi joins [datafusion]
via GitHub
2026/06/27
[PR] Fix non-portable `find -iname` in verify-release-candidate.sh [datafusion-python]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [I] Projection pushdown into file scan duplicates non-deterministic functions (regression in 52.0.0) [datafusion]
via GitHub
2026/06/27
Re: [PR] feat(parquet): row-group morselization for sibling FileStream stealing [datafusion]
via GitHub
2026/06/27
Re: [PR] Improve constant folding for associative operations [datafusion]
via GitHub
2026/06/27
Re: [I] DataFusion 54: distributed tasks queued after the first wave run ~8x slower (parquet decode) [datafusion-ballista]
via GitHub
2026/06/27
Re: [D] DISCUSSION: DataFusion Meetup in Asia and China 2026 [datafusion]
via GitHub
2026/06/27
[I] DataFusion 54: distributed tasks queued after the first wave run ~8x slower (parquet decode) [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] fix: honor client datafusion.* session config overrides in scheduler planning [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] chore(deps): upgrade to DataFusion 54 [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] fix: honor client datafusion.* session config overrides in scheduler planning [datafusion-ballista]
via GitHub
2026/06/27
Re: [I] datafusion.* session config overrides from the client are ignored by scheduler-side (AQE) planning [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] fix: drop a deep `BinaryExpr` chain iteratively to avoid a stack overflow [datafusion]
via GitHub
2026/06/27
[I] Projection pushdown into file scan duplicates non-deterministic functions (regression in 52.0.0) [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: use aligned slice access during bulk append in SparkUnsafeArray [datafusion-comet]
via GitHub
2026/06/27
Re: [PR] chore(deps): upgrade to DataFusion 54 [datafusion-ballista]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] Complete migration of remaining string UDFs to fallible string builder APIs [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] Support co-partitioned range inner equi joins [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/27
[PR] Allow visits from both visitor and VisitorMut at the same time without using trait disambiguation [datafusion-sqlparser-rs]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] fix: Preserve integer values in round() for large Int64 and UInt64 inputs [datafusion]
via GitHub
2026/06/27
Re: [PR] Add FixedSizeList support for recursive struct schema adaptation [datafusion]
via GitHub
2026/06/27
[I] Panic in DataFusion 54.0.0 when ordering Parquet scan by computed projection alias [datafusion]
via GitHub
2026/06/27
Re: [PR] feat: cap spill merge fan-in [datafusion]
via GitHub
2026/06/27
Re: [I] [Bug] make_timestamp does not throw under spark.sql.ansi.enabled=true [datafusion-comet]
via GitHub
2026/06/27
[PR] Complete migration of remaining string UDFs to fallible string builder APIs [datafusion]
via GitHub
2026/06/27
Re: [I] Improve internal worker parallelism support [datafusion]
via GitHub
2026/06/27
Re: [PR] chore(datasource): remove deprecated add_row_stats (Closes #23080 - partial) [datafusion]
via GitHub
2026/06/27
Re: [PR] Add regression coverage for quoted dotted column aliases [datafusion]
via GitHub
2026/06/27
Re: [D] DISCUSSION: DataFusion Meetup in Asia and China 2026 [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [D] DISCUSSION: Boston Datafusion Meetup September 2026 [datafusion]
via GitHub
2026/06/27
[PR] Parquet row filter struct access tree [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] Use `concat_elements_dyn` from `arrow-rs` [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] fix: Preserve integer values in round() for large Int64 and UInt64 inputs [datafusion]
via GitHub
2026/06/27
Re: [I] Add AQE to DataFusion [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
[I] [Sort Pushdown · Future B] Page-level dynamic prune at RG boundary — refresh PagePruningPredicate using runtime DynamicFilter (follow-up #22450) [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/27
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: Support Decimal type in `approx_distinct` [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [D] DISCUSSION: DataFusion Meetup in Asia and China 2026 [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/26
Re: [I] Improve stage encoding size by removing unrelated partition location [datafusion-ballista]
via GitHub
2026/06/26
Re: [I] Comet aggregation task crashes with `offset overflow` [datafusion-comet]
via GitHub
2026/06/26
[PR] Improve constant folding for associative operations [datafusion]
via GitHub
2026/06/26
Re: [PR] issue: 2607- added support for MapFromEntries [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] Feat/regexp extract [datafusion]
via GitHub
2026/06/26
Re: [PR] fix: EnforceDistribution optimizer preserves fetch (LIMIT) from distribution-changing operators [datafusion]
via GitHub
2026/06/26
Re: [PR] docs: add TreeNode API examples for walking and rewriting LogicalPlans [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: initialize TopK dynamic filter threshold from parquet statistics [datafusion]
via GitHub
2026/06/26
Re: [PR] Add DecomposeAggregate optimizer to rewrite AVG as SUM/COUNT [datafusion]
via GitHub
2026/06/26
Re: [PR] feat(parquet): row-group morselization for sibling FileStream stealing [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: Adds IntervalJoinExec for point-in-interval range joins [datafusion]
via GitHub
2026/06/26
Re: [I] GROUP BY on a Dictionary column fails at runtime when distinct group count exceeds the key type's capacity [datafusion]
via GitHub
2026/06/26
Re: [I] GROUP BY on a Dictionary column fails at runtime when distinct group count exceeds the key type's capacity [datafusion]
via GitHub
2026/06/26
Re: [I] Improve python/Cargo.lock drifts - CI deps [datafusion-ballista]
via GitHub
2026/06/26
Re: [PR] feat: route Unsupported through codegen dispatch for opt-in serdes [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] feat: route Unsupported through codegen dispatch for opt-in serdes [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] feat: route Unsupported through codegen dispatch for opt-in serdes [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] Fuse multiple scalar-aggregate subqueries over the same source into a single scan [datafusion]
via GitHub
2026/06/26
Re: [PR] chore: surface DataFusion 54 PruningMetrics and Ratio in CometNativeScan metrics [datafusion-comet]
via GitHub
2026/06/26
Re: [I] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
[PR] Fuse multiple scalar-aggregate subqueries over the same source into a single scan [datafusion]
via GitHub
2026/06/26
Re: [I] Fuse multiple scalar-aggregate subqueries over the same source into a single scan [datafusion]
via GitHub
2026/06/26
[I] Fuse multiple scalar-aggregate subqueries over the same source into a single scan [datafusion]
via GitHub
2026/06/26
Re: [I] bug: SPARK-50258: Fix output column order changed issue after AQE optimization fails on 4.0.2 in CI [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] chore: fix `4.0.2` diff to handle `CometWindowsExec` properly [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] chore: fix `4.0.2` diff to handle `CometWindowsExec` properly [datafusion-comet]
via GitHub
2026/06/26
[PR] chore: fix `4.0.2` diff to handle `CometWindowsExec` properly [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] chore(datasource): remove deprecated add_row_stats (Closes #23080 - partial) [datafusion]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_dyn` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
[PR] Fix syntax example of `array_transform` function [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: thread SubqueryContext explicitly through physical planning [datafusion]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_dyn` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
[I] bug: SPARK-50258: Fix output column order changed issue after AQE optimization fails on 4.0.2 in CI [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] feat: cancel running stages and tasks on job failure or cancellation [datafusion-ballista]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_dyn` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: cancel running stages and tasks on job failure or cancellation [datafusion-ballista]
via GitHub
2026/06/26
Re: [PR] Support co-partitioned range inner equi joins [datafusion]
via GitHub
2026/06/26
Re: [PR] Align metadata propagation through Physical and Logical casts [datafusion]
via GitHub
2026/06/26
Re: [PR] fix(`EnsureRequirements`): remap sort requirement through `ProjectionExec` on pushdown [datafusion]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_dyn` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_dyn` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
Re: [PR] fix: drop a deep `BinaryExpr` chain iteratively to avoid a stack overflow [datafusion]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_dyn` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
Re: [PR] Fix extension type metadata propagation through casts [datafusion]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_dyn` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_binary_view_array` and `concat_elements_string_view_array` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
Re: [PR] Use `concat_elements_binary_view_array` and `concat_elements_string_view_array` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
[PR] Use `concat_elements_binary_view_array` and `concat_elements_string_view_array` from `arrow-rs` [datafusion]
via GitHub
2026/06/26
Re: [PR] Improve performance of binary and string concatenation operator [datafusion]
via GitHub
2026/06/26
[I] Replace custom ByteView concat kernel with the implementation from arrow-rs [datafusion]
via GitHub
2026/06/26
Re: [PR] Improve performance of binary and string concatenation operator [datafusion]
via GitHub
2026/06/26
Re: [PR] Improve performance of binary and string concatenation operator [datafusion]
via GitHub
2026/06/26
Re: [PR] Improve performance of binary and string concatenation operator [datafusion]
via GitHub
2026/06/26
Re: [PR] Improve performance of binary and string concatenation operator [datafusion]
via GitHub
2026/06/26
Re: [PR] Add row-number late materialization for TopK [datafusion]
via GitHub
2026/06/26
Re: [PR] Split Parquet tail work into morsels [datafusion]
via GitHub
2026/06/26
Re: [I] HashAggregate output regression after #23055: repeated `EmitTo::First` maintains unused GroupValues lookup state [datafusion]
via GitHub
2026/06/26
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: cancel running stages and tasks on job failure or cancellation [datafusion-ballista]
via GitHub
2026/06/26
Re: [I] Add AQE to DataFusion [datafusion]
via GitHub
2026/06/26
[PR] chore(deps): upgrade to DataFusion 54 [datafusion-ballista]
via GitHub
2026/06/26
Re: [I] Add AQE to DataFusion [datafusion]
via GitHub
2026/06/26
Re: [I] Comet produces bloated results in comparison with Spark [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] doc: More comments on GroupedHashAggregateStream refactor [datafusion]
via GitHub
2026/06/26
Re: [I] Comet produces bloated results in comparison with Spark [datafusion-comet]
via GitHub
2026/06/26
Re: [I] Comet aggregation task crashes with `offset overflow` [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] feat: cancel running stages and tasks on job failure or cancellation [datafusion-ballista]
via GitHub
2026/06/26
Re: [PR] feat: cancel running stages and tasks on job failure or cancellation [datafusion-ballista]
via GitHub
2026/06/26
Re: [I] Comet aggregation task crashes with `offset overflow` [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/26
Re: [PR] fix: Preserve integer values in round() for large Int64 and UInt64 inputs [datafusion]
via GitHub
2026/06/26
Re: [PR] fix: disable migration aggregate by default [datafusion]
via GitHub
2026/06/26
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/26
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/26
Re: [PR] Improve performance of binary and string concatenation operator [datafusion]
via GitHub
2026/06/26
Re: [PR] deps: revert object_store to 0.13.2 [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] IN LIST: unify bitmap filter implementations [datafusion]
via GitHub
2026/06/26
Re: [PR] chore(deps): upgrade to DataFusion 54 [datafusion-ballista]
via GitHub
2026/06/26
Re: [PR] chore(deps): upgrade to DataFusion 54 [datafusion-ballista]
via GitHub
2026/06/26
Re: [PR] IN LIST: unify bitmap filter implementations [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: Add Native Support for In-Memory Cache [datafusion-comet]
via GitHub
2026/06/26
[I] Broadcast lowering still hash-repartitions the build and probe sides of a CollectLeft join [datafusion-ballista]
via GitHub
2026/06/26
Re: [PR] [9195] optimize group value bytes [datafusion]
via GitHub
2026/06/26
Re: [PR] Improve changelog generator and split CHANGELOG.md into per-version files [datafusion-ballista]
via GitHub
Earlier messages
Later messages