github
Messages by Thread
[PR] docs: remove unused status legend entries from expression reference [datafusion-comet]
via GitHub
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
[PR] fix: Reject out-of-range ArrayMap probe keys on 32-bit targets [datafusion]
via GitHub
[I] `ArrayMap` maps probe keys to buckets incorrectly on 32-bit hosts [datafusion]
via GitHub
Re: [I] `ArrayMap` maps probe keys to buckets incorrectly on 32-bit hosts [datafusion]
via GitHub
[PR] Do not consume statement terminator in `CREATE USER` option list [datafusion-sqlparser-rs]
via GitHub
Re: [I] Optimize inner joins to semi joins when possible [datafusion]
via GitHub
[PR] Revise example usage section headings [datafusion]
via GitHub
Re: [I] Add ability to process CSV files containing invalid UTF-8 characters [datafusion]
via GitHub
Re: [I] Add ability to process CSV files containing invalid UTF-8 characters [datafusion]
via GitHub
Re: [I] Add ability to process CSV files containing invalid UTF-8 characters [datafusion]
via GitHub
Re: [PR] test: cover push scheduling in client integration tests [datafusion-ballista]
via GitHub
[PR] variant: Integrate datafusion-variant into Datafusion [datafusion]
via GitHub
Re: [PR] variant: Integrate datafusion-variant into Datafusion [datafusion]
via GitHub
[PR] Added support for unpivot in Redshift with expression and bracketsless [datafusion-sqlparser-rs]
via GitHub
[PR] Parse `ALTER USER` as a synonym for `ALTER ROLE` [datafusion-sqlparser-rs]
via GitHub
[PR] refactor: use raw view access in do_append_val_inner and consolidate duplicated logic [datafusion]
via GitHub
[PR] Use cast preimages for cast predicate rewrites [datafusion]
via GitHub
[PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
[PR] Align DataFrame::fill_null column argument with fill_nan [datafusion]
via GitHub
Re: [PR] Align DataFrame::fill_null column argument with fill_nan [datafusion]
via GitHub
[PR] Fix optimize_projections failure with struct-field join keys [datafusion]
via GitHub
Re: [PR] Fix optimize_projections failure with struct-field join keys [datafusion]
via GitHub
[PR] fix: ProjectionPushdown internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
[I] ProjectionPushdown panics with assertion error on mark join (NOT EXISTS with non-equi correlation) [datafusion]
via GitHub
Re: [I] `ProjectionPushdown` internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
[PR] CI: Add cargo audit security workflow [datafusion-sqlparser-rs]
via GitHub
[PR] feat: Add comprehensive metrics tracking to CSV reader [datafusion]
via GitHub
[I] `AdaptivePlanner` uses `job_name` to identify jobs rather than `JobId` [datafusion-ballista]
via GitHub
Re: [I] [DISCUSSION] Adopt datafusion-functions-json and datafusion-variant into core repo [datafusion]
via GitHub
Re: [I] [DISCUSSION] Adopt datafusion-functions-json and datafusion-variant into core repo [datafusion]
via GitHub
Re: [I] [DISCUSSION] Adopt datafusion-functions-json and datafusion-variant into core repo [datafusion]
via GitHub
Re: [I] [DISCUSSION] Adopt datafusion-functions-json and datafusion-variant into core repo [datafusion]
via GitHub
[PR] refactor(hash-aggr): Forward port the partial aggregation skip optimization to the new hash aggregation impl [datafusion]
via GitHub
Re: [PR] refactor(hash-aggr): Migrate the partial aggregation skip optimization to the new hash aggregation impl [datafusion]
via GitHub
Re: [PR] refactor(hash-aggr): Migrate the partial aggregation skip optimization to the new hash aggregation impl [datafusion]
via GitHub
Re: [PR] refactor(hash-aggr): Migrate the partial aggregation skip optimization to the new hash aggregation impl [datafusion]
via GitHub
[PR] chore(deps): bump taiki-e/install-action from 2.81.9 to 2.81.10 [datafusion-ballista]
via GitHub
Re: [PR] chore(deps): bump taiki-e/install-action from 2.81.9 to 2.81.10 [datafusion-ballista]
via GitHub
[I] Replace manual memory tracking with Arrow claim()/with_pool() integration [datafusion]
via GitHub
[PR] fix: TRY_CAST returns NULL for timestamp/date overflow [datafusion]
via GitHub
Re: [PR] fix: TRY_CAST returns NULL for timestamp/date overflow [datafusion]
via GitHub
[I] TRY_CAST returns an error instead of NULL for timestamp/date overflow [datafusion]
via GitHub
Re: [I] TRY_CAST returns an error instead of NULL for timestamp/date overflow [datafusion]
via GitHub
Re: [PR] fix: Correct array_contains behavior for Spark-style null semantics [datafusion-comet]
via GitHub
Re: [PR] perf: do not build parquet pruning predicates if no page index [datafusion]
via GitHub
Re: [I] Avoid FFI import/export when passing batches between two native plans [datafusion-comet]
via GitHub
Re: [I] [Bug] CAST(complex AS STRING) does not honour spark.sql.legacy.castComplexTypesToString.enabled [datafusion-comet]
via GitHub
Re: [I] [Bug] CAST(complex AS STRING) does not honour spark.sql.legacy.castComplexTypesToString.enabled [datafusion-comet]
via GitHub
[I] `optimize_projections` fails with "No field named ..." when join keys contain `get_field` (ExtractLeafExpressions) [datafusion]
via GitHub
Re: [I] `optimize_projections` fails with "No field named ..." when join keys contain `get_field` (ExtractLeafExpressions) [datafusion]
via GitHub
Re: [I] `optimize_projections` fails with "No field named ..." when join keys contain `get_field` (ExtractLeafExpressions) [datafusion]
via GitHub
[PR] chore: Skip Code CI for non code changes [datafusion]
via GitHub
Re: [PR] chore: Skip Code CI for non code changes [datafusion]
via GitHub
[PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
[PR] refactor: make scalar distance u64 and overflow aware [datafusion]
via GitHub
Re: [PR] refactor: make scalar distance u64 and overflow aware [datafusion]
via GitHub
[PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
Re: [PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
Re: [PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
Re: [PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
Re: [PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
Re: [PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
Re: [PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
Re: [PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
Re: [PR] chore(docs): Update docs website to bring inline with Datafusion style [datafusion-ballista]
via GitHub
[I] Update the documentation website [datafusion-ballista]
via GitHub
Re: [I] Update the documentation website [datafusion-ballista]
via GitHub
Re: [I] [Bug] array_max and array_min disagree with Spark on NaN ordering [datafusion-comet]
via GitHub
[I] Physical planning: cast low-medium cardinality columns to dictionary arrays before aggregation [datafusion]
via GitHub
Re: [I] Physical planning: cast low-medium cardinality columns to dictionary arrays before aggregation [datafusion]
via GitHub
[PR] Feat/spark levenshtein [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
Re: [PR] feat(spark): add levenshtein with optional threshold support [datafusion]
via GitHub
[PR] chore(deps-dev): bump shell-quote from 1.8.3 to 1.8.4 in /datafusion/wasmtest/datafusion-wasm-app [datafusion-sandbox]
via GitHub
[PR] fix: [DO NOT MERGE] Iceberg scan duplicates rows when splitting a single-row-group Parquet file into multiple byte-range tasks [datafusion-comet]
via GitHub
Re: [I] `nanvl` incorrectly rounds on integer input [datafusion]
via GitHub
[I] June 2026 DataFusion ASF Board Report [datafusion]
via GitHub
Re: [I] June 2026 DataFusion ASF Board Report [datafusion]
via GitHub
Re: [I] June 2026 DataFusion ASF Board Report [datafusion]
via GitHub
Re: [I] June 2026 DataFusion ASF Board Report [datafusion]
via GitHub
Re: [I] June 2026 DataFusion ASF Board Report [datafusion]
via GitHub
[I] September 2026 DataFusion ASF Board Report [datafusion]
via GitHub
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
[PR] [Draft][#21878] extensive test for multi-dictionary column group bys [datafusion]
via GitHub
Re: [PR] [#21878] extensive test for multi-dictionary column group bys [datafusion]
via GitHub
Re: [I] Implement Spark `weekday` [datafusion]
via GitHub
Re: [I] Optimize ByteViewGroupValueBuilder vectorized_append [datafusion]
via GitHub
[PR] fix: coerce SIMILAR TO operands to a common string type to avoid 'failed to downcast array' panic [datafusion]
via GitHub
Re: [PR] fix: coerce SIMILAR TO operands to a common string type to avoid 'failed to downcast array' panic [datafusion]
via GitHub
Re: [PR] fix: coerce SIMILAR TO operands to a common string type to avoid 'failed to downcast array' panic [datafusion]
via GitHub
Re: [PR] fix: coerce SIMILAR TO operands to a common string type to avoid 'failed to downcast array' panic [datafusion]
via GitHub
Re: [PR] fix: coerce SIMILAR TO operands to a common string type to avoid 'failed to downcast array' panic [datafusion]
via GitHub
Re: [PR] feat: add native implementations of `regexp_extract` and `regexp_extract_all` [datafusion-comet]
via GitHub
Re: [PR] feat: add native implementations of `regexp_extract` and `regexp_extract_all` [datafusion-comet]
via GitHub
[I] SIMILAR TO panics ('failed to downcast array') when operand types differ (e.g. NULL pattern, Utf8View vs Utf8) [datafusion]
via GitHub
[PR] perf: Extend WindowTopN to support RANK [datafusion]
via GitHub
Re: [PR] perf: Extend WindowTopN to support RANK [datafusion]
via GitHub
Re: [I] AbstractMethodError: CometBroadcastExchangeExec missing sparkContext() from BroadcastExchangeLike [datafusion-comet]
via GitHub
Re: [I] CometNativeException: "arrays of different length" when using to_date on Iceberg Timestamp column [datafusion-comet]
via GitHub
Re: [I] CometNativeException: "arrays of different length" when using to_date on Iceberg Timestamp column [datafusion-comet]
via GitHub
[PR] docs: link to 2026 Q3-Q4 roadmap discussion [datafusion]
via GitHub
Re: [PR] docs: link to 2026 Q3-Q4 roadmap discussion [datafusion]
via GitHub
Re: [PR] docs: link to 2026 Q3-Q4 roadmap discussion [datafusion]
via GitHub
Re: [PR] [Tracking] feat(contrib): Native Delta Lake scan via delta-kernel-rs (Iceberg-style contrib) [datafusion-comet]
via GitHub
[I] Adaptive predicate evaluation [datafusion]
via GitHub
Re: [I] [Proposal] Streaming execution support roadmap [datafusion]
via GitHub
Re: [I] [Proposal] Streaming execution support roadmap [datafusion]
via GitHub
Re: [D] Q2 2026 roadmap planning [datafusion]
via GitHub
Re: [D] Q2 2026 roadmap planning [datafusion]
via GitHub
[I] [DISCUSSION] 2026 Q3-Q4 Roadmap Discussion [datafusion]
via GitHub
Re: [I] [DISCUSSION] 2026 Q3-Q4 Roadmap Discussion [datafusion]
via GitHub
Re: [I] [DISCUSSION] 2026 Q3-Q4 Roadmap Discussion [datafusion]
via GitHub
Re: [I] [DISCUSSION] 2026 Q3-Q4 Roadmap Discussion [datafusion]
via GitHub
Re: [I] [DISCUSSION] 2026 Q3-Q4 Roadmap Discussion [datafusion]
via GitHub
Re: [I] [DISCUSSION] 2026 Q3-Q4 Roadmap Discussion [datafusion]
via GitHub
[PR] feat: route structured-text functions through codegen dispatcher [datafusion-comet]
via GitHub
Re: [PR] feat: route structured-text functions through codegen dispatcher [datafusion-comet]
via GitHub
Re: [PR] feat: route structured-text functions through codegen dispatcher [datafusion-comet]
via GitHub
Re: [PR] feat: route structured-text functions through codegen dispatcher [datafusion-comet]
via GitHub
[I] Route structured-text functions (CSV/JSON/XPath/XML) through the codegen dispatcher [datafusion-comet]
via GitHub
Re: [I] Route structured-text functions (CSV/JSON/XPath/XML) through the codegen dispatcher [datafusion-comet]
via GitHub
[I] RIGHT/FULL/NATURAL JOIN ... USING(k) does not coalesce the join key (returns NULL for right-only rows) [datafusion]
via GitHub
[I] Optimize mark joins using existence probes [datafusion]
via GitHub
Re: [I] Optimize mark joins using existence probes [datafusion]
via GitHub
[I] Add Poll::Pending spill stream test coverage for SMJ async spill paths [datafusion]
via GitHub
[PR] feat: route higher-order functions through codegen dispatcher [datafusion-comet]
via GitHub
Re: [PR] feat: route higher-order functions through codegen dispatcher [experimental / WIP] [datafusion-comet]
via GitHub
Re: [PR] feat: route higher-order functions through codegen dispatcher [experimental / WIP] [datafusion-comet]
via GitHub
[I] Route array/map higher-order (lambda) functions through the codegen dispatcher [datafusion-comet]
via GitHub
[PR] fix(sort): record output_batches, output_bytes and end_time for when not using merge sort [datafusion]
via GitHub
Re: [PR] feat: add array_exists with lambda support via CometUDF framework [datafusion-comet]
via GitHub
Re: [PR] feat: add array_exists with lambda support via CometUDF framework [datafusion-comet]
via GitHub
Re: [PR] feat: add initial support for `array_exists` with lambda expression support [datafusion-comet]
via GitHub
Re: [PR] feat: add initial support for `array_exists` with lambda expression support [datafusion-comet]
via GitHub
[I] Optimize semi, anti-joins to avoid dynamic dispatch for comparison [datafusion]
via GitHub
Re: [I] Optimize semi, anti-joins to avoid dynamic dispatch for comparison [datafusion]
via GitHub
Re: [I] Optimize semi, anti-joins to avoid dynamic dispatch for comparison [datafusion]
via GitHub
Re: [I] Optimize semi, anti-joins to avoid dynamic dispatch for comparison [datafusion]
via GitHub
[PR] Feat/table provider ffi [datafusion-java]
via GitHub
[I] Test dual-impl (native + codegen-dispatch) expressions consistently across the full routing matrix [datafusion-comet]
via GitHub
[PR] [Migration] Branch 54 test [datafusion]
via GitHub
Re: [PR] [Migration] Branch 54 test [datafusion]
via GitHub
Re: [PR] [Migration] Branch 54 test [datafusion]
via GitHub
[I] Hash join should omit NULLs from build side under `NullEqualsNothing` [datafusion]
via GitHub
Re: [I] Hash join should omit NULLs from build side under `NullEqualsNothing` [datafusion]
via GitHub
[I] TopK early-exit doesn't fire when a partition's local heap is empty but the shared dynamic filter is already tight [datafusion]
via GitHub
Re: [I] Propagation of metadata through casts can strips extension type from the destination field and can result in invalid extension types [datafusion]
via GitHub
Re: [I] Propagation of metadata through casts can strips extension type from the destination field and can result in invalid extension types [datafusion]
via GitHub
[PR] fix: RIGHT/FULL/NATURAL JOIN USING does not coalesce the join key (returns NULL for right-only rows) [datafusion]
via GitHub
[I] breaking change detector in CI report breaking changes when HEAD is not aligned with base branch [datafusion]
via GitHub
[PR] chore: use internal_err macro in `assert_eq_or_internal_err` for backtrace [datafusion]
via GitHub
Re: [PR] fix: add backtrace for `assert_*_or_internal_err` helpers [datafusion]
via GitHub
Re: [PR] fix: add backtrace for `assert_*_or_internal_err` helpers [datafusion]
via GitHub
Re: [PR] fix: add backtrace for `assert_*_or_internal_err` helpers [datafusion]
via GitHub
Re: [PR] fix: add backtrace for `assert_*_or_internal_err` helpers [datafusion]
via GitHub
Re: [PR] fix: add backtrace for `assert_*_or_internal_err` helpers [datafusion]
via GitHub
[PR] docs: move TopK user defined operator example into extending-operators guide [datafusion]
via GitHub
Re: [PR] docs: move TopK user defined operator example into extending-operators guide [datafusion]
via GitHub
Re: [PR] docs: move TopK user defined operator example into extending-operators guide [datafusion]
via GitHub
Re: [PR] docs: move TopK user defined operator example into extending-operators guide [datafusion]
via GitHub
Re: [PR] IGNORE debug breaking change ci [datafusion]
via GitHub
Re: [PR] IGNORE debug breaking change ci [datafusion]
via GitHub
[PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
Re: [PR] feat(parquet): runtime row-group early stop via TopK dynamic filter [datafusion]
via GitHub
Re: [I] [Optimizer] Consolidate repeated filter-rebuild patterns in PushDownFilter [datafusion]
via GitHub
Re: [PR] Refactor unary filter pushdown logic for Aggregate and Window [datafusion]
via GitHub
[I] Centralize checked byte-size and offset accounting for variable-size string builders [datafusion]
via GitHub
Re: [I] Centralize checked byte-size and offset accounting for variable-size string builders [datafusion]
via GitHub
[I] Refactor eliminate_outer_join null-rejection analysis to track join sides directly [datafusion]
via GitHub
Re: [I] Refactor eliminate_outer_join null-rejection analysis to track join sides directly [datafusion]
via GitHub
[I] Centralize shared-allocation accounting for Arc DFHeapSize implementations [datafusion]
via GitHub
[PR] Clearly gate sliding SUM(DISTINCT) type support [datafusion]
via GitHub
Re: [PR] Clearly gate sliding SUM(DISTINCT) type support [datafusion]
via GitHub
Re: [PR] Draft: implement non-blocking morsel API [datafusion]
via GitHub
Re: [PR] Draft: implement non-blocking morsel API [datafusion]
via GitHub
[PR] chore(deps): bump uuid from 1.23.2 to 1.23.3 [datafusion-ballista]
via GitHub
Re: [PR] chore(deps): bump uuid from 1.23.2 to 1.23.3 [datafusion-ballista]
via GitHub
[PR] chore(deps): bump taiki-e/install-action from 2.81.8 to 2.81.9 [datafusion-ballista]
via GitHub
Re: [PR] chore(deps): bump taiki-e/install-action from 2.81.8 to 2.81.9 [datafusion-ballista]
via GitHub
[PR] fix: allow EvalMode.TRY in CometRemainder to support try_mod [datafusion-comet]
via GitHub
Re: [PR] fix: allow EvalMode.TRY in CometRemainder to support try_mod [datafusion-comet]
via GitHub
Re: [I] [Bug] try_mod falls back to Spark because CometRemainder rejects EvalMode.TRY [datafusion-comet]
via GitHub
Re: [I] [Bug] try_mod falls back to Spark because CometRemainder rejects EvalMode.TRY [datafusion-comet]
via GitHub
Re: [PR] feat(cli): implement mmap based object store for local files [datafusion]
via GitHub
Re: [PR] feat(parquet): support CDC chunking options [datafusion]
via GitHub
Re: [PR] Enable logical planning for `UPDATE ... FROM` and preserve joined assignment qualifiers [datafusion]
via GitHub