github
Thread
Date
Earlier messages
Later messages
Messages by Date
2026/06/18
[PR] Implement FixedSizeBinary zero-copy reinterpretation optimization [datafusion]
via GitHub
2026/06/18
[PR] Implement Legacy String Optimization (Utf8TwoStageFilter) [datafusion]
via GitHub
2026/06/18
[PR] Extend Bitmap Filter to UInt16 (Heap-based) [datafusion]
via GitHub
2026/06/18
[PR] Implement String View (Utf8View/BinaryView) Optimizations [datafusion]
via GitHub
2026/06/18
[PR] Implement Direct Probe (Hash) Filter for large primitive lists [datafusion]
via GitHub
2026/06/18
[PR] Implement Branchless Filter for small primitive lists [datafusion]
via GitHub
2026/06/18
[PR] Implement Zero-Copy Reinterpretation and enable Int8/Int16 Bitmaps [datafusion]
via GitHub
2026/06/18
[PR] Implement Bitmap Filter for UInt8 (Stack-based) [datafusion]
via GitHub
2026/06/18
Re: [I] Substrait consumer: DuplicateUnqualifiedField when chaining two Window relations with a carried window column [datafusion]
via GitHub
2026/06/18
Re: [I] Cleanup: Name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/18
[I] Refactor: Make join projection pushdown schema-aware via `ColumnIndex` / `JoinSide` [datafusion]
via GitHub
2026/06/18
[PR] chore(deps-dev): bump webpack-dev-server from 5.2.4 to 5.2.5 in /datafusion/wasmtest/datafusion-wasm-app [datafusion]
via GitHub
2026/06/18
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/06/17
Re: [PR] refactor: make scalar distance u64 and overflow aware [datafusion]
via GitHub
2026/06/17
Re: [I] Make Scalar Distance Overflow-Aware and Domain-Sized [datafusion]
via GitHub
2026/06/17
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/06/17
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/06/17
Re: [I] Further improve performance of IN list evaluation [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: parquet limit pruning for row group selections [datafusion]
via GitHub
2026/06/17
[I] Cleanup: Name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/17
Re: [I] limit pruning ignores `RowSelection` [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: parquet limit pruning for row group selections [datafusion]
via GitHub
2026/06/17
Re: [I] [datafusion-spark] Optimize hex function [datafusion]
via GitHub
2026/06/17
[I] Substrait consumer: DuplicateUnqualifiedField when chaining two Window relations with a carried window column [datafusion]
via GitHub
2026/06/17
Re: [I] Hash join should omit NULLs from build side under `NullEqualsNothing` [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub
2026/06/17
Re: [PR] chore: attach Diagnostic to unary operator type errors [datafusion]
via GitHub
2026/06/17
Re: [I] Attach `Diagnostic` to "incompatible type in unary expression" error [datafusion]
via GitHub
2026/06/17
[PR] Doris SQL: create table column options [datafusion-sqlparser-rs]
via GitHub
2026/06/17
[I] Move unary diagnostic logic from sql planner to analyzer [datafusion]
via GitHub
2026/06/17
Re: [PR] chore: attach Diagnostic to unary operator type errors [datafusion]
via GitHub
2026/06/17
Re: [PR] Add FixedSizeList support for recursive struct schema adaptation [datafusion]
via GitHub
2026/06/17
Re: [PR] chore: attach Diagnostic to unary operator type errors [datafusion]
via GitHub
2026/06/17
Re: [PR] Add FixedSizeList support for recursive struct schema adaptation [datafusion]
via GitHub
2026/06/17
Re: [PR] Doris SQL: create table model [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] fix: coalesce the merged key of RIGHT/FULL USING/NATURAL joins [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
[PR] Doris SQL: create table model [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] feat: Add Spark-compatible `encode` function to datafusion-spark [datafusion]
via GitHub
2026/06/17
Re: [PR] Add FixedSizeList support for recursive struct schema adaptation [datafusion]
via GitHub
2026/06/17
Re: [PR] Hive: Support DISTRIBUTE BY and SORT BY in window specs [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] Revert Teradata dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] Revert Teradata dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
[PR] chore(deps): bump tui-big-text from 0.8.7 to 0.8.8 [datafusion-ballista]
via GitHub
2026/06/17
[PR] chore(deps): bump taiki-e/install-action from 2.81.11 to 2.82.0 [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: optimize_projections failure with struct-field join keys [datafusion]
via GitHub
2026/06/17
Re: [I] `ProjectionPushdown` internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: ProjectionPushdown internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
2026/06/17
Re: [PR] Unify AVG group state conversion and filter handling across Spark and built-in accumulators [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [I] Reduce Github Action Usage [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub
2026/06/17
Re: [PR] Doris SQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] Validate coerce int96 config 17498 [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: multiple columns in count distinct [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: coalesce the merged key of RIGHT/FULL USING/NATURAL joins [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: add PostgreSQL EXCLUDE constraint parsing [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [I] Reduce Github Action Usage [datafusion]
via GitHub
2026/06/17
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] [#21878] extensive test for multi-dictionary column group bys [datafusion]
via GitHub
2026/06/17
[PR] feat: use aligned slice access during bulk append in SparkUnsafeArray [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] [#21878] extensive test for multi-dictionary column group bys [datafusion]
via GitHub
2026/06/17
Re: [PR] Skip loading Parquet page index when row-group statistics already prove it cannot prune [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/17
Re: [I] RAT check in Maven build scans temporary release/scratch directories, making release builds slow [datafusion-comet]
via GitHub
2026/06/17
[I] RAT check in Maven build scans temporary release/scratch directories, making release builds slow [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] feat: improve pythonic interface on date/time functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump prost from 0.14.3 to 0.14.4 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump prost from 0.14.3 to 0.14.4 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump pyo3-log from 0.13.3 to 0.13.4 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump pyo3-log from 0.13.3 to 0.13.4 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump uuid from 1.23.2 to 1.23.3 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): bump uuid from 1.23.2 to 1.23.3 [datafusion-python]
via GitHub
2026/06/17
Re: [PR] chore: update rust dependencies [datafusion-python]
via GitHub
2026/06/17
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] chore: Test update object store to 0.14.0 [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/17
[PR] chore: Test update object store to 0.14.0 [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Omit NULL values from build side of hash joins [datafusion]
via GitHub
2026/06/17
[PR] Add Hotdata to the "known users" list in introduction.md [datafusion]
via GitHub
2026/06/17
Re: [I] perf: use aligned slice access in SparkUnsafeArray bulk append [datafusion-comet]
via GitHub
2026/06/17
[PR] chore: update rust dependencies [datafusion-python]
via GitHub
2026/06/17
Re: [PR] build(deps): batch dependabot dependency updates [datafusion-python]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] refactor: make scalar distance u64 and overflow aware [datafusion]
via GitHub
2026/06/17
[PR] Add sorted TopK TPC-H benchmark target [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: coalesce the merged key of RIGHT/FULL USING/NATURAL joins [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/17
[PR] Experimental: support parquet partition write [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
Re: [I] Expose Spark functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
[PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/17
Re: [PR] Revert Teradata dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
[PR] Fix DuckDB unparse for optimized join projections [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: Add new `input_file_name` UDF for file-backed scans [datafusion]
via GitHub
2026/06/17
[PR] feat(unparser): support binary literals [datafusion]
via GitHub
2026/06/17
[I] Unparser: support unparsing binary scalars [datafusion]
via GitHub
2026/06/17
[PR] Feat/unparser spaceship [datafusion]
via GitHub
2026/06/17
Re: [PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] RIGHT/FULL/NATURAL JOIN ... USING(k) does not coalesce the join key (returns NULL for right-only rows) [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
[PR] fix: coalesce the merged key of RIGHT/FULL USING/NATURAL joins [datafusion]
via GitHub
2026/06/17
Re: [PR] refactor: thread SubqueryContext explicitly through physical planning [datafusion]
via GitHub
2026/06/17
Re: [I] Support logical protobuf serialization for range repartitioning [datafusion]
via GitHub
2026/06/17
Re: [I] Support logical protobuf serialization for range repartitioning [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Parquet row-filter struct schema pruning [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Parquet row-filter struct schema pruning [datafusion]
via GitHub
2026/06/17
Re: [I] Support logical protobuf serialization for range repartitioning [datafusion]
via GitHub
2026/06/17
[I] Unparser: support spaceship operator [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Refactor eliminate_outer_join null-rejection tracking to side-level state [datafusion]
via GitHub
2026/06/17
Re: [PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
2026/06/17
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/17
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: expose spark-compatible functions [datafusion-python]
via GitHub
2026/06/17
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/17
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/17
Re: [PR] Refactor outer join null-rejection analysis to track join sides directly [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: add PostgreSQL EXCLUDE constraint parsing [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/17
Re: [I] [EPIC] TUI Improvements [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
[PR] implement map_agg [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
[I] Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/17
[PR] fix: Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/17
Re: [I] Parquet bloom filter pruning can incorrectly filter decimals encoded as FIXED_LEN_BYTE_ARRAY [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Improve job failure handling [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: ProjectionPushdown internal error on NestedLoopJoin mark joins [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Preserve integer values in round() for large Int64 and UInt64 inputs [datafusion]
via GitHub
2026/06/17
[PR] Revert Teradata dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: Preserve integer values in round() for large Int64 and UInt64 inputs [datafusion]
via GitHub
2026/06/17
Re: [PR] Optimize Parquet row-filter struct schema pruning [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/17
Re: [PR] Do not consume statement terminator in unparenthesized option lists [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [I] NewType pattern for executor id's [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] Doris SQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.checkpoint()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/17
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/17
[I] NewType pattern for executor id's [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [I] Implement Aggregate function `map_agg` [datafusion]
via GitHub
2026/06/17
[I] Implement Aggregate function `map_agg` [datafusion]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
Re: [PR] Remove sqllogictest fork swap from regenerate_sqlite_files.sh [datafusion]
via GitHub
2026/06/17
Re: [PR] Update expected results for duplicate column names fix [datafusion-testing]
via GitHub
2026/06/17
Re: [PR] perf: address scheduler crash on high partition SF data [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: coerce SIMILAR TO operands to a common string type [datafusion]
via GitHub
2026/06/17
[PR] fix: coerce SIMILAR TO operands to a common string type [datafusion]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] fix: coerce SIMILAR TO operands to a common string type to avoid 'failed to downcast array' panic [datafusion]
via GitHub
2026/06/17
Re: [I] Add support for `DataFrame.cache()` to Ballista [datafusion-ballista]
via GitHub
2026/06/17
Re: [I] `ExecutorMetatada` too verbose [datafusion-ballista]
via GitHub
2026/06/17
[PR] Doris SQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
Re: [PR] feat: warn on NULL equality predicates [datafusion]
via GitHub
2026/06/17
[I] DorisSQL: add Doris Dialect [datafusion-sqlparser-rs]
via GitHub
2026/06/17
[PR] Fix shared TopK early exit with global prefix threshold [datafusion]
via GitHub
2026/06/17
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/17
Re: [PR] fix: graceful error for deeply nested expressions instead of stack overflow [datafusion]
via GitHub
2026/06/17
Re: [I] Improve shuffle (column) statistics [datafusion-ballista]
via GitHub
2026/06/17
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub