github
Thread
Date
Earlier messages
Later messages
Messages by Date
2025/12/21
Re: [PR] perf: skip double lookup in multi group by hash map [datafusion]
via GitHub
2025/12/21
Re: [PR] perf: skip double lookup in multi group by hash map [datafusion]
via GitHub
2025/12/21
Re: [PR] Add Physical Extension Protobuf Codec [datafusion]
via GitHub
2025/12/21
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/21
Re: [PR] minor: refactoring of some `ScalarValue` code [datafusion]
via GitHub
2025/12/21
Re: [PR] <DRAFT> IN LIST optims [datafusion]
via GitHub
2025/12/21
[PR] minor: refactoring of some `ScalarValue` code [datafusion]
via GitHub
2025/12/21
Re: [PR] Enables DefaultListFilesCache by default [datafusion]
via GitHub
2025/12/21
Re: [PR] Add blog post on extending SQL in DataFusion [datafusion-site]
via GitHub
2025/12/21
[I] Consider adding `TypeSignatureClass::Any` variant [datafusion]
via GitHub
2025/12/21
Re: [PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/21
[PR] Add Physical Extension Protobuf Codec [datafusion]
via GitHub
2025/12/21
Re: [PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/21
Re: [PR] Fix TopK aggregation for UTF-8/Utf8View group keys and add safe fallback for unsupported string aggregates [datafusion]
via GitHub
2025/12/21
Re: [PR] feat: Add decimal support for round [datafusion]
via GitHub
2025/12/21
Re: [PR] build: add prek to run pre-commit hook for native module [datafusion-comet]
via GitHub
2025/12/21
Re: [PR] support negative scale for decimal ScalarValue [datafusion]
via GitHub
2025/12/21
Re: [PR] support negative scale for decimal ScalarValue [datafusion]
via GitHub
2025/12/21
Re: [I] Support for negative scale decimals in ScalarValue [datafusion]
via GitHub
2025/12/21
Re: [I] Support for negative scale decimals in ScalarValue [datafusion]
via GitHub
2025/12/21
[PR] build: add prek to run pre-commit hook for native module [datafusion-comet]
via GitHub
2025/12/21
[I] Alter `ScalarValue` APIs to be more clear when an operation isn't permitted [datafusion]
via GitHub
2025/12/21
Re: [PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/21
Re: [PR] WIP: Upgrade DataFusion to arrow-rs/parquet 57.2.0 [datafusion]
via GitHub
2025/12/21
Re: [PR] perf: skip double lookup in multi group by hash map [datafusion]
via GitHub
2025/12/21
Re: [PR] perf: skip double lookup in multi group by hash map [datafusion]
via GitHub
2025/12/21
Re: [PR] build: Reinstate macOS CI builds of Comet with Spark 4.0 [datafusion-comet]
via GitHub
2025/12/21
Re: [PR] fix: simplify IS NULL/IS NOT NULL literals for parquet pruning [datafusion]
via GitHub
2025/12/21
Re: [I] Parquet stats pruning: `IS NULL/IS NOT NULL` Simplification for Literal Arguments [datafusion]
via GitHub
2025/12/21
[PR] build: Reinstate macOS CI builds of Comet with Spark 4.0 [datafusion-comet]
via GitHub
2025/12/21
Re: [PR] [RFC] Add lambda support and array_transform udf [datafusion]
via GitHub
2025/12/21
Re: [PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/21
Re: [PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/21
Re: [PR] Fix TopK aggregation for UTF-8/Utf8View group keys and add safe fallback for unsupported string aggregates [datafusion]
via GitHub
2025/12/21
Re: [PR] Add blog post on extending SQL in DataFusion [datafusion-site]
via GitHub
2025/12/21
[PR] Add:arrow_metadata() UDF [datafusion]
via GitHub
2025/12/21
Re: [PR] doc: add example for cache factory [datafusion]
via GitHub
2025/12/21
Re: [I] Add example for using `CacheFactory` with `DataFrame::cache` [datafusion]
via GitHub
2025/12/21
Re: [I] Spark 4.0.x hive-1 jdk17 tests failing consistently in multiple PRs [datafusion-comet]
via GitHub
2025/12/21
Re: [PR] fix: enable cast tests for Spark 4.0 [datafusion-comet]
via GitHub
2025/12/21
Re: [PR] fix: format decimal to string when casting to short [datafusion-comet]
via GitHub
2025/12/21
Re: [PR] infer parquet file order from metadata, write order in ParquetSink [datafusion]
via GitHub
2025/12/20
[PR] fix: simplify IS NULL/IS NOT NULL literals for parquet pruning [datafusion]
via GitHub
2025/12/20
Re: [PR] fix: simplify IS NULL/IS NOT NULL literals for parquet pruning [datafusion]
via GitHub
2025/12/20
Re: [I] option to disable evaluation of stable expressions in optimizer rules [datafusion]
via GitHub
2025/12/20
Re: [PR] Add option to disable evaluation of stable expressions in optimizer [datafusion]
via GitHub
2025/12/20
Re: [PR] infer parquet file order from metadata, write order in ParquetSink [datafusion]
via GitHub
2025/12/20
[PR] infer parquet file order from metadata, write order in ParquetSink [datafusion]
via GitHub
2025/12/20
Re: [PR] chore: use extend instead of manual loop in multi group by [datafusion]
via GitHub
2025/12/20
Re: [PR] fix: csv schema_infer_max_records set to 0 return null datatype [datafusion]
via GitHub
2025/12/20
[PR] fix: csv schema_infer_max_records set to 0 return null datatype [datafusion]
via GitHub
2025/12/20
Re: [PR] WIP: using arrow-avro. remove own implementation [datafusion]
via GitHub
2025/12/20
Re: [PR] Add option to disable evaluation of stable expressions in optimizer [datafusion]
via GitHub
2025/12/20
Re: [PR] fix: NULL handling in arrow_intersect and arrow_union [datafusion]
via GitHub
2025/12/20
Re: [PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/20
Re: [PR] Add blog post on extending SQL in DataFusion [datafusion-site]
via GitHub
2025/12/20
Re: [PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/20
Re: [PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/20
[PR] perf: improve `range` and `generate_series` for `Int64` [datafusion]
via GitHub
2025/12/20
Re: [PR] perf: skip double lookup in multi group by hash map [datafusion]
via GitHub
2025/12/20
Re: [PR] Respect execution timezone in to_timestamp and related functions [datafusion]
via GitHub
2025/12/20
Re: [PR] bench: add `range_and_generate_series` [datafusion]
via GitHub
2025/12/20
Re: [PR] perf: skip double lookup in multi group by hash map [datafusion]
via GitHub
2025/12/20
Re: [PR] perf: skip double lookup in multi group by hash map [datafusion]
via GitHub
2025/12/20
[PR] perf: skip double lookup in multi group by hash map [datafusion]
via GitHub
2025/12/20
Re: [PR] bench: add `range_and_generate_series` [datafusion]
via GitHub
2025/12/20
Re: [PR] bench: add `range_and_generate_series` [datafusion]
via GitHub
2025/12/20
Re: [PR] chore: use extend instead of manual loop in multi group by [datafusion]
via GitHub
2025/12/20
[PR] chore: use extend instead of manual loop in multi group by [datafusion]
via GitHub
2025/12/20
Re: [PR] Allow flag to alias all projected substrait expressions with a UUID [datafusion]
via GitHub
2025/12/20
Re: [PR] feat: Support string decimal cast [datafusion-comet]
via GitHub
2025/12/20
Re: [PR] feat: Support ANSI mode avg expr (int inputs) [datafusion-comet]
via GitHub
2025/12/20
[PR] bench: add` range_and_generate_series` [datafusion]
via GitHub
2025/12/20
Re: [I] Parquet stats pruning: `IS NULL/IS NOT NULL` Simplification for Literal Arguments [datafusion]
via GitHub
2025/12/20
Re: [PR] feat: integrate batch coalescer with async fn exec [datafusion]
via GitHub
2025/12/20
Re: [PR] feat: integrate batch coalescer with async fn exec [datafusion]
via GitHub
2025/12/20
Re: [PR] chore: deprecate native_comet scan in favor of native_iceberg_compat [datafusion-comet]
via GitHub
2025/12/20
[I] Join of recursive CTEs runs indefinitely [datafusion]
via GitHub
2025/12/20
Re: [PR] fix: NULL handling in arrow_intersect and arrow_union [datafusion]
via GitHub
2025/12/20
Re: [PR] fix: NULL handling in arrow_intersect and arrow_union [datafusion]
via GitHub
2025/12/20
Re: [PR] Add blog post regarding CASE work [datafusion-site]
via GitHub
2025/12/20
Re: [PR] feat: Add support for `unix_timestamp` function [datafusion-comet]
via GitHub
2025/12/20
Re: [PR] Add option to disable evaluation of stable expressions in optimizer [datafusion]
via GitHub
2025/12/20
Re: [PR] Add option to disable evaluation of stable expressions in optimizer [datafusion]
via GitHub
2025/12/20
Re: [PR] Add option to disable evaluation of stable expressions in optimizer [datafusion]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
Re: [PR] build: Skip problematic Spark SQL test for Spark 4.0.x [datafusion-comet]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
Re: [PR] feat: Allow log with non-integer base on decimals [datafusion]
via GitHub
2025/12/20
Re: [PR] support negative scale for decimal ScalarValue [datafusion]
via GitHub
2025/12/20
Re: [PR] Add option to disable evaluation of stable expressions in optimizer [datafusion]
via GitHub
2025/12/20
[PR] chore: deprecate native_comet scan in favor of native_iceberg_compat [datafusion-comet]
via GitHub
2025/12/20
[PR] feat: allow native Iceberg scans with non-identity transform residuals [datafusion-comet]
via GitHub
2025/12/20
Re: [PR] feat: Add partial support for `from_json` [datafusion-comet]
via GitHub
2025/12/20
Re: [PR] feat: Add partial support for `from_json` [datafusion-comet]
via GitHub
2025/12/20
[PR] build: Skip problematic Spark SQL test for Spark 4.0.x [datafusion-comet]
via GitHub
2025/12/20
[I] Spark 4.0.x hive-1 tests failing consistently in multiple PRs [datafusion-comet]
via GitHub
2025/12/20
Re: [PR] ci: add check for doc comment formatting [datafusion]
via GitHub
2025/12/20
Re: [PR] WIP: using arrow-avro. remove own implementation [datafusion]
via GitHub
2025/12/20
Re: [PR] Add blog post regarding CASE work [datafusion-site]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
Re: [PR] fix: format decimal to string when casting to short [datafusion-comet]
via GitHub
2025/12/20
Re: [I] Future of Iceberg Support in Comet [datafusion-comet]
via GitHub
2025/12/20
Re: [I] Add ParquetOpenerBuilder to make code more clean and readable [datafusion]
via GitHub
2025/12/20
Re: [PR] refactor: add ParquetOpenerBuilder to reduce test code duplication [datafusion]
via GitHub
2025/12/20
Re: [I] Add Local Scripts to Reproduce Full CI and Perform Auto-Fixes [datafusion]
via GitHub
2025/12/20
Re: [I] [Bug] BinaryView/StringView columns are spilled without GC and results in enormous spill files [datafusion]
via GitHub
2025/12/20
Re: [I] option to disable evaluation of stable expressions in optimizer rules [datafusion]
via GitHub
2025/12/20
[PR] Add option to disable evaluation of stable expressions in optimizer [datafusion]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
Re: [PR] feat: integrate batch coalescer with async fn exec [datafusion]
via GitHub
2025/12/20
Re: [PR] feat: integrate batch coalescer with async fn exec [datafusion]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
[I] Running Clickbench query 18 fails with "failed to fill whole buffer" error [datafusion]
via GitHub
2025/12/20
[PR] Replace custom merge operator with arrow-rs implementation [datafusion]
via GitHub
2025/12/20
[I] Use `merge` and `merge_n` from `arrow-rs` instead of custom implementations [datafusion]
via GitHub
2025/12/20
Re: [PR] Introduce way to customize prefix of multi file outputs [datafusion]
via GitHub
2025/12/20
Re: [PR] docs: Fix upgrade guide API examples for FileScanConfigBuilder and ParquetSource [datafusion]
via GitHub
2025/12/20
Re: [I] [Bug] BinaryView/StringView columns are spilled without GC and results in enormous spill files [datafusion]
via GitHub
2025/12/20
Re: [PR] Remove core dependency from ffi [datafusion]
via GitHub
2025/12/20
Re: [PR] Remove core dependency from ffi [datafusion]
via GitHub
2025/12/20
[PR] Remove core dependency from ffi [datafusion]
via GitHub
2025/12/20
Re: [I] Add Local Scripts to Reproduce Full CI and Perform Auto-Fixes [datafusion]
via GitHub
2025/12/20
Re: [PR] Add blog post on extending SQL in DataFusion [datafusion-site]
via GitHub
2025/12/20
Re: [PR] feat: update FFI TableProvider and ExecutionPlan to use FFI Session and TaskContext [datafusion]
via GitHub
2025/12/20
Re: [PR] feat: update FFI TableProvider and ExecutionPlan to use FFI Session and TaskContext [datafusion]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
Re: [PR] Optimize hashing for StringView and ByteView (15-70% faster) [datafusion]
via GitHub
2025/12/20
Re: [PR] consecutive repartitions blog post [datafusion-site]
via GitHub
2025/12/20
Re: [I] Datafusion blog is giving a 404 starting Nov 20, 2024 [datafusion-site]
via GitHub
2025/12/20
Re: [I] DataFusion blog is broken [datafusion]
via GitHub
2025/12/20
Re: [I] Datafusion blog is giving a 404 starting Nov 20, 2024 [datafusion-site]
via GitHub
2025/12/20
Re: [I] DataFusion blog is broken [datafusion]
via GitHub
2025/12/20
Re: [I] DataFusion blog is broken [datafusion]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
Re: [PR] Update .asf.yaml comments [datafusion-site]
via GitHub
2025/12/20
Re: [PR] Optimize hashing for StringView and ByteView (15-70% faster) [datafusion]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
Re: [PR] feat: integrate batch coalescer with async fn exec [datafusion]
via GitHub
2025/12/20
Re: [PR] docs: Fix upgrade guide API examples for FileScanConfigBuilder and ParquetSource [datafusion]
via GitHub
2025/12/20
Re: [PR] Fix panic after spill to disk in clickbench [datafusion]
via GitHub
2025/12/20
[PR] Fix panic after spill to disk in clickbench [datafusion]
via GitHub
2025/12/20
Re: [I] Grouped aggregations with many distinct groups do not respect memory limit when input is sorted [datafusion]
via GitHub
2025/12/20
Re: [PR] Implement disk spilling for all grouping ordering modes in GroupedHashAggregateStream [datafusion]
via GitHub
2025/12/20
Re: [PR] Implement disk spilling for all grouping ordering modes in GroupedHashAggregateStream [datafusion]
via GitHub
2025/12/20
Re: [PR] Add blog post regarding CASE work [datafusion-site]
via GitHub
2025/12/20
[PR] feat: Add `preselection_threshold` option to `BinaryExpr` [datafusion]
via GitHub
2025/12/20
Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]
via GitHub
2025/12/20
Re: [PR] perfect hash join [datafusion]
via GitHub
2025/12/20
Re: [PR] fix: enable cast tests for Spark 4.0 [datafusion-comet]
via GitHub
2025/12/20
Re: [PR] <DRAFT> IN LIST optims [datafusion]
via GitHub
2025/12/20
Re: [PR] <DRAFT> IN LIST optims [datafusion]
via GitHub
2025/12/20
Re: [PR] Optimize hashing for StringView and ByteView (15-70% faster) [datafusion]
via GitHub
2025/12/20
Re: [PR] <DRAFT> IN LIST optims [datafusion]
via GitHub
2025/12/20
Re: [PR] Optimize muti-column grouping with StringView/ByteView (option 2) - 25% faster [datafusion]
via GitHub
2025/12/20
Re: [PR] [TEST] Improve combine_hashes [datafusion]
via GitHub
2025/12/20
Re: [PR] [TEST] Improve combine_hashes [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: output statistics for constant columns in projections [datafusion]
via GitHub
2025/12/19
Re: [PR] WIP: using arrow-avro. remove own implementation [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: output statistics for constant columns in projections [datafusion]
via GitHub
2025/12/19
[PR] feat: output statistics for constant columns in projections [datafusion]
via GitHub
2025/12/19
Re: [PR] Store example data directly inside the datafusion-examples (#19141) [datafusion]
via GitHub
2025/12/19
Re: [PR] added support for negative scale for log decimal32/64 and power [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: allow to skip named parameters and fill skipped with NULL [datafusion]
via GitHub
2025/12/19
Re: [PR] Store example data directly inside the datafusion-examples (#19141) [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: Allow log with non-integer base on decimals [datafusion]
via GitHub
2025/12/19
Re: [I] option to disable evaluation of stable expressions in optimizer rules [datafusion]
via GitHub
2025/12/19
Re: [PR] fix: NULL handling in arrow_intersect and arrow_union [datafusion]
via GitHub
2025/12/19
Re: [I] Constant Columns should output statistics [datafusion]
via GitHub
2025/12/19
Re: [I] SchemaMapping.map_column_statistics produce column_statistics mismatch. [datafusion]
via GitHub
2025/12/19
Re: [I] SchemaMapping.map_column_statistics produce column_statistics mismatch. [datafusion]
via GitHub
2025/12/19
Re: [PR] fix: enable cast tests for Spark 4.0 [datafusion-comet]
via GitHub
2025/12/19
Re: [PR] fix: format decimal to string when casting to short [datafusion-comet]
via GitHub
2025/12/19
Re: [PR] refactor: add ParquetOpenerBuilder to reduce test code duplication [datafusion]
via GitHub
2025/12/19
Re: [PR] refactor: add ParquetOpenerBuilder to reduce test code duplication [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: Add decimal support for round [datafusion]
via GitHub
2025/12/19
Re: [PR] fix: NULL handling in arrow_intersect and arrow_union [datafusion]
via GitHub
2025/12/19
[I] option to disable evaluation of stable expressions in optimizer rules [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: Create benchmarks comet cast [datafusion-comet]
via GitHub
2025/12/19
Re: [PR] perfect hash join [datafusion]
via GitHub
2025/12/19
[I] [Regression] No longer possible to disable CSV schema inference [datafusion]
via GitHub
2025/12/19
Re: [PR] Fix: eliminate unnecessary repartitioning for small datasets [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: Create benchmarks comet cast [datafusion-comet]
via GitHub
2025/12/19
Re: [I] Teach the type arithmetic code to support date + time -> timestamp [datafusion]
via GitHub
2025/12/19
Re: [I] Teach the type arithmetic code to support Timestamp + Duration(Second) -> timestamp [datafusion]
via GitHub
2025/12/19
Re: [PR] Respect execution timezone in to_timestamp and related functions [datafusion]
via GitHub
2025/12/19
Re: [PR] Respect execution timezone in to_timestamp and related functions [datafusion]
via GitHub
2025/12/19
Re: [PR] Implement disk spilling for all grouping ordering modes in GroupedHashAggregateStream [datafusion]
via GitHub
2025/12/19
Re: [PR] Implement disk spilling for all grouping ordering modes in GroupedHashAggregateStream [datafusion]
via GitHub
2025/12/19
Re: [PR] Implement disk spilling for all grouping ordering modes in GroupedHashAggregateStream [datafusion]
via GitHub
2025/12/19
Re: [I] Panic when ClickBench Q18 with memory limited to 1G: range end index 1943308 out of range for slice of length 193245 [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: hash partitioning satisfies subset [datafusion]
via GitHub
2025/12/19
Re: [I] Hash partitioning should satisfy hash subset partitioning [datafusion]
via GitHub
2025/12/19
Re: [PR] feat: hash partitioning satisfies subset [datafusion]
via GitHub
2025/12/19
Re: [PR] Update date_bin to support Time32 and Time64 data types [datafusion]
via GitHub
2025/12/19
Re: [I] [Bug] BinaryView/StringView columns are spilled without GC and results in enormous spill files [datafusion]
via GitHub
2025/12/19
Re: [PR] Update date_bin to support Time32 and Time64 data types [datafusion]
via GitHub
2025/12/19
Re: [PR] Update date_bin to support Time32 and Time64 data types [datafusion]
via GitHub
2025/12/19
Re: [PR] Implement disk spilling for all grouping ordering modes in GroupedHashAggregateStream [datafusion]
via GitHub
2025/12/19
Re: [PR] Implement disk spilling for all grouping ordering modes in GroupedHashAggregateStream [datafusion]
via GitHub
Earlier messages
Later messages