github
Thread
Date
Earlier messages
Later messages
Messages by Date
2026/05/04
Re: [PR] feat: AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
2026/05/04
[PR] feat: support AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
2026/05/04
Re: [I] Write a wikipedia article for Apache DataFusion [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [I] Add cargo-fuzz / OSS-Fuzz coverage for DataFusion's parser and analyzer [datafusion]
via GitHub
2026/05/04
Re: [I] Comet should fallback to Spark for streaming queries [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
2026/05/04
Re: [I] Unsupported aggregation mode PartialMerge [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] Add benchmarks for dictionary path of new_group_values [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for PostgreSQL's ORDER BY ... USING <operator> clause [datafusion-sqlparser-rs]
via GitHub
2026/05/04
Re: [PR] Add benchmarks for dictionary path of new_group_values [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] Add benchmark_runner for sql_benchmarks with help and list commands [datafusion]
via GitHub
2026/05/04
Re: [PR] Add benchmark_runner for sql_benchmarks with help and list commands [datafusion]
via GitHub
2026/05/04
Re: [PR] Add benchmark_runner for sql_benchmarks with help and list commands [datafusion]
via GitHub
2026/05/04
Re: [I] Add cargo-fuzz / OSS-Fuzz coverage for DataFusion's parser and analyzer [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] test: extend SPARK-43402 plan-match to CometNativeScanExec and retag to #4042 [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] test: extend SPARK-43402 plan-match to CometNativeScanExec and retag to #4042 [datafusion-comet]
via GitHub
2026/05/04
[PR] fix: include per-column details in exportBatch row count mismatch error [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] ci: add `auto detected api change` label on breaking change detecting in the CI [datafusion]
via GitHub
2026/05/04
Re: [PR] Add benchmark_runner for sql_benchmarks with help and list commands [datafusion]
via GitHub
2026/05/04
Re: [PR] Map ProfileCredentialsProvider to profiel credential chain [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] Add lambda substrait support [datafusion]
via GitHub
2026/05/04
[PR] Map ProfileCredentialsProvider to profiel credential chain [datafusion-comet]
via GitHub
2026/05/04
[I] Support AWS ProfileCredentialsProvider in native S3 object store [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/04
Re: [PR] Upgrade to arrow-rs / parquet / avro 58.2.0 [datafusion]
via GitHub
2026/05/04
[I] Number of rows in each column should be the same, but got [ArrayBuffer(8192, 0)] [datafusion-comet]
via GitHub
2026/05/04
Re: [I] Release DataFusion `54.0.0` (Apr 2026 / May 2026) [datafusion]
via GitHub
2026/05/04
Re: [I] [DISCUSSION] Extending Partitioning to Support More Variants [datafusion]
via GitHub
2026/05/04
Re: [PR] [WIP] Explore extensible range partitioning for dynamic filters [datafusion]
via GitHub
2026/05/04
Re: [I] [DISCUSSION] Extending Partitioning to Support More Variants [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: Making From<ConfigFileDecryptionProperties/ConfigFileEncryptionProperties> conversions fallible with `TryFrom` [datafusion]
via GitHub
2026/05/04
Re: [PR] Skip RowFilter and page pruning for fully matched row groups [datafusion]
via GitHub
2026/05/04
Re: [PR] fix(spark-expr): preserve scalar tag in WideDecimalBinaryExpr when both inputs are scalars [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: Making From<ConfigFileDecryptionProperties/ConfigFileEncryptionProperties> conversions fallible with `TryFrom` [datafusion]
via GitHub
2026/05/04
Re: [PR] fix: `median` returns Float64 for integer inputs to avoid truncation [datafusion]
via GitHub
2026/05/04
Re: [PR] Add benchmarks for dictionary path of new_group_values [datafusion]
via GitHub
2026/05/04
Re: [PR] Add reusable plan-time schema alignment helper and apply to RecursiveQueryExec [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: globally reorder files and row groups by statistics for TopK queries [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: Optimize `reverse` using bulk-NULL string builders [datafusion]
via GitHub
2026/05/04
Re: [I] Optimize `reverse` with bulk-NULL string builders [datafusion]
via GitHub
2026/05/04
Re: [PR] fix(spark-expr): preserve scalar tag in WideDecimalBinaryExpr when both inputs are scalars [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] Fix fully matched row groups with null counts [datafusion]
via GitHub
2026/05/04
Re: [PR] Skip RowFilter and page pruning for fully matched row groups [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: Optimize `reverse` using bulk-NULL string builders [datafusion]
via GitHub
2026/05/04
Re: [I] CaseWhen does not work with custom implemented column expression [datafusion]
via GitHub
2026/05/04
Re: [PR] Skip RowFilter and page pruning for fully matched row groups [datafusion]
via GitHub
2026/05/04
Re: [PR] Skip RowFilter and page pruning for fully matched row groups [datafusion]
via GitHub
2026/05/04
Re: [PR] Skip RowFilter and page pruning for fully matched row groups [datafusion]
via GitHub
2026/05/04
Re: [PR] Fix fully matched row groups with null counts [datafusion]
via GitHub
2026/05/04
Re: [I] Native scan path doesn't honour Parquet field-ID matching when spark.sql.parquet.fieldId.read.enabled=true [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: globally reorder files and row groups by statistics for TopK queries [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: globally reorder files and row groups by statistics for TopK queries [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: globally reorder files and row groups by statistics for TopK queries [datafusion]
via GitHub
2026/05/04
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin [datafusion]
via GitHub
2026/05/04
[PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin [datafusion]
via GitHub
2026/05/04
[PR] docs: document Spark version labels in bug triage guide [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: support spark compatible floor function [datafusion]
via GitHub
2026/05/04
Re: [PR] physical-expr-common: remote `PhysicalExpr::snapshot` method [datafusion]
via GitHub
2026/05/04
Re: [PR] physical-expr-common: remote `PhysicalExpr::snapshot` method [datafusion]
via GitHub
2026/05/04
Re: [I] Set up process for auditing all Spark commits to assess impact on Comet [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] physical-expr-common: remote `PhysicalExpr::snapshot` method [datafusion]
via GitHub
2026/05/04
Re: [I] Native scan path doesn't honour Parquet field-ID matching when spark.sql.parquet.fieldId.read.enabled=true [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] fix: fall back to Spark for Parquet scans containing TimestampNTZ [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
[PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/04
Re: [I] Track Spark 4.2 test failures [datafusion-comet]
via GitHub
2026/05/04
Re: [I] Track Spark 4.2 test failures [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] test: [Spark 4.1.1] unignore CachedBatchSerializerNoUnwrapSuite [datafusion-comet]
via GitHub
2026/05/04
Re: [I] CachedBatchSerializerNoUnwrapSuite: Comet replaces WholeStageCodegenExec [datafusion-comet]
via GitHub
2026/05/04
Re: [I] CaseWhen does not work with custom implemented column expression [datafusion]
via GitHub
2026/05/04
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] chore: fix spark ansi sum incompatibility message [datafusion-comet]
via GitHub
2026/05/04
Re: [I] Spark 4.1 NullType parquet: parquet-rs rejects BOOLEAN + Unknown logical type [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
2026/05/04
[PR] WIP: windows support [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] Query-aware statistics requests via ScanArgs / ScanResult (RFC for #21624) [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [PR] feat(functions-nested): add array_filter higher-order function [datafusion]
via GitHub
2026/05/04
Re: [PR] docs: start Spark 4.1 known-limitations section, seeded with #4199 [datafusion-comet]
via GitHub
2026/05/04
Re: [I] Requirements for scalar UDF preimage [datafusion]
via GitHub
2026/05/04
Re: [PR] Fix fully matched row groups with null counts [datafusion]
via GitHub
2026/05/04
[PR] build: Enable Spark SQL tests for Spark 4.2.0-preview4 [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: Optimise convert_to_state for SUM and BIT_OR_XOR [datafusion]
via GitHub
2026/05/04
Re: [PR] Fix fully matched row groups with null counts [datafusion]
via GitHub
2026/05/04
[PR] fix: [Spark 4.1] preserve union output partitioning in CometUnionExec [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: type-keyed extensions map for PartitionedFile [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
[PR] feat: add Spark commit audit process [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: type-keyed extensions map for PartitionedFile [datafusion]
via GitHub
2026/05/04
[I] Create `spark-latest` profile [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: type-keyed extensions map for PartitionedFile [datafusion]
via GitHub
2026/05/04
Re: [PR] Add a memory bound FileStatisticsCache for the Listing Table [datafusion]
via GitHub
2026/05/04
[PR] test: [Spark 4.1.1] unignore CachedBatchSerializerNoUnwrapSuite [datafusion-comet]
via GitHub
2026/05/04
Re: [I] Set up process for auditing all Spark commits to assess impact on Comet [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] functions: Add dict support for get field [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
2026/05/04
[I] Bug triage results: 2026-05-04 [datafusion-comet]
via GitHub
2026/05/04
Re: [I] Bug triage results: 2026-04-27 [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] feat: add bug-triage Claude skill [datafusion-comet]
via GitHub
2026/05/04
[PR] docs: start Spark 4.1 known-limitations section, seeded with #4199 [datafusion-comet]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] ci: add `auto detected api change` label on breaking change detecting in the CI [datafusion]
via GitHub
2026/05/04
Re: [I] Breaking change detector: adding "auto detect api change" label on detection [datafusion]
via GitHub
2026/05/04
Re: [PR] ci: add `auto detected api change` label on breaking change detecting in the CI [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
[PR] Optimize ClickBench q17 aggregate limit [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: type-keyed extensions map for PartitionedFile [datafusion]
via GitHub
2026/05/04
Re: [PR] Add reusable plan-time schema alignment helper and apply to RecursiveQueryExec [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] Add support for lambda column capture [datafusion]
via GitHub
2026/05/04
Re: [PR] feat: TABLESAMPLE SYSTEM end-to-end + row-group / row sampling on ParquetSource [datafusion]
via GitHub
2026/05/04
Re: [PR] Feat/map sql extension types [datafusion]
via GitHub
2026/05/04
Re: [PR] Explicitly declare spill codec dependency in `physical-plan` [datafusion]
via GitHub
2026/05/04
Re: [PR] Explicitly declare spill codec dependency in `physical-plan` [datafusion]
via GitHub
2026/05/04
Re: [PR] chore(deps): bump graphviz-rust from 0.9.7 to 0.9.8 [datafusion-ballista]
via GitHub
2026/05/04
Re: [PR] feat: Support RIGHT/FULL joins in NLJ memory-limited execution [datafusion]
via GitHub
2026/05/04
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/04
Re: [I] Make spill codec availability an explicit contract of the spill stack [datafusion]
via GitHub
2026/05/04
Re: [PR] ci: add `auto detected api change` label on breaking change detecting in the CI [datafusion]
via GitHub
2026/05/03
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/03
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/03
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/03
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/03
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/03
Re: [PR] bench: add to_char_array_date32 [datafusion]
via GitHub
2026/05/03
Re: [PR] bench: add to_char_array_date32 [datafusion]
via GitHub
2026/05/03
Re: [I] Support array_agg as a sliding window aggregate by implementing retract_batch [datafusion]
via GitHub
2026/05/03
Re: [PR] feat: Implement `datafusion-spark` `sequence` function [datafusion]
via GitHub
2026/05/03
Re: [I] Extend `datafusion-spark` `sequence` function [datafusion]
via GitHub
2026/05/03
[I] Extend `datafusion-spark` `sequence` function [datafusion]
via GitHub
2026/05/03
[PR] chore(deps): bump graphviz-rust from 0.9.7 to 0.9.8 [datafusion-ballista]
via GitHub
2026/05/03
[PR] chore(deps): bump ctor from 0.12.0 to 1.0.0 [datafusion-ballista]
via GitHub
2026/05/03
[PR] chore(deps): bump github/codeql-action from 4.35.2 to 4.35.3 [datafusion-ballista]
via GitHub
2026/05/03
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/03
[PR] bench: add to_char_array_date32 [datafusion]
via GitHub
2026/05/03
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/03
Re: [PR] perf: Cast entire Date32 array to Date64 on 1st failure [datafusion]
via GitHub
2026/05/03
Re: [PR] fix(metrics): avoid stage metrics inflation by tracking partition snapshots [datafusion-ballista]
via GitHub
2026/05/03
Re: [PR] feat: Add dialect-aware SQL serialization with `ToSql` trait for ClickHouse PascalCase types [datafusion-sqlparser-rs]
via GitHub
2026/05/03
Re: [PR] feat: support url encode expression [datafusion-comet]
via GitHub
2026/05/03
Re: [PR] feat: Support Spark expression minutes_of_time [datafusion-comet]
via GitHub
2026/05/03
Re: [PR] feat: support url_decode expression via StaticInvoke [datafusion-comet]
via GitHub
2026/05/03
Re: [PR] [WIP] feat: support binary lpad/rpad via StaticInvoke [datafusion-comet]
via GitHub
2026/05/03
Re: [PR] chore(deps): bump sphinx from 8.2.3 to 9.1.0 in /docs [datafusion-sandbox]
via GitHub
2026/05/03
Re: [PR] Iceberg v3 support - enable and initial version [iceberg] [datafusion-comet]
via GitHub
2026/05/03
Re: [PR] Acceleration : Iceberg table compaction [iceberg] [datafusion-comet]
via GitHub
2026/05/03
Re: [PR] [WIP] feat: support charTypeWriteSideCheck and varcharTypeWriteSideCheck via StaticInvoke [datafusion-comet]
via GitHub
2026/05/03
Re: [PR] Use datafusion-spark SparkArrayContains for three-valued NULL semantics [datafusion-comet]
via GitHub
2026/05/03
Re: [PR] chore(deps): bump myst-parser from 4.0.1 to 5.0.0 in /docs [datafusion-sandbox]
via GitHub
2026/05/03
Re: [PR] chore(deps): bump sphinx from 8.2.3 to 9.1.0 in /docs [datafusion-sandbox]
via GitHub
2026/05/03
Re: [PR] chore(deps): bump myst-parser from 4.0.1 to 5.0.0 in /docs [datafusion-sandbox]
via GitHub
Earlier messages
Later messages