github

Messages by Thread

[PR] fix: skip null slots when checking overflow in unary negation [datafusion-comet] via GitHub
[PR] Measure pool_peak_bytes per iteration and report the smallest [datafusion] via GitHub
- Re: [PR] Measure pool_peak_bytes per iteration and report the smallest [datafusion] via GitHub
Re: [I] Native Sort-Merge Writer for Iceberg ClusteredWriter Path [datafusion-comet] via GitHub
Re: [PR] feat: add IEJoin for joins with two range predicates [datafusion] via GitHub
Re: [I] Analysis to support`SortPreservingMerge` --> `ProgressiveEval` [datafusion] via GitHub
- Re: [I] Analysis to support`SortPreservingMerge` --> `ProgressiveEval` [datafusion] via GitHub
[PR] feat: support ANSI interval types in native Parquet scans [datafusion-comet] via GitHub
Re: [PR] feat: enable external reclaim for mem spillable df operators [datafusion] via GitHub
[PR] refactor: use Arrow decimal precision validation [datafusion-comet] via GitHub
[I] ScalarValue::new_list silently discards its data_type argument for non-empty input [datafusion] via GitHub
- Re: [I] ScalarValue::new_list silently discards its data_type argument for non-empty input [datafusion] via GitHub
[PR] fix: make collect_list/collect_set argument coercion a normalization barrier [datafusion-comet] via GitHub
[I] collect_list/collect_set can fail with "column types must match schema types" on nested-field nullability drift [datafusion-comet] via GitHub
[PR] feat: native Arrow UDF path to grouped-aggregate, window, and applyInArrow Python operators [datafusion-comet] via GitHub
[I] Add `cli` label for bugs and PRs? [datafusion] via GitHub
[PR] fix: [branch-1.0] raise REMAINDER_BY_ZERO for Float/Double under ANSI mode (#5081) [datafusion-comet] via GitHub
- Re: [PR] fix: [branch-1.0] raise REMAINDER_BY_ZERO for Float/Double under ANSI mode (#5081) [datafusion-comet] via GitHub
Re: [I] Comet 0.15.1 Release [datafusion-comet] via GitHub
- Re: [I] Comet 0.15.1 Release [datafusion-comet] via GitHub
[PR] fix: reduce log verbosity at per-task logging callsites [datafusion-comet] via GitHub
[PR] docs: drop compatibility notes for fixed bugs, rescope string-cast trim note [datafusion-comet] via GitHub
[PR] chore(`HashJoinStream`): rewrite to generators and simplify [datafusion] via GitHub
- Re: [PR] chore(`HashJoinStream`): rewrite to generators and simplify [datafusion] via GitHub
[PR] fix: [branch-1.0] seed native Parquet scan reader options from session config (#5107) [datafusion-comet] via GitHub
- Re: [PR] fix: [branch-1.0] seed native Parquet scan reader options from session config (#5107) [datafusion-comet] via GitHub
Re: [I] Native Parquet scan ignores session-level datafusion.execution.parquet.* options [datafusion-comet] via GitHub
Re: [PR] feat: support Iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
- Re: [PR] feat: support Iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
- Re: [PR] feat: support Iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
- Re: [PR] feat: support Iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
- Re: [PR] feat: support Iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
[PR] Escape closing brackets in bracket-quoted identifiers [datafusion-sqlparser-rs] via GitHub
Re: [PR] feat: [iceberg] support iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
- Re: [PR] feat: [iceberg] support iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
- Re: [PR] feat: [iceberg] support iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
- Re: [PR] feat: [iceberg] support iceberg metadata columns _pos, _spec, _file, and _partition [datafusion-comet] via GitHub
Re: [I] chore: Fix Scala code warnings [datafusion-comet] via GitHub
[I] Restore the `From` / `TryFrom` proto conversions dropped since 54.1.0, and retire `FromProto` / `TryFromProto` [datafusion] via GitHub
[PR] feat: expose Comet version as spark.comet.version runtime config (#5049) [datafusion-comet] via GitHub
- Re: [PR] feat: [branch-1.0] expose Comet version as spark.comet.version runtime config (#5049) [datafusion-comet] via GitHub
[PR] Js/dynamic filters method [datafusion] via GitHub
- Re: [PR] Reapply "Add ExecutionPlan::apply_expressions() (apache#20337)" (apache#22437) [datafusion] via GitHub
- Re: [PR] Reapply "Add ExecutionPlan::apply_expressions() (apache#20337)" (apache#22437) [datafusion] via GitHub
- Re: [PR] Reapply "Add ExecutionPlan::apply_expressions() (apache#20337)" (apache#22437) [datafusion] via GitHub
- Re: [PR] Reapply "Add ExecutionPlan::apply_expressions() (apache#20337)" (apache#22437) [datafusion] via GitHub
- Re: [PR] Reapply "Add ExecutionPlan::apply_expressions() (apache#20337)" (apache#22437) [datafusion] via GitHub
[PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore: convert `PartialHashAggregateStream` and `FinalHashAggregateStream` to async generators and cleanup [datafusion] via GitHub
[PR] move single hash agg stream to generators and less state [datafusion] via GitHub
- Re: [PR] chore(`SingleHashAggregateStream`): refactor to async generator implementation [datafusion] via GitHub
- Re: [PR] chore(`SingleHashAggregateStream`): refactor to async generator implementation [datafusion] via GitHub
- Re: [PR] chore(`SingleHashAggregateStream`): refactor to async generator implementation [datafusion] via GitHub
- Re: [PR] chore(`SingleHashAggregateStream`): refactor to async generator implementation [datafusion] via GitHub
- Re: [PR] chore(`SingleHashAggregateStream`): refactor to async generator implementation [datafusion] via GitHub
- Re: [PR] chore(`SingleHashAggregateStream`): refactor to async generator implementation [datafusion] via GitHub
[PR] Support PostgreSQL `ALTER DEFAULT PRIVILEGES` [datafusion-sqlparser-rs] via GitHub
Re: [I] Support per-SessionContext object store credentials without os.environ (thread-safety) [datafusion-python] via GitHub
Re: [PR] docs: fix bootstrap instructions in contributor guide [datafusion-python] via GitHub
- Re: [PR] docs: fix bootstrap instructions in contributor guide [datafusion-python] via GitHub
[PR] chore(`PartialReduceHashAggregateStream`): convert to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore(`PartialReduceHashAggregateStream`): convert to async generators and cleanup [datafusion] via GitHub
- Re: [PR] chore(`PartialReduceHashAggregateStream`): convert to async generators and cleanup [datafusion] via GitHub
[PR] feat: [branch-1.0] disable native columnar-to-row conversion by default (#5114) [datafusion-comet] via GitHub
- Re: [PR] feat: [branch-1.0] disable native columnar-to-row conversion by default (#5114) [datafusion-comet] via GitHub
[PR] chore(deps): bump the all-other-cargo-deps group across 1 directory with 7 updates [datafusion] via GitHub
[D] DISCUSSION: San Francisco DataFusion Meetup (August 13, 2026) [datafusion] via GitHub
[PR] chore: cleanup `OrderedPartialAggregateStream` more [datafusion] via GitHub
- Re: [PR] chore: cleanup `OrderedPartialAggregateStream` more [datafusion] via GitHub
- Re: [PR] chore: cleanup `OrderedPartialAggregateStream` more [datafusion] via GitHub
- [PR] chore: cleanup `OrderedPartialAggregateStream` more [datafusion] via GitHub
- Re: [PR] chore: cleanup `OrderedPartialAggregateStream` more [datafusion] via GitHub
[PR] Stop printing a trailing space after `PUBLIC` in `GRANT` and `REVOKE` [datafusion-sqlparser-rs] via GitHub
[PR] Print `GRANT` clauses in the order the parser reads them [datafusion-sqlparser-rs] via GitHub
[I] Populate FileWriteMetadata.format_metadata with Thrift-serialized Parquet footer for FFI consumers [datafusion] via GitHub
[PR] chore(deps): bump the all-other-cargo-deps group across 1 directory with 8 updates [datafusion] via GitHub
- Re: [PR] chore(deps): bump the all-other-cargo-deps group across 1 directory with 8 updates [datafusion] via GitHub
- Re: [PR] chore(deps): bump the all-other-cargo-deps group across 1 directory with 8 updates [datafusion] via GitHub
[PR] fix: match Spark's whitespace trim semantics for casts from string to boolean, integral, float/double and decimal [datafusion-comet] via GitHub
- Re: [PR] fix: match Spark's whitespace trim semantics for casts from string to boolean, integral, float/double and decimal [datafusion-comet] via GitHub
- Re: [PR] fix: match Spark's whitespace trim semantics for casts from string to boolean, integral, float/double and decimal [datafusion-comet] via GitHub
[PR] chore(OrderedFinalAggregateStream): refactor to async generator [datafusion] via GitHub
- Re: [PR] chore(`OrderedFinalAggregateStream`): refactor to async generator [datafusion] via GitHub
- Re: [PR] chore(`OrderedFinalAggregateStream`): refactor to async generator [datafusion] via GitHub
- Re: [PR] chore(`OrderedFinalAggregateStream`): refactor to async generator [datafusion] via GitHub
- Re: [PR] chore(`OrderedFinalAggregateStream`): refactor to async generator [datafusion] via GitHub
- Re: [PR] chore(`OrderedFinalAggregateStream`): refactor to async generator [datafusion] via GitHub
Re: [I] cast string to boolean: trim ISO control bytes to match Spark's UTF8String.trimAll [datafusion-comet] via GitHub
- Re: [I] cast string to boolean: trim ISO control bytes to match Spark's UTF8String.trimAll [datafusion-comet] via GitHub
- Re: [I] cast string to boolean: trim ISO control bytes to match Spark's UTF8String.trimAll [datafusion-comet] via GitHub
[I] [EPIC] cast from string: trim semantics diverge from Spark across all numeric, datetime and boolean targets [datafusion-comet] via GitHub
[PR] fix(scheduler): fail running jobs when all executors are lost [datafusion-ballista] via GitHub
- Re: [PR] fix(scheduler): fail running jobs when all executors are lost [datafusion-ballista] via GitHub
- Re: [PR] fix(scheduler): fail running jobs when all executors are lost [datafusion-ballista] via GitHub
- Re: [PR] fix(scheduler): fail running jobs when all executors are lost [datafusion-ballista] via GitHub
- Re: [PR] fix(scheduler): fail running jobs when all executors are lost [datafusion-ballista] via GitHub
- Re: [PR] fix(scheduler): fail running jobs when all executors are lost [datafusion-ballista] via GitHub
[PR] feat(core): scaffold PrefixMergeExec for cross-partition windowed-aggregate state merge [datafusion-ballista] via GitHub
[PR] feat(physical-plan): expose finalized Accumulator state on BoundedWindowAggExec [datafusion] via GitHub
- Re: [PR] feat(physical-plan): expose finalized Accumulator state on BoundedWindowAggExec [datafusion] via GitHub
[PR] Accept CREATE SEQUENCE options in any order [datafusion-sqlparser-rs] via GitHub
[I] avg on decimal raises spurious ARITHMETIC_OVERFLOW for empty or all-null input in ANSI mode [datafusion-comet] via GitHub
- Re: [I] avg on decimal raises spurious ARITHMETIC_OVERFLOW for empty or all-null input in ANSI mode [datafusion-comet] via GitHub
Re: [I] Proto: migrate ScalarSubqueryExec (adds a subquery-results scope to the decode ctx) [datafusion] via GitHub
[PR] refactor(proto): move PartitionedFile / FileGroup serde into datafusion-datasource [datafusion] via GitHub
- Re: [PR] refactor(proto): move PartitionedFile / FileGroup serde into datafusion-datasource [datafusion] via GitHub
- Re: [PR] refactor(proto): move PartitionedFile / FileGroup serde into datafusion-datasource [datafusion] via GitHub
- Re: [PR] refactor(proto): move PartitionedFile / FileGroup serde into datafusion-datasource [datafusion] via GitHub
- Re: [PR] refactor(proto): move PartitionedFile / FileGroup serde into datafusion-datasource [datafusion] via GitHub
- Re: [PR] refactor(proto): move PartitionedFile / FileGroup serde into datafusion-datasource [datafusion] via GitHub
[PR] [QECO-1569] Preserve projected constants in aggregate arguments [datafusion] via GitHub
[PR] fix(parquet): cache deferred page index loads [datafusion] via GitHub
- Re: [PR] fix(parquet): cache deferred page index loads [datafusion] via GitHub
- Re: [PR] fix(parquet): cache deferred page index loads [datafusion] via GitHub
- Re: [PR] fix(parquet): cache deferred page index loads [datafusion] via GitHub
[PR] refactor(proto): put Partitioning / PhysicalSortExpr serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
- Re: [PR] refactor(proto): put Partitioning / sort-expression serde on the types [datafusion] via GitHub
[I] Physical `FilterPushdown` deletes a filter above an anti join when only the probe side absorbs it [datafusion] via GitHub
- Re: [I] Physical `FilterPushdown` deletes a filter above an anti join when only the probe side absorbs it [datafusion] via GitHub
Re: [I] Further optimize Spark hex byte encoding: reuse input NullBuffer and special-case the no-nulls path [datafusion] via GitHub
[PR] refactor: seal the ExecutionPlan proto dispatch traits [datafusion] via GitHub
- Re: [PR] refactor: mark the ExecutionPlan proto dispatch traits as non-public API [datafusion] via GitHub
- Re: [PR] refactor: mark the ExecutionPlan proto dispatch traits as non-public API [datafusion] via GitHub
- Re: [PR] refactor: mark the ExecutionPlan proto dispatch traits as non-public API [datafusion] via GitHub
- Re: [PR] refactor: mark the ExecutionPlan proto dispatch traits as non-public API [datafusion] via GitHub
[PR] fix(physical-plan): preserve Exact(0) in FilterExec for null_count, distinct_count and total_byte_size upon empty input [datafusion] via GitHub
- Re: [PR] fix(physical-plan): preserve Exact(0) in FilterExec for null_count, distinct_count and total_byte_size upon empty input [datafusion] via GitHub
- Re: [PR] fix(physical-plan): preserve Exact(0) in FilterExec for null_count, distinct_count and total_byte_size upon empty input [datafusion] via GitHub
[PR] fix(proto): prevent duplicate partition statistics on roundtrip [datafusion] via GitHub
- Re: [PR] fix(proto): prevent duplicate partition statistics on roundtrip [datafusion] via GitHub
- Re: [PR] fix(proto): prevent duplicate partition statistics on roundtrip [datafusion] via GitHub
- Re: [PR] fix(proto): prevent duplicate partition statistics on roundtrip [datafusion] via GitHub
- Re: [PR] fix(proto): prevent duplicate partition statistics on roundtrip [datafusion] via GitHub
- Re: [PR] fix(proto): prevent duplicate partition statistics on roundtrip [datafusion] via GitHub
- Re: [PR] fix(proto): prevent duplicate partition statistics on roundtrip [datafusion] via GitHub
- Re: [PR] fix(proto): prevent duplicate partition statistics on roundtrip [datafusion] via GitHub
Re: [PR] fix(cli): sort jobs by stage completion ratio [datafusion-ballista] via GitHub
- Re: [PR] fix(cli): sort jobs by stage completion ratio [datafusion-ballista] via GitHub
[I] PartitionedFile statistics grow by one ColumnStatistics per partition column on every protobuf round trip [datafusion] via GitHub
- Re: [I] PartitionedFile statistics grow by one ColumnStatistics per partition column on every protobuf round trip [datafusion] via GitHub
- Re: [I] PartitionedFile statistics grow by one ColumnStatistics per partition column on every protobuf round trip [datafusion] via GitHub
[PR] [QECO-1569] Allow constant expressions in date_part [datafusion] via GitHub
[PR] perf: reduce per-batch fixed cost of native columnar-to-row conversion [datafusion-comet] via GitHub
- Re: [PR] perf: reduce per-batch and per-row cost of native columnar-to-row conversion [datafusion-comet] via GitHub
[PR] fix: [branch-1.0] throw ARITHMETIC_OVERFLOW for Long.MinValue div -1 under ANSI mode (#5084) [datafusion-comet] via GitHub
- Re: [PR] fix: [branch-1.0] throw ARITHMETIC_OVERFLOW for Long.MinValue div -1 under ANSI mode (#5084) [datafusion-comet] via GitHub
[PR] fix: [branch-1.0] honor fail_on_error in native make_decimal (#5080) [datafusion-comet] via GitHub
- Re: [PR] fix: [branch-1.0] honor fail_on_error in native make_decimal (#5080) [datafusion-comet] via GitHub
Re: [I] Performance optimizations for native in-memory cache (follow-on to #4591) [datafusion-comet] via GitHub
- Re: [I] Performance optimizations for native in-memory cache (follow-on to #4591) [datafusion-comet] via GitHub
[I] Make RowsGroupColumn single-field extraction explicit [datafusion] via GitHub
- Re: [I] Make RowsGroupColumn single-field extraction explicit [datafusion] via GitHub
- Re: [I] Make RowsGroupColumn single-field extraction explicit [datafusion] via GitHub
- Re: [I] Make RowsGroupColumn single-field extraction explicit [datafusion] via GitHub
[PR] fix: reject nested arrays in array_distance [datafusion] via GitHub
- Re: [PR] fix: reject nested arrays in array_distance [datafusion] via GitHub
- Re: [PR] fix: reject nested arrays in array_distance [datafusion] via GitHub
[I] Preallocate RowsGroupColumn buffers during take_n rebuilds [datafusion] via GitHub
- Re: [I] Preallocate RowsGroupColumn buffers during take_n rebuilds [datafusion] via GitHub
- Re: [I] Preallocate RowsGroupColumn buffers during take_n rebuilds [datafusion] via GitHub
[I] Specialize GroupColumn storage for Dictionary and Run-End Encoded group keys [datafusion] via GitHub
- Re: [I] Specialize GroupColumn storage for Dictionary and Run-End Encoded group keys [datafusion] via GitHub
[PR] docs: Fixes incorrect type name in `UserDefinedLogicalNode` comment [datafusion] via GitHub
- Re: [PR] docs: Fixes incorrect type name in `UserDefinedLogicalNode` comment [datafusion] via GitHub
- Re: [PR] docs: Fixes incorrect type name in `UserDefinedLogicalNode` comment [datafusion] via GitHub
Re: [PR] PostgreSQL: support INCLUDE on PRIMARY KEY / UNIQUE table constraints [datafusion-sqlparser-rs] via GitHub
Re: [PR] Add visitors for ORDER BY and GROUP BY [datafusion-sqlparser-rs] via GitHub
Re: [PR] PostgreSQL: accept CREATE SEQUENCE options in any clause order [datafusion-sqlparser-rs] via GitHub
- Re: [PR] PostgreSQL: accept CREATE SEQUENCE options in any clause order [datafusion-sqlparser-rs] via GitHub
Re: [PR] feat: support prefix-scoped object stores in the registry [datafusion] via GitHub
Re: [PR] Add XMLPARSE expression [datafusion-sqlparser-rs] via GitHub
- Re: [PR] Add XMLPARSE expression [datafusion-sqlparser-rs] via GitHub
[PR] docs(upgrading): document RecursiveQuery schema field in upgrade guide [datafusion] via GitHub
Re: [PR] perf: Concrete TopK array storage [datafusion] via GitHub
- Re: [PR] perf: Concrete TopK array storage [datafusion] via GitHub
- Re: [PR] perf: Concrete TopK array storage [datafusion] via GitHub
- Re: [PR] perf: Concrete TopK array storage [datafusion] via GitHub
- Re: [PR] perf: Concrete TopK array storage [datafusion] via GitHub
- Re: [PR] perf: Concrete TopK array storage [datafusion] via GitHub
[PR] perf(sort-merge): cache current-row bytes in RowValues for SortPreservingMerge [datafusion] via GitHub
- Re: [PR] perf(sort-merge): cache current-row bytes in RowValues for SortPreservingMerge [datafusion] via GitHub

Earlier messages