github
Earlier messages
Later messages
Messages by Date
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [I] Reduce macOS CI matrix [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] ci: reduce macOS PR matrix to single Spark 4.0 profile [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] ci: reduce macOS PR matrix to single Spark 4.0 profile [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] ci: add breaking change detector [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [I] CaseWhen does not work with custom implemented column expression [datafusion]
via GitHub
2026/04/28
[PR] docs: clarify when scalar function serde must set return type [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: Support Spark levenshtein expression in native execution [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] Fix metrics for repartition [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] build: add spark-4.2 Maven profile targeting 4.2.0-preview4 [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
[PR] feat(functions-nested): add array_filter higher-order function [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Support Spark levenshtein expression in native execution [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] perf: improve Int64 `generate_series` and `range` performance [datafusion]
via GitHub
2026/04/28
Re: [PR] ci: add breaking change detector [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [I] CaseWhen does not work with custom implemented column expression [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Support Spark levenshtein expression in native execution [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [I] Uncancellable FilterExec when the predicate rejects all rows [datafusion]
via GitHub
2026/04/28
[PR] perf: optimize retract_batch for `median` and `percentile_cont` [datafusion]
via GitHub
2026/04/28
Re: [PR] chore: audit array_intersect and expand SQL test coverage [datafusion-comet]
via GitHub
2026/04/28
[I] FIRST/LAST returns wrong result with `PartialMerge` [datafusion-comet]
via GitHub
2026/04/28
[I] Apply spark.comet.exec.strictFloatingPoint to RangePartitioning [datafusion-comet]
via GitHub
2026/04/28
Re: [I] Uncancellable FilterExec when the predicate rejects all rows [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: add MapSort expression support for Spark 4.0 [datafusion-comet]
via GitHub
2026/04/28
Re: [I] Uncancellable FilterExec when the predicate rejects all rows [datafusion]
via GitHub
2026/04/28
[I] Uncancellable FilterExec when the predicate rejects all rows [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: add MapSort expression support for Spark 4.0 [datafusion-comet]
via GitHub
2026/04/28
[PR] Allow pickling PyExpr [datafusion-python]
via GitHub
2026/04/28
Re: [I] Comet 0.15.1 Release [datafusion-comet]
via GitHub
2026/04/28
Re: [I] Iceberg reflection failure [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] Add is_null/is_not_true trait methods to PhysicalExpr [datafusion]
via GitHub
2026/04/28
[PR] test: support fallback chain in CometPlanStabilitySuite, dedupe existing goldens [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] [DRAFT] Add lambda substrait support [datafusion]
via GitHub
2026/04/28
[PR] feat: task-level input metrics (bytesRead) for Iceberg native scan [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: support `PartialMerge` aggregation mode [datafusion-comet]
via GitHub
2026/04/28
[PR] WIP: wire `array_exists` on experimental DF branch [datafusion-comet]
via GitHub
2026/04/28
[PR] [WIP] ci: enable PR test matrix and TPCDS plan-stability for Spark 4.2 [datafusion-comet]
via GitHub
2026/04/28
[PR] Snowflake: Fix COPY INTO transformation parsing for cast expressions [datafusion-sqlparser-rs]
via GitHub
2026/04/28
[PR] BigQuery: Parse WITH CONNECTION on CREATE EXTERNAL TABLE [datafusion-sqlparser-rs]
via GitHub
2026/04/28
[PR] Snowflake: Accept COPY GRANTS after CREATE VIEW column list [datafusion-sqlparser-rs]
via GitHub
2026/04/28
Re: [PR] test(python): add datafusion-python compatibility tests [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] fix(python): ignore ballista-namespaced cluster_config keys locally [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Ballista configs cannot be set in Python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] ci: enable Spark 4.1 PR test matrix [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] ci: enable Spark 4.1 PR test matrix [datafusion-comet]
via GitHub
2026/04/28
[PR] Implement UnionCheckVisitor [datafusion]
via GitHub
2026/04/28
Re: [PR] test(python): add datafusion-python compatibility tests [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
[PR] perf: improve Int64 `generate_series` and `range` performance [datafusion]
via GitHub
2026/04/28
[PR] test(python): add datafusion-python compatibility tests [datafusion-ballista]
via GitHub
2026/04/28
Re: [D] DISCUSSION: Boston Datafusion Meetup September 2026 [datafusion]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: suport spark compatible floor function [datafusion]
via GitHub
2026/04/28
Re: [PR] [datafusion-spark] Add Spark-compatible isnan function [datafusion]
via GitHub
2026/04/28
[PR] [datafusion-spark] Add Spark-compatible isnan function [datafusion]
via GitHub
2026/04/28
Re: [PR] [datafusion-spark] Add Spark-compatible floor function [datafusion]
via GitHub
2026/04/28
Re: [PR] Add SQL based benchmarking harness, port tpch to use framework [datafusion]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
[PR] fix(python): ignore ballista-namespaced cluster_config keys locally [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] ci: add breaking change detector [datafusion]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] ci: add breaking change detector [datafusion]
via GitHub
2026/04/28
Re: [PR] docs(release): document PyPI publish process for python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] perf: Add `BulkNullStringArrayBuilder` trait, use in `repeat` [datafusion]
via GitHub
2026/04/28
Re: [PR] docs(release): document PyPI publish process for python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] docs(release): document PyPI publish process for python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] ci: add breaking change detector [datafusion]
via GitHub
2026/04/28
Re: [PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
2026/04/28
Re: [PR] docs(release): document PyPI publish process for python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
2026/04/28
Re: [PR] ci: add breaking change detector [datafusion]
via GitHub
2026/04/28
Re: [PR] ci: add breaking change detector [datafusion]
via GitHub
2026/04/28
Re: [PR] perf: Optimize `substr_index` to use bulk-NULL string builder [datafusion]
via GitHub
2026/04/28
Re: [PR] docs(release): document PyPI publish process for python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Python release wheels: macOS x86_64 wheel is missing and arm64 wheel is built twice [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] ci: drop Intel macOS Python wheel build [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Refactor Spark 4.0 and 4.1 shims [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] refactor: consolidate identical spark-4.0 and spark-4.1 shims into spark-4.x [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] ci: drop Intel macOS Python wheel build [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] chore: update python deps to ballista and datafusion 52 [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] chore: update python deps to ballista and datafusion 52 [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] chore: update python deps to ballista and datafusion 52 [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] feat: defer sort-shuffle materialization with interleave_record_batch [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] ci: drop Intel macOS Python wheel build [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] ci: drop Intel macOS Python wheel build [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Use interleave_record_batch to avoid tiny batches in sort-based shuffle [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] fix: grouping separator for float and decimal [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: support binary arguments for StringConcat operator [datafusion]
via GitHub
2026/04/28
[PR] ci: drop Intel macOS Python wheel build [datafusion-ballista]
via GitHub
2026/04/28
[I] Adaptive query planner does not support sort-based shuffle [datafusion-ballista]
via GitHub
2026/04/28
[I] Iceberg reflection failure [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] refactor `array_remove` benchmarks & add nested benches [datafusion]
via GitHub
2026/04/28
[PR] docs(release): document PyPI publish process for python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] feat: Add standalone shuffle writer benchmark that shuffles real Parquet input [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] feat: Add standalone shuffle writer benchmark that shuffles real Parquet input [datafusion-ballista]
via GitHub
2026/04/28
[I] Add installation instructions to Python user guide [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Create PyPi Release Process [datafusion-ballista]
via GitHub
2026/04/28
[I] Python release wheels: macOS x86_64 wheel is missing and arm64 wheel is built twice [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Comet native sort lacks row-format support for Struct(Map(...)) sort keys [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] docs: improve Python documentation structure [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] docs: improve Python documentation structure [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] docs: improve Python documentation structure [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Ballista configs cannot be set in Python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] docs: improve Python documentation structure [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
[I] Ballista configs cannot be set in Python client [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
[I] Add IO_BLOCK_TRANSPORT support for sort-based shuffle [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] ci: enable Spark 4.1 PR test matrix [datafusion-comet]
via GitHub
2026/04/28
[I] SPARK-53968 SQLViewSuite: decimal arithmetic returns ~10x smaller values through view CTE on Spark 4.1.1 [datafusion-comet]
via GitHub
2026/04/28
[I] Comet native sort lacks row-format support for Struct(Map(...)) sort keys [datafusion-comet]
via GitHub
2026/04/28
[I] EXCEPT ALL / INTERSECT ALL with GROUP BY return incorrect results on Spark 4.1.1 [datafusion-comet]
via GitHub
2026/04/28
[I] Comet native scan rejects invalid UTF-8 byte sequences in STRING column (hll.sql on Spark 4.1) [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
[I] Type annotation requires UDFs to return a type instead of an array [datafusion-python]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] fix: grouping separator for float and decimal [datafusion]
via GitHub
2026/04/28
Re: [PR] Add support for nested types to nullif. [datafusion]
via GitHub
2026/04/28
Re: [I] Add support for nested types to `nullif`. [datafusion]
via GitHub
2026/04/28
Re: [I] Support `EXPLAIN ANALYZE` in Ballista [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] feat: Support `EXPLAIN ANALYZE` in Ballista [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] feat(unparser): Keep inner join `Filter → TableScan` predicates to `WHERE` instead of moving to `JOIN ON` [datafusion]
via GitHub
2026/04/28
Re: [PR] Add StatisticsContext parameter to partition_statistics [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Support Spark levenshtein expression in native execution [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] fix: fix elapsed_compute metric in ParquetSink to report encoding time only [datafusion]
via GitHub
2026/04/28
Re: [I] Potential Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [I] Potential Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
2026/04/28
Re: [PR] chore(deps): bump pbjson-types from 0.8.0 to 0.9.0 in the proto group [datafusion]
via GitHub
2026/04/28
Re: [PR] chore(deps): bump pbjson-types from 0.8.0 to 0.9.0 in the proto group [datafusion]
via GitHub
2026/04/28
[PR] chore(deps): bump libloading from 0.8.9 to 0.9.0 [datafusion]
via GitHub
2026/04/28
Re: [PR] ci: pin JDK per Spark version in Iceberg workflow matrix [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
[PR] chore(deps): update pydata-sphinx-theme requirement from <1,>=0.17.0 to >=0.17.1,<1 in /docs [datafusion]
via GitHub
2026/04/28
[PR] chore(deps): bump pbjson-types from 0.8.0 to 0.9.0 in the proto group [datafusion]
via GitHub
2026/04/28
[PR] chore(deps): bump taiki-e/install-action from 2.75.18 to 2.75.23 [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [I] Blog post about 1000 distinct committers / history of the project [datafusion]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: LogicalPlanningPipeline [datafusion]
via GitHub
2026/04/28
Re: [PR] feat: Scheduler config update [datafusion-ballista]
via GitHub
2026/04/28
Re: [I] Potential Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [I] [DRAFT, EPIC] Full lambda support [datafusion]
via GitHub
2026/04/28
[PR] Improved multiple column aggregation performance by using bitmasks rather than `Vec<bool>` [datafusion]
via GitHub
2026/04/28
Re: [PR] ci: pin JDK per Spark version in Iceberg workflow matrix [datafusion-comet]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] fix(proto): correctly serialize FilterExec empty projection [datafusion]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/28
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub