andygrove opened a new pull request, #190: URL: https://github.com/apache/datafusion-site/pull/190
## Summary Add a blog post describing the systematic per-category audit of every supported Spark expression in DataFusion Comet against Spark 3.4.3, 3.5.8, 4.0.1, and 4.1.1. The work produced 12 audit PRs ([4469](https://github.com/apache/datafusion-comet/pull/4469), [4470](https://github.com/apache/datafusion-comet/pull/4470), [4473](https://github.com/apache/datafusion-comet/pull/4473), [4474](https://github.com/apache/datafusion-comet/pull/4474), [4475](https://github.com/apache/datafusion-comet/pull/4475), [4476](https://github.com/apache/datafusion-comet/pull/4476), [4478](https://github.com/apache/datafusion-comet/pull/4478), [4479](https://github.com/apache/datafusion-comet/pull/4479), [4480](https://github.com/apache/datafusion-comet/pull/4480), [4483](https://github.com/apache/datafusion-comet/pull/4483), [4486](https://github.com/apache/datafusion-comet/pull/4486), [4493](https://github.com/apache/datafusion-comet/pull/4493)), one skill-update PR ([4468](https://github.com/apache/datafusion-comet/pull/4468)), and 14 tracking issues for higher-risk findings (#4462–#4467, #4471–#4472, #4477, #4481–#4482, #4484–#4485, #4488–#4492). The post covers: - Why an expression audit matters (the four moving parts of a Comet expression that have to stay aligned) - The [`audit-comet-expression`](https://github.com/apache/datafusion-comet/blob/main/.claude/skills/audit-comet-expression/SKILL.md) Claude Code skill that drives the work - The Step 5 consistency rules and the common antipatterns it catches - What the Spark cross-version diff actually looks like in practice - A sample of the 14 tracking issues filed - The parallel-subagent dispatch pattern - The skill-tightening PR that came out of the work Marking this as a draft for review while the audit PRs land. ## Test plan - [ ] Render locally via `make` and verify formatting - [ ] Check that all referenced PR / issue numbers resolve - [ ] Confirm the post appears in the correct chronological place on the index -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
