andygrove opened a new pull request, #4563:
URL: https://github.com/apache/datafusion-comet/pull/4563

   ## Which issue does this PR close?
   
   Closes #.
   
   ## Rationale for this change
   
   The user guide has a `Supported Spark Operators` page, but it lists only the 
operators Comet replaces and does not tell users what is *not* supported or 
what is planned. This PR mirrors what #4550 did for expressions: it turns the 
operators page into a complete, status-aware reference so users can see at a 
glance whether a given Spark physical operator is supported, supported with 
caveats, planned, or not currently planned.
   
   ## What changes are included in this PR?
   
   Rewrites `docs/source/user-guide/latest/operators.md` into a complete 
reference:
   
   - A four-bucket status legend (✅ Supported, ⚠️ Supported with caveats, 🔜 
Planned, 💤 Not currently planned), matching the expression reference.
   - A `Not currently planned` section for operator families that fall back by 
design (Structured Streaming operators, Cartesian / cross joins, sampling and 
range generation).
   - Per-category tables (Scans, Projection and filtering, Sorting and 
limiting, Aggregation, Joins, Exchanges, Window, Generators and set operations, 
Writes, Python and UDF).
   
   Support status is derived from the `createExecEnabledConfig` defaults in 
`CometConf` (for example `window` and `takeOrderedAndProject` are enabled by 
default, `localTableScan` is disabled by default) plus the operator handling in 
`CometExecRule`. Notably this corrects a stale note: `WindowExec` is enabled by 
default now, not disabled.
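
   To illustrate how a default-driven status lookup of this kind could work, here is a minimal Python sketch. It is not Comet code. The `spark.comet.exec.<name>.enabled` key pattern and the helper names `exec_enabled_key` and `is_exec_enabled` are assumptions made for illustration. Only the three default values are taken from this description.

```python
from typing import Dict, Optional

# Assumed key prefix; check CometConf for the real value.
EXEC_CONFIG_PREFIX = "spark.comet.exec"

# Defaults as stated in this PR description (not an exhaustive list).
EXEC_DEFAULTS: Dict[str, bool] = {
    "window": True,
    "takeOrderedAndProject": True,
    "localTableScan": False,
}


def exec_enabled_key(name: str) -> str:
    """Build the (assumed) per-operator enable key for an operator name."""
    return f"{EXEC_CONFIG_PREFIX}.{name}.enabled"


def is_exec_enabled(name: str, overrides: Optional[Dict[str, str]] = None) -> bool:
    """Return the effective setting: an explicit override wins, else the default."""
    raw = (overrides or {}).get(exec_enabled_key(name))
    if raw is None:
        return EXEC_DEFAULTS[name]
    return raw.strip().lower() == "true"


if __name__ == "__main__":
    for op in EXEC_DEFAULTS:
        print(exec_enabled_key(op), is_exec_enabled(op))
```

   In this sketch an explicit user setting overrides the built-in default, which is why a documentation page derived from defaults describes only out-of-the-box behavior.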
   
   This is a **draft** because the status of several not-yet-supported 
operators is an initial proposal and would benefit from maintainer 
confirmation, in particular:
   
   - `InMemoryTableScanExec` (marked Planned, could be Not currently planned)
   - `SortAggregateExec` (Planned vs Not currently planned)
   - `WindowGroupLimitExec` and the Python / UDF operators (Planned references)
   
   Open issues / PRs referenced for Planned items: #4429 (nested loop join), 
#4234 (PyArrow UDFs), #4393 (LocalTableScan default), #2721 (window functions).
   
   ## How are these changes tested?
   
   Documentation only. Rendered locally and checked with `prettier`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

