NGA-TRAN commented on issue #17718:
URL: https://github.com/apache/datafusion/issues/17718#issuecomment-3404142193
Thanks @alamb for pointing me to that great discussion.
My proposal won’t dive into the details of LogicalPlan, PhysicalPlan, or
Rust APIs/traits. Instead, it aims to provide a high-level but comprehensive
view, including:
1. A Join Graph for TPC-H Query 5
2. Join Order Enumeration for Q5 using **abstract** `Join Ranking` and
   `Plan Pruning`
3. A customizable Join Ranking example for API design, considering:
   - Input data partitioning
   - Input sort order
   - Join relationships (1:m vs. m:n)
   - Table selectivity
   - Possibly more, such as the sortedness of GROUP BY and ORDER BY columns
4. Plan Pruning: examples using rules or an abstract cost model
5. A simple, illustrative Cost Model example
The goal is to spark deeper discussion and help break this epic into
focused, actionable tickets.
@alamb @JanKaul @LorrensP-2158466 @adriangb @jonathanc-n @2010YOUY01 — let
me know if you’re interested in this write-up, or if your current needs are
already covered.