zhztheplayer commented on issue #8018:
URL: 
https://github.com/apache/incubator-gluten/issues/8018#issuecomment-2496912467

   > also we can calculate the shuffle status complexity to roughly estimate 
mapStatus memory occupation. The rule works in follwing steps
   
   @zjuwangg I think this part is comparatively tricker than other. Do you 
think you can start from an individual PR which adds an utility / API to do 
resource estimation on query plans (if the idea aligns with your approach, I am 
not sure)? Then in subsequent PRs we can adopt this API in Gluten and 
whole-stage transformer for remaining work.
   
   Moreover, is the feature targeted more for batch query scenarios (ETL, 
nightly, etc.)? Since I remember changing a Spark resource profile usually 
causes rebooting of executors, which will cause larger latency on ad-hoc 
queries?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to