andygrove opened a new issue, #1560:
URL: https://github.com/apache/datafusion-comet/issues/1560

   ### What is the problem the feature request solves?
   
   PR https://github.com/apache/datafusion-comet/pull/1525 improved the Tuning 
Guide, but the guide still contains incorrect or misleading information, some 
of which was only discovered while working on that PR.
   
   This is a follow-on issue to make improvements to the memory pool 
configuration code, documentation, and tests.
   
   Here are some initial thoughts on areas that need attention:
   
   - When running in off-heap mode, we tell users to specify Comet overhead 
using `spark.memory.offHeap.size`. The original goal of using off-heap memory 
was to implement "unified memory management," where the native pool interacts 
with Spark's memory pool so that Spark and Comet share one unified memory 
pool. However, we now also have a "fair_unified" pool that can be used in 
off-heap mode, and it does not use the "unified memory management" approach, 
so the name is misleading. The pool size can also be configured in numerous 
ways, and this could perhaps be simplified.
   - There are very few tests for any of the code that computes Comet memory 
overhead or memory pool sizes.
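
To make the testing gap concrete, here is a minimal sketch of the kind of unit-testable helper the second bullet has in mind. It is not Comet's actual code: `parse_size` and `offheap_pool_size` are hypothetical names, the size parser is a simplified stand-in for Spark's own, and the only behavior assumed is the one described above (in off-heap mode the pool size comes from `spark.memory.offHeap.size`).

```python
import re

# Binary size suffixes in the style of Spark size strings (e.g. "512m", "4g").
# This is a simplified stand-in, not Spark's own parser.
_UNITS = {"b": 1, "k": 1024, "m": 1024**2, "g": 1024**3, "t": 1024**4}


def parse_size(text: str) -> int:
    """Convert a size string such as '4g' or '512mb' to bytes."""
    match = re.fullmatch(r"(\d+)\s*([bkmgt])?b?", text.strip().lower())
    if match is None:
        raise ValueError(f"unrecognized size: {text!r}")
    number, unit = match.groups()
    # A bare number is treated as bytes.
    return int(number) * _UNITS[unit or "b"]


def offheap_pool_size(conf: dict) -> int:
    """Hypothetical helper: pool size in bytes, off-heap mode only.

    Assumes the pool size is taken directly from spark.memory.offHeap.size;
    other modes are deliberately out of scope for this sketch.
    """
    if conf.get("spark.memory.offHeap.enabled", "false").lower() != "true":
        raise ValueError("off-heap mode is not enabled")
    return parse_size(conf["spark.memory.offHeap.size"])
```

With a helper shaped like this, each way of configuring the pool size could get its own small test case, which would also help when simplifying the options.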
   
   
   ### Describe the potential solution
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

