(incubator-gluten) branch main updated: [DOC] Document Stage Level Resource Profile Adjustment feature (#8908)

felixybw Fri, 07 Mar 2025 23:03:11 -0800

This is an automated email from the ASF dual-hosted git repository.

felixybw pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/incubator-gluten.git



The following commit(s) were added to refs/heads/main by this push:
     new 77b616029f [DOC] Document Stage Level Resource Profile Adjustment 
feature (#8908)
77b616029f is described below

commit 77b616029febffe1419056a8cb7dcbc631bcfc20
Author: Terry Wang <[email protected]>
AuthorDate: Sat Mar 8 15:03:02 2025 +0800

    [DOC] Document Stage Level Resource Profile Adjustment feature (#8908)
    
    Add document about how to use StageLevelResource Auto adjust introduced in 
#8209
---
 docs/Configuration.md                     |   3 ++
 docs/get-started/Velox.md                 |   3 ++
 docs/get-started/VeloxStageResourceAdj.md |  74 ++++++++++++++++++++++++++++++
 docs/image/velox_apply_stage_resource.png | Bin 0 -> 124896 bytes
 4 files changed, 80 insertions(+)

diff --git a/docs/Configuration.md b/docs/Configuration.md
index 6c452ef86b..1741d3e34e 100644
--- a/docs/Configuration.md
+++ b/docs/Configuration.md
@@ -99,6 +99,9 @@ The following configurations are related to Velox settings.
 | spark.gluten.sql.columnar.backend.velox.orc.scan.enabled             | 
Enable velox orc scan. If disabled, vanilla spark orc scan will be used.        
                                                                   | true       
       |
 | spark.gluten.sql.complexType.scan.fallback.enabled                   | Force 
fallback for complex type scan, including struct, map, array.                   
                                                             | true             
 |
 | spark.gluten.velox.offHeapBroadcastBuildRelation.enabled             | 
Experimental: If enabled, broadcast build relation will use offheap memory. 
Otherwise, broadcast build relation will use onheap memory, default value is 
false |                   |
+| spark.gluten.auto.adjustStageResource.enabled                        | 
Experimental: If enabled, gluten will try to set the stage resource according 
to stage execution plan. NOTE: Only workes when aqe is enabled at the same 
time. | false   |
+| spark.gluten.auto.adjustStageResources.heap.ratio                    | 
Experimental: Increase executor heap memory when match adjust stage resource 
rule.                                                                        | 
2.0d    |
+| spark.gluten.auto.adjustStageResources.fallenNode.ratio.threshold    | 
Experimental: Increase executor heap memory when stage contains fallen node 
count exceeds the total node count ratio.                                     | 
0.5d    |
 
 Additionally, you can control the configurations of gluten at thread level by 
local property.
 
diff --git a/docs/get-started/Velox.md b/docs/get-started/Velox.md
index c3a9aa403f..4f922edd75 100644
--- a/docs/get-started/Velox.md
+++ b/docs/get-started/Velox.md
@@ -546,6 +546,9 @@ I20231121 10:19:42.348845 90094332 
WholeStageResultIterator.cc:220] Native Plan
 ```
 
 
+## Using Stage-Level Resource Adjustment to Avoid OOM(Experimental)
+ see more [here](../VeloxStageResourceAdj.md)
+
 ## Broadcast Build Relations to Off-Heap(Experimental)
 
 The experimental feature **Off-Heap Broadcast Build Relations** aims to 
mitigate out-of-memory (OOM) issues caused by heap memory consumption during 
broadcast operations. Detailed design
diff --git a/docs/get-started/VeloxStageResourceAdj.md 
b/docs/get-started/VeloxStageResourceAdj.md
new file mode 100644
index 0000000000..c9c889e95b
--- /dev/null
+++ b/docs/get-started/VeloxStageResourceAdj.md
@@ -0,0 +1,74 @@
+---
+layout: page
+title: Stage-Level Resource Adjustment in Velox Backend
+nav_order: 3
+parent: Getting-Started
+---
+## Using Stage-Level Resource Adjustment to Avoid OOM(Experimental)
+
+### **Overview**
+One major advantage of Apache Gluten is its ability to significantly reduce 
memory requirements per executor—potentially by up to half—when entire stages 
are offloaded to the native engine. This engine primarily relies on off-heap 
memory with minimal on-heap usage. However, when stages contain fallback 
operators that utilize the JVM engine, the on-heap memory size must be 
increased, leading to even higher memory demands per executor. This challenge 
has posed significant barriers during t [...]
+
+To address this issue, Apache Gluten introduces a stage-level resource 
auto-adjustment framework. This feature dynamically optimizes task and executor 
resource profiles, such as heap and off-heap memory allocation, based on the 
specific characteristics of each stage, including the presence of fallback 
operators. Additionally, this framework is designed with future enhancements in 
mind, allowing for adjustments to accommodate other requirements, such as heavy 
shuffle workloads(to be suppo [...]
+
+### **Prerequisites**
+1. **Enable Adaptive Query Execution (AQE)**:
+   ```properties  
+   spark.sql.adaptive.enabled=true  
+   ```  
+2. **Enable Executor Dynamic Allocation**:
+   ```properties  
+   spark.dynamicAllocation.enabled=true  
+   ```  
+3. **Resource Scheduler Compatibility**:  
+   Ensure the underlying cluster resource manager (e.g., YARN, Kubernetes) 
supports dynamic resource allocation.
+
+### **Key Configurations**
+Add the following configurations to your Spark application:
+
+
+| Parameters                                                        | 
Description                                                                     
                                                                              | 
Default |
+|-------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------|---------|
+| spark.gluten.auto.adjustStageResource.enabled                     | 
Experimental: If enabled, gluten will try to set the stage resource according 
to stage execution plan. NOTE: Only works when aqe is enabled at the same time. 
| false   |
+| spark.gluten.auto.adjustStageResources.heap.ratio                 | 
Experimental: Increase executor heap memory when match adjust stage resource 
rule.                                                                           
 | 2.0d    |
+| spark.gluten.auto.adjustStageResources.fallenNode.ratio.threshold | 
Experimental: Increase executor heap memory when stage contains fallen node 
count exceeds the total node count ratio.                                       
  | 0.5d    |
+#### **1. Enable Auto-Adjustment**
+```properties  
+spark.gluten.auto.AdjustStageResource.enabled=true  
+```
+### **How It Works**
+The framework analyzes each stage during query planning and adjusts resource 
profiles in following scenarios:
+
+#### **Scenario 1: Fallback Operators Exist**
+If a stage all operator fallback to vanilla Spark operator or  fallback 
operators (e.g., unsupported UDAFs) ratio exceed specified threshold, Gluten 
will automic increases heap memory allocation to handle the extra load.
+
+
+### **Verification**
+1. **Check Logs**:  
+   Look for driver log entries indicating resource profile adjustments:
+   ```  
+   Apply resource profile [RP_ID] for plan [plan node name]  
+   ```  
+
+2. **Check SparkUI SQL Tab**  
+
+There will be a ApplyResourceProfile node in the Query details.
+![SQL_DETAIL](../image/velox_apply_stage_resource.png)
+And the execution plan will like following with ApplyResourceProfile node 
inserted.
+```
++- *(3) HashAggregate(keys=[_nondeterministic#37], functions=[count(1)], 
output=[java_method(java.lang.Integer, signum, c1)#35, count(1)#36L])
+            +- AQEShuffleRead coalesced
+               +- ShuffleQueryStage 0
+                  +- Exchange hashpartitioning(_nondeterministic#37, 5), 
ENSURE_REQUIREMENTS, [plan_id=607]
+                     +- ApplyResourceProfile Profile: id = 0, executor 
resources: cores -> name: cores, amount: 1, script: , vendor: ,memory -> name: 
memory, amount: 1024, script: , vendor: ,offHeap -> name: offHeap, amount: 
2048, script: , vendor: , task resources: cpus -> name: cpus, amount: 1.0
+                        +- *(2) HashAggregate(keys=[_nondeterministic#37], 
functions=[partial_count(1)], output=[_nondeterministic#37, count#41L])
+                           +- Project [java_method(java.lang.Integer, signum, 
c1#22) AS _nondeterministic#37]
+                              +- *(1) ColumnarToRow
+                                 +- FileScan parquet default.tmp1[c1#22] 
Batched: true, DataFilters: [], Format: Parquet
+```
+   
+### **Limitations**
+• Tested with YARN/Kubernetes; other resource managers may need validation.
+
+
+For issues or feedback, refer to 
[GLUTEN-8018](https://github.com/apache/incubator-gluten/issues/8018).
\ No newline at end of file
diff --git a/docs/image/velox_apply_stage_resource.png 
b/docs/image/velox_apply_stage_resource.png
new file mode 100644
index 0000000000..c9fbb2f8ca
Binary files /dev/null and b/docs/image/velox_apply_stage_resource.png differ


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

(incubator-gluten) branch main updated: [DOC] Document Stage Level Resource Profile Adjustment feature (#8908)

Reply via email to