Ngone51 commented on a change in pull request #28053: [SPARK-29153][CORE]Add
ability to merge resource profiles within a stage with Stage Level Scheduling
URL: https://github.com/apache/spark/pull/28053#discussion_r399977107
##########
File path: core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala
##########
@@ -447,10 +449,28 @@ private[spark] class DAGScheduler(
stageResourceProfiles: HashSet[ResourceProfile]): ResourceProfile = {
logDebug(s"Merging stage rdd profiles: $stageResourceProfiles")
val resourceProfile = if (stageResourceProfiles.size > 1) {
- // add option later to actually merge profiles - SPARK-29153
- throw new IllegalArgumentException("Multiple ResourceProfile's specified
in the RDDs for " +
- "this stage, please resolve the conflicting ResourceProfile's as Spark
doesn't" +
- "currently support merging them.")
+ if (shouldMergeResourceProfiles) {
+ var mergedProfile: ResourceProfile = stageResourceProfiles.head
+ for (profile <- stageResourceProfiles.drop(1)) {
+ mergedProfile = mergeResourceProfiles(mergedProfile, profile)
+ }
Review comment:
Maybe, using `fold`?
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
With regards,
Apache Git Services
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]