Zongheng Yang created SPARK-2046:
------------------------------------

             Summary: Support config properties that are changeable across 
tasks/stages within a job
                 Key: SPARK-2046
                 URL: https://issues.apache.org/jira/browse/SPARK-2046
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
            Reporter: Zongheng Yang


Suppose an application consists of multiple stages, where some stages contain 
computation-intensive tasks, and other stages contain less 
computation-intensive (or otherwise ordinary) tasks. 

For such a job to run efficiently, it might make sense to give the user a 
function that sets "spark.task.cpus" to a high number right before the 
computation-intensive stages/tasks are generated in the user code, and sets 
the property back to a lower number for the other stages/tasks.

As a first step, supporting this feature across stages, rather than at the 
more fine-grained task level, might suffice. 
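
To make the trade-off concrete, here is a rough, self-contained sketch. It 
does not use Spark. The names Stage, task_cpus, concurrent_slots and waves are 
hypothetical and exist only for illustration. The cluster size (2 executors 
with 8 cores each) and the task counts are made up. The sketch assumes that 
the number of tasks an executor runs at once is its core count divided by the 
CPUs reserved per task.

```python
from dataclasses import dataclass


@dataclass
class Stage:
    name: str
    num_tasks: int
    task_cpus: int  # hypothetical per-stage override of "spark.task.cpus"


def concurrent_slots(executor_cores: int, task_cpus: int) -> int:
    """Tasks one executor can run at once: cores divided by CPUs per task."""
    if task_cpus < 1:
        raise ValueError("task_cpus must be at least 1")
    return executor_cores // task_cpus


def waves(stage: Stage, executor_cores: int, num_executors: int) -> int:
    """Number of scheduling rounds needed to run every task in the stage."""
    slots = concurrent_slots(executor_cores, stage.task_cpus) * num_executors
    if slots == 0:
        raise ValueError("task_cpus exceeds the cores of one executor")
    return -(-stage.num_tasks // slots)  # ceiling division


# Illustrative job: a light stage with 1 CPU per task, then a
# computation-intensive stage that reserves 4 CPUs per task.
stages = [Stage("parse", 64, 1), Stage("train", 16, 4)]
plan = {s.name: waves(s, executor_cores=8, num_executors=2) for s in stages}
```

With a single job-wide value, both stages would have to share either 1 or 4 
CPUs per task. A per-stage value lets the light stage keep all 16 slots while 
the heavy stage gets 4 cores for each of its tasks.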



--
This message was sent by Atlassian JIRA
(v6.2#6252)
