serdar onur created SPARK-31649:
-----------------------------------

             Summary: Spread partitions evenly to spark executors
                 Key: SPARK-31649
                 URL: https://issues.apache.org/jira/browse/SPARK-31649
             Project: Spark
          Issue Type: New Feature
          Components: Spark Core
    Affects Versions: 2.4.4
            Reporter: serdar onur


The year is 2020 and I am still trying to find a solution to this. I totally 
understand what [~thunderstumpges] was trying to achieve and I am trying to 
achieve the same. For a tool like spark, it is unacceptable not to be able to 
distribute the created partitions to the executors evenly. You know, we can 
create a custom partitioner to distribute the data to the partitions evenly by 
creating our own partition index. I was under the impression that a similar 
approach could be applied to spread these partitions to the executors 
evenly(using some sort of executor index for selection of executors during 
partition distribution). I have been googling this for a day now and I am very 
disappointed to say that up to now this seems to be not possible.

Note: I am disappointed that the issue below was put into resolved state 
without actually doing anything about it.

https://issues.apache.org/jira/browse/SPARK-19371



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to