GitHub user watermen opened a pull request:

    https://github.com/apache/spark/pull/7536

    [SPARK-9189][SQL] Takes locality and the sum of partition length into 
account when partition is instance of HadoopPartition in operator coalesce

    Before:
    Takes locality and `the number of partitions` into account in operator 
coalesce.
    
    After:
    Takes locality and `the sum of partition length(part1.len + part2.len + ... 
+ partN.len)` into account when partition is instance of HadoopPartition in 
operator coalesce.
    
    To make the data size of partition more balanced.
    /cc @liancheng @scwf 

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/watermen/spark SPARK-9189

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/7536.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #7536
    
----
commit cb72d0f2ce1432ad58246fbeae60c4565fbb4ce7
Author: Yadong Qi <[email protected]>
Date:   2015-07-20T08:28:06Z

    Takes locality and the sum of partition length into account.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to