GitHub user mwws opened a pull request:

    https://github.com/apache/spark/pull/8760

    Spark 8426

    1. add a global BlacklistTracker which is able to share blacklisted 
executors information across TaskSetManager. Different BlacklistStrategy could 
be applied to determine which executors should be put in blacklist. And it's 
convenient to add new strategy later according to current design.
    2. remove duplicate "failedExecutors" list in TaskSetManager and make 
BlacklistTracker being the only interface for blacklist mechanism in codebase 
which is logically cleaner.
    3. for backward compatibility, implement DefaultStrategy to keep the same 
semantics as current by default 
    4. apply BlacklistTracker in YARN to avoid allocating new executors on 
blacklisted node, the blacklisted node information will be passed to YARN 
ResourceManager. it affects in dynamic allocation mode.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mwws/spark SPARK-8426

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/8760.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #8760
    
----
commit 8519cb5ca4bbe86885fd7cc09fc9aeaf6fab1436
Author: unknown <[email protected]>
Date:   2015-07-28T05:19:29Z

    change package name to test.org.apache.spark.sql.hive.execution

commit 7c91bb55b9424657409c7e5131cf9b791b9e4a66
Author: unknown <[email protected]>
Date:   2015-09-02T07:54:31Z

    Merge remote-tracking branch 'apache-spark/master'

commit 9a25d71025a411e1fad4190e914465218a1a2d66
Author: mwws <[email protected]>
Date:   2015-09-10T05:29:44Z

    add executor blacklist feature
    
    1. Add blackListTracker and blackListStrategy class
    2. remove deplicated failedExecutor in TaskSetManager
    3. enhance YARN allocator to pass node blacklist info to ResourceManager

commit f648b4c185042ee9d38e11b7f213fe68868c8d72
Author: mwws <[email protected]>
Date:   2015-09-11T05:25:31Z

    some enhancement
    
    1. Add taskId to FailureInfo to be able to contain enough informantion
    for backward compatibility
    2. replace Seq with Set to speedup contains method and remove duplicate
    element by natual
    3. remove executors from failureExecutors when
        (1)any task succeed on the executor
        (2)the executor is removed by ExecutorAllocateClient
    3. Move IsBlacklist() method into BlacklistTracker to make logic cleaner
    4. fit spark code style, eg: lineend, comments start, max length of line

commit 954886f18b3811958b1c2a588f00652d678cf871
Author: mwws <[email protected]>
Date:   2015-09-11T07:47:40Z

    fix compile error
    
    fix compile error

commit 78feafbe23729f2102effec9abc90d6a865ba211
Author: mwws <[email protected]>
Date:   2015-09-14T04:29:08Z

    farther change according to comment
    
    1. rename some varilables and methods
    2. correct FailureExecutors logic in CoarseGrainedSchedulerBackend

commit c84f362b4002eddde22f43c421434f413b04a5ee
Author: mwws <[email protected]>
Date:   2015-09-14T04:36:53Z

    some change to fit scala style
    
    fit scala style

commit 90ee814686e3bf7906fa447b7594aa6b3643de80
Author: mwws <[email protected]>
Date:   2015-09-14T05:22:00Z

    revert incorrect package name change
    
    revert incorrect package name change

commit 413b403ccc577d062b6eae10d995077631d78e97
Author: mwws <[email protected]>
Date:   2015-09-15T05:06:01Z

    change strategy to keep semanetic as original
    
    The original blacklist logic is taskId related, so I change interface
    and add a default strategy to keep same semanetic as it.

commit 7c6eb632fb4132dc053683db617454ce7b335a31
Author: unknown <[email protected]>
Date:   2015-07-28T05:19:29Z

    change package name to test.org.apache.spark.sql.hive.execution

commit b306caecf6d5112c44029ee016654119d3c5185d
Author: mwws <[email protected]>
Date:   2015-09-10T05:29:44Z

    add executor blacklist feature
    
    1. Add blackListTracker and blackListStrategy class
    2. remove deplicated failedExecutor in TaskSetManager
    3. enhance YARN allocator to pass node blacklist info to ResourceManager

commit 395773f1a91d13c4e21bd19bcc56e5a02c9c2784
Author: mwws <[email protected]>
Date:   2015-09-11T05:25:31Z

    some enhancement
    
    1. Add taskId to FailureInfo to be able to contain enough informantion
    for backward compatibility
    2. replace Seq with Set to speedup contains method and remove duplicate
    element by natual
    3. remove executors from failureExecutors when
        (1)any task succeed on the executor
        (2)the executor is removed by ExecutorAllocateClient
    3. Move IsBlacklist() method into BlacklistTracker to make logic cleaner
    4. fit spark code style, eg: lineend, comments start, max length of line

commit a7dfa20fec17de80755edde3b967f9345d8c2834
Author: mwws <[email protected]>
Date:   2015-09-11T07:47:40Z

    fix compile error
    
    fix compile error

commit 94d3a5d90e126d8c95580f2e7f3d5fff6dd7bc4c
Author: mwws <[email protected]>
Date:   2015-09-14T04:29:08Z

    farther change according to comment
    
    1. rename some varilables and methods
    2. correct FailureExecutors logic in CoarseGrainedSchedulerBackend

commit 26fb5a5c41bbbdba793d8ea0c1d40cad299ffedb
Author: mwws <[email protected]>
Date:   2015-09-14T04:36:53Z

    some change to fit scala style
    
    fit scala style

commit f7ecc784788e7b3f0f75a975a781f1af7040bbc4
Author: mwws <[email protected]>
Date:   2015-09-14T05:22:00Z

    revert incorrect package name change
    
    revert incorrect package name change

commit f44d4e134f725dc42b3017bebe20f459b625df8c
Author: mwws <[email protected]>
Date:   2015-09-15T05:06:01Z

    change strategy to keep semanetic as original
    
    The original blacklist logic is taskId related, so I change interface
    and add a default strategy to keep same semanetic as it.

commit c36ea5b44e9a3c4a91bb3625abc3863e2df2aa66
Author: mwws <[email protected]>
Date:   2015-09-15T06:03:46Z

    Merge branch 'SPARK-8426' of https://github.com/mwws/spark into SPARK-8426
    
    Conflicts:
    
    
core/src/main/scala/org/apache/spark/scheduler/cluster/YarnSchedulerBackend.scala
    
    
sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/HiveUDFSuite.scala
    
    yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala

commit 24cdf30d6144d084b5c8ff23984e037bab18b4dd
Author: mwws <[email protected]>
Date:   2015-09-15T06:19:45Z

    fix compile error
    
    fix compile error casued by rebase

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to