[
https://issues.apache.org/jira/browse/SPARK-8987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin updated SPARK-8987:
-------------------------------
Target Version/s: 1.6.0 (was: 1.5.0)
> Increase test coverage of DAGScheduler
> --------------------------------------
>
> Key: SPARK-8987
> URL: https://issues.apache.org/jira/browse/SPARK-8987
> Project: Spark
> Issue Type: Bug
> Components: Scheduler, Tests
> Affects Versions: 1.0.0
> Reporter: Andrew Or
>
> DAGScheduler is one of the most monstrous piece of code in Spark. Every time
> someone changes something there something like the following happens:
> (1) Someone pings a committer
> (2) The committer pings a scheduler maintainer
> (3) Scheduler maintainer correctly points out bugs in the patch
> (4) Author of patch fixes bug but introduces more bugs
> (5) Repeat steps 3 - 4 N times
> (6) Other committers / contributors jump in and start debating
> (7) The patch goes stale for months
> All of this happens because no one, including the committers, has high
> confidence that a particular change doesn't break some corner case in the
> scheduler. I believe one of the main issues is the lack of sufficient test
> coverage, which is not a luxury but a necessity for logic as complex as the
> DAGScheduler.
> As of the writing of this JIRA, DAGScheduler has ~1500 lines, while the
> DAGSchedulerSuite only has ~900 lines. I would argue that the suite line
> count should actually be many multiples of that of the original code.
> If you wish to work on this, let me know and I will assign it to you. Anyone
> is welcome. :)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]