Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/17543
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/75554/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/17543
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/17543
**[Test build #75554 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/75554/testReport)**
for PR 17543 at commit
Github user brkyvz commented on the issue:
https://github.com/apache/spark/pull/17543
That JIRA is great. I'll close this PR for now and link my JIRA in there.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user kayousterhout commented on the issue:
https://github.com/apache/spark/pull/17543
In theory (as you may know), the way this is supposed to work is that,
since each reduce task reads the map outputs in random order, we delay
re-scheduling the earlier stage, to try to
Github user brkyvz commented on the issue:
https://github.com/apache/spark/pull/17543
Let me try to draw a graph to better explain this.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this
Github user brkyvz commented on the issue:
https://github.com/apache/spark/pull/17543
Yes, your explanation is on point. If I have 4+ executors that died, then
all retries of Stage B will also eventually fail. If we didn't ignore these
failures, we could have re-computed the outputs
Github user brkyvz commented on the issue:
https://github.com/apache/spark/pull/17543
cc @kayousterhout @markhamstra for feedback.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/17543
**[Test build #75554 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/75554/testReport)**
for PR 17543 at commit