[
https://issues.apache.org/jira/browse/TEZ-4123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17034325#comment-17034325
]
László Bodor commented on TEZ-4123:
-----------------------------------
It seems the patch revealed other issues, and I can still see some timeouts:
https://builds.apache.org/job/PreCommit-TEZ-Build/288/artifact/out/patch-unit-tez-tests.txt
{code}
[ERROR] Tests run: 14, Failures: 1, Errors: 5, Skipped: 0, Time elapsed: 554.81 s <<< FAILURE! - in org.apache.tez.mapreduce.TestMRRJobsDAGApi
[ERROR] testMultipleMRRSleepJobViaSession(org.apache.tez.mapreduce.TestMRRJobsDAGApi) Time elapsed: 12.752 s <<< FAILURE!
java.lang.AssertionError: expected:<READY> but was:<RUNNING>
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMultipleMRRSleepJobViaSession(TestMRRJobsDAGApi.java:502)
[ERROR] testNonDefaultFSStagingDir(org.apache.tez.mapreduce.TestMRRJobsDAGApi) Time elapsed: 60.003 s <<< ERROR!
java.lang.Exception: test timed out after 60000 milliseconds
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testNonDefaultFSStagingDir(TestMRRJobsDAGApi.java:246)
[ERROR] testMRRSleepJobDagSubmit(org.apache.tez.mapreduce.TestMRRJobsDAGApi) Time elapsed: 60.002 s <<< ERROR!
java.lang.Exception: test timed out after 60000 milliseconds
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitCore(TestMRRJobsDAGApi.java:701)
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitCore(TestMRRJobsDAGApi.java:544)
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmit(TestMRRJobsDAGApi.java:326)
[ERROR] testMRRSleepJobDagSubmitAndKillViaRPC(org.apache.tez.mapreduce.TestMRRJobsDAGApi) Time elapsed: 60.002 s <<< ERROR!
java.lang.Exception: test timed out after 60000 milliseconds
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitCore(TestMRRJobsDAGApi.java:747)
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitCore(TestMRRJobsDAGApi.java:544)
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitAndKillViaRPC(TestMRRJobsDAGApi.java:517)
[ERROR] testMRRSleepJobDagSubmitAndKill(org.apache.tez.mapreduce.TestMRRJobsDAGApi) Time elapsed: 60.003 s <<< ERROR!
java.lang.Exception: test timed out after 60000 milliseconds
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitCore(TestMRRJobsDAGApi.java:701)
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitCore(TestMRRJobsDAGApi.java:544)
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitAndKill(TestMRRJobsDAGApi.java:337)
[ERROR] testAMRelocalizationConflict(org.apache.tez.mapreduce.TestMRRJobsDAGApi) Time elapsed: 120.002 s <<< ERROR!
java.lang.Exception: test timed out after 120000 milliseconds
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testMRRSleepJobDagSubmitCore(TestMRRJobsDAGApi.java:747)
    at org.apache.tez.mapreduce.TestMRRJobsDAGApi.testAMRelocalizationConflict(TestMRRJobsDAGApi.java:438)
{code}
I consider this a step forward: without the patch, I saw the whole test class
time out without a single test succeeding, e.g. in TEZ-4100:
https://builds.apache.org/job/PreCommit-TEZ-Build/279/artifact/out/patch-unit-root.txt
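
To put the "step forward" in numbers, here is a minimal sketch that reads the Surefire summary line from the build 288 log above and derives how many tests passed. The class name `SurefireSummary` and its `parse` helper are hypothetical, not part of Tez or Surefire, and the arithmetic assumes the "Tests run" count includes failed, errored and skipped tests.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only: not part of Tez or Surefire.
public class SurefireSummary {
    // Matches the per-class summary line as it appears in the log above.
    private static final Pattern SUMMARY = Pattern.compile(
        "Tests run: (\\d+), Failures: (\\d+), Errors: (\\d+), Skipped: (\\d+)");

    /** Returns {run, failures, errors, skipped}, or null if the line does not match. */
    static int[] parse(String line) {
        Matcher m = SUMMARY.matcher(line);
        if (!m.find()) {
            return null;
        }
        int[] counts = new int[4];
        for (int i = 0; i < 4; i++) {
            counts[i] = Integer.parseInt(m.group(i + 1));
        }
        return counts;
    }

    /** Assumption: "Tests run" includes failed, errored and skipped tests. */
    static int passed(int[] counts) {
        return counts[0] - counts[1] - counts[2] - counts[3];
    }

    public static void main(String[] args) {
        String line = "[ERROR] Tests run: 14, Failures: 1, Errors: 5, Skipped: 0, "
            + "Time elapsed: 554.81 s <<< FAILURE! - in org.apache.tez.mapreduce.TestMRRJobsDAGApi";
        int[] counts = parse(line);
        // Prints: passed=8 failed=1 errors=5
        System.out.println("passed=" + passed(counts)
            + " failed=" + counts[1] + " errors=" + counts[2]);
    }
}
```

Under that assumption, 8 of the 14 tests passed with the patch, whereas the earlier run had no succeeding tests.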
> TestMRRJobsDAGApi flaky timeout - unhealthy node
> ------------------------------------------------
>
> Key: TEZ-4123
> URL: https://issues.apache.org/jira/browse/TEZ-4123
> Project: Apache Tez
> Issue Type: Test
> Reporter: László Bodor
> Assignee: László Bodor
> Priority: Major
> Attachments: TEZ-4123.01.patch, TestMRRJobsDAGApi.out,
> org.apache.tez.mapreduce.TestMRRJobsDAGApi-output.txt
>
>
> Failed in both precommit and on master locally:
> {code}
> mvn clean install -pl ./tez-tests -Dtest=TestMRRJobsDAGApi
> {code}
> surefire process thread dump: [^TestMRRJobsDAGApi.out]
> test output: [^org.apache.tez.mapreduce.TestMRRJobsDAGApi-output.txt]