[jira] [Created] (YARN-10232) InvalidStateTransitionException: Invalid event: LAUNCH_FAILED at RUNNING

2020-04-11 Thread YCozy (Jira)
YCozy created YARN-10232: Summary: InvalidStateTransitionException: Invalid event: LAUNCH_FAILED at RUNNING Key: YARN-10232 URL: https://issues.apache.org/jira/browse/YARN-10232 Project: Hadoop YARN

[jira] [Updated] (YARN-10232) InvalidStateTransitionException: Invalid event: LAUNCH_FAILED at RUNNING

2020-04-11 Thread YCozy (Jira)
[ https://issues.apache.org/jira/browse/YARN-10232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YCozy updated YARN-10232: - Description: We were testing YARN under network partition and found the following ERROR in RM's log.

[jira] [Updated] (YARN-10232) InvalidStateTransitionException: Invalid event: LAUNCH_FAILED at RUNNING

2020-04-11 Thread YCozy (Jira)
[ https://issues.apache.org/jira/browse/YARN-10232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YCozy updated YARN-10232: - Description: We were testing YARN under network partition and found the following ERROR in RM's log.

[jira] [Created] (YARN-10231) When a NM is partitioned away, YARN service will complain about "Queue's AM resource limit exceeded"

2020-04-10 Thread YCozy (Jira)
YCozy created YARN-10231: Summary: When a NM is partitioned away, YARN service will complain about "Queue's AM resource limit exceeded" Key: YARN-10231 URL: https://issues.apache.org/jira/browse/YARN-10231

[jira] [Created] (YARN-10288) InvalidStateTransitionException: LAUNCH_FAILED at FAILED

2020-05-22 Thread YCozy (Jira)
YCozy created YARN-10288: Summary: InvalidStateTransitionException: LAUNCH_FAILED at FAILED Key: YARN-10288 URL: https://issues.apache.org/jira/browse/YARN-10288 Project: Hadoop YARN Issue Type: Bug

[jira] [Updated] (YARN-10288) InvalidStateTransitionException: LAUNCH_FAILED at FAILED

2020-05-22 Thread YCozy (Jira)
[ https://issues.apache.org/jira/browse/YARN-10288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YCozy updated YARN-10288: - Description: We encountered the following exception when testing YARN (2.10.0) under network partition:

[jira] [Commented] (YARN-9194) Invalid event: REGISTERED and LAUNCH_FAILED at FAILED, and NullPointerException happens in RM while shutdown a NM

2020-05-22 Thread YCozy (Jira)
[ https://issues.apache.org/jira/browse/YARN-9194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17114123#comment-17114123 ] YCozy commented on YARN-9194: - Hi, we were able to trigger the same bug (LAUNCH_FAILED at FAILED) in 2.10.0.

[jira] [Issue Comment Deleted] (YARN-9194) Invalid event: REGISTERED and LAUNCH_FAILED at FAILED, and NullPointerException happens in RM while shutdown a NM

2020-05-22 Thread YCozy (Jira)
[ https://issues.apache.org/jira/browse/YARN-9194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YCozy updated YARN-9194: Comment: was deleted (was: Hi, we were able to trigger the same bug (LAUNCH_FAILED at FAILED) in 2.10.0. Can we

[jira] [Updated] (YARN-10294) NodeManager shows a wrong reason when a YARN service fails to start

2020-05-28 Thread YCozy (Jira)
[ https://issues.apache.org/jira/browse/YARN-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YCozy updated YARN-10294: - Description: We have a YARN cluster and try to start a sleeper service. A NodeManager NM1 gets assigned and

[jira] [Created] (YARN-10294) NodeManager shows a wrong reason when a YARN service fails to start

2020-05-28 Thread YCozy (Jira)
YCozy created YARN-10294: Summary: NodeManager shows a wrong reason when a YARN service fails to start Key: YARN-10294 URL: https://issues.apache.org/jira/browse/YARN-10294 Project: Hadoop YARN

[jira] [Commented] (YARN-10166) Add detail log for ApplicationAttemptNotFoundException

2020-05-29 Thread YCozy (Jira)
[ https://issues.apache.org/jira/browse/YARN-10166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17119886#comment-17119886 ] YCozy commented on YARN-10166: -- We encountered the same issue. An AM is killed during NM failover, but the

[jira] [Updated] (YARN-10301) "DIGEST-MD5: digest response format violation. Mismatched response." when network partition occurs

2020-06-01 Thread YCozy (Jira)
[ https://issues.apache.org/jira/browse/YARN-10301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YCozy updated YARN-10301: - Description: We observed the "Mismatched response." error in RM's log when a NM gets network-partitioned after

[jira] [Created] (YARN-10301) "DIGEST-MD5: digest response format violation. Mismatched response." when network partition occurs

2020-06-01 Thread YCozy (Jira)
YCozy created YARN-10301: Summary: "DIGEST-MD5: digest response format violation. Mismatched response." when network partition occurs Key: YARN-10301 URL: https://issues.apache.org/jira/browse/YARN-10301