[jira] [Created] (YARN-11586) Script based sub cluster resolver

2023-10-07 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11586: -- Summary: Script based sub cluster resolver Key: YARN-11586 URL: https://issues.apache.org/jira/browse/YARN-11586 Project: Hadoop YARN Issue Type: Improvement

[jira] [Commented] (YARN-10174) Add colored policies to enable manual load balancing across sub clusters

2023-09-26 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17769401#comment-17769401 ] zhengchenyu commented on YARN-10174: [~slfan1989] Thanks for your reply. Adjusting pa

[jira] [Commented] (YARN-10174) Add colored policies to enable manual load balancing across sub clusters

2023-09-17 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17766228#comment-17766228 ] zhengchenyu commented on YARN-10174: I think [~youchen] means that we can get weights

[jira] [Updated] (YARN-11566) Yarn app kill command can not kill the application in secondary sub cluster.

2023-09-15 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11566: --- Issue Type: Bug (was: Improvement) > Yarn app kill command can not kill the application in secondary

[jira] [Comment Edited] (YARN-11566) Yarn app kill command can not kill the application in secondary sub cluster.

2023-09-14 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17764465#comment-17764465 ] zhengchenyu edited comment on YARN-11566 at 9/14/23 10:54 AM: -

[jira] [Updated] (YARN-11566) Yarn app kill command can not kill the application in secondary sub cluster.

2023-09-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11566: --- Description: When AMRMProxy is enabled, the application may allocate containers among multiple sub cluste

[jira] [Comment Edited] (YARN-11566) Yarn app kill command can not kill the application in secondary sub cluster.

2023-09-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17764465#comment-17764465 ] zhengchenyu edited comment on YARN-11566 at 9/13/23 10:00 AM: -

[jira] [Commented] (YARN-11566) Yarn app kill command can not kill the application in secondary sub cluster.

2023-09-12 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17764465#comment-17764465 ] zhengchenyu commented on YARN-11566: There are two ways to solve this problem: (1) C

[jira] [Updated] (YARN-11566) Yarn app kill command can not kill the application in secondary sub cluster.

2023-09-11 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11566: --- Description: When AMRMProxy is enabled, the application may allocate containers among multiple sub cluster

[jira] [Updated] (YARN-11566) Yarn app kill command can not kill the application in secondary sub cluster.

2023-09-11 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11566: --- Description: When AMRMProxy is enabled, the application may allocate containers among multiple sub cluster

[jira] [Created] (YARN-11566) Yarn app kill command can not kill the application in secondary sub cluster.

2023-09-11 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11566: -- Summary: Yarn app kill command can not kill the application in secondary sub cluster. Key: YARN-11566 URL: https://issues.apache.org/jira/browse/YARN-11566 Project: Hadoo

[jira] [Comment Edited] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17763087#comment-17763087 ] zhengchenyu edited comment on YARN-11565 at 9/8/23 1:34 PM: T

[jira] [Updated] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11565: --- Target Version/s: (was: 3.4.0) > Container logs are missing when yarn.app.container.log.filesize is

[jira] [Updated] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11565: --- Labels: (was: pull-request-available) > Container logs are missing when yarn.app.container.log.file

[jira] [Resolved] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu resolved YARN-11565. Resolution: Duplicate. This Jira should be in the MapReduce module. > Container logs are missing when y

[jira] [Updated] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11565: --- Description: Since HADOOP-18649, in container-log4j.properties, log4j.appender.\{APPENDER}.MaxFileSi

[jira] [Updated] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11565: --- Description: Since HADOOP-18649, in container-log4j.properties, log4j.appender.\{APPENDER}.MaxFileSi

[jira] [Updated] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11565: --- Description: Since HADOOP-18649, in container-log4j.properties, log4j.appender.\{APPENDER}.MaxFileSi

[jira] [Updated] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11565: --- Description: Since HADOOP-18649, in container-log4j.properties, log4j.appender.\{APPENDER}.MaxFileSi

[jira] [Updated] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11565: --- Description: Since HADOOP-18649, log4j.appender.\{APPENDER}.MaxFileSize is set to ${yarn.app.contain

[jira] [Updated] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11565: --- Description: Since HADOOP-18649, log4j.appender.\{APPENDER}.MaxFileSize is set to ${yarn.app.contain

[jira] [Created] (YARN-11565) Container logs are missing when yarn.app.container.log.filesize is set to default value 0.

2023-09-08 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11565: -- Summary: Container logs are missing when yarn.app.container.log.filesize is set to default value 0. Key: YARN-11565 URL: https://issues.apache.org/jira/browse/YARN-11565

[jira] [Updated] (YARN-11564) Fix wrong config in yarn-default.xml

2023-09-07 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11564: --- Summary: Fix wrong config in yarn-default.xml (was: The default value of sub cluster cleaner interva

[jira] [Updated] (YARN-11564) Fix wrong config in yarn-default.xml

2023-09-07 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11564: --- Description: yarn.router.subcluster.cleaner.interval.time is duplicated in yarn-default.xml (was: So

[jira] [Created] (YARN-11564) The default value of sub cluster cleaner interval was converted unexpectedly

2023-09-07 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11564: -- Summary: The default value of sub cluster cleaner interval was converted unexpectedly Key: YARN-11564 URL: https://issues.apache.org/jira/browse/YARN-11564 Project: Hadoo

[jira] [Assigned] (YARN-8980) Mapreduce application container start fail after AM restart.

2023-08-22 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-8980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu reassigned YARN-8980: - Assignee: zhengchenyu (was: Shilun Fan) > Mapreduce application container start fail after AM r

[jira] [Updated] (YARN-11153) Make proxy server support YARN federation.

2023-08-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11153: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Updated] (YARN-11153) Make proxy server support YARN federation.

2023-08-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11153: --- Attachment: YARN-10775-design-doc.001.pdf > Make proxy server support YARN federation. >

[jira] [Updated] (YARN-11153) Make proxy server support YARN federation.

2023-08-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11153: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Updated] (YARN-11154) Make router support proxy server.

2023-08-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11154: --- Parent Issue: YARN-5597 (was: YARN-10775) > Make router support proxy server. >

[jira] [Updated] (YARN-11153) Make proxy server support YARN federation.

2023-08-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11153: --- Parent Issue: YARN-5597 (was: YARN-10775) > Make proxy server support YARN federation. > ---

[jira] [Created] (YARN-11549) Add MiniRouterYarnCluster for test

2023-08-12 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11549: -- Summary: Add MiniRouterYarnCluster for test Key: YARN-11549 URL: https://issues.apache.org/jira/browse/YARN-11549 Project: Hadoop YARN Issue Type: Improvement

[jira] [Commented] (YARN-11183) Federation: Remove outdated ApplicationHomeSubCluster in federation state store.

2022-11-08 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17630300#comment-17630300 ] zhengchenyu commented on YARN-11183: [~goiri]  Hi, can you please review this PR? the

[jira] [Updated] (YARN-5936) when cpu strict mode is closed, yarn couldn't assure scheduling fairness between containers

2022-09-02 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-5936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-5936: -- Target Version/s: (was: 2.7.1) > when cpu strict mode is closed, yarn couldn't assure scheduling fairn

[jira] [Resolved] (YARN-5936) when cpu strict mode is closed, yarn couldn't assure scheduling fairness between containers

2022-09-02 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-5936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu resolved YARN-5936. --- Resolution: Not A Problem > when cpu strict mode is closed, yarn couldn't assure scheduling fairness

[jira] [Commented] (YARN-5936) when cpu strict mode is closed, yarn couldn't assure scheduling fairness between containers

2022-09-02 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-5936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599379#comment-17599379 ] zhengchenyu commented on YARN-5936: --- Because of a job change, I have been away for a long time. In fact, w

[jira] [Comment Edited] (YARN-11183) Federation: Remove outdated ApplicationHomeSubCluster in federation state store.

2022-06-17 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17554030#comment-17554030 ] zhengchenyu edited comment on YARN-11183 at 6/17/22 9:18 AM: -

[jira] [Comment Edited] (YARN-11183) Federation: Remove outdated ApplicationHomeSubCluster in federation state store.

2022-06-14 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17554030#comment-17554030 ] zhengchenyu edited comment on YARN-11183 at 6/14/22 11:17 AM: -

[jira] [Commented] (YARN-11154) Make router support proxy server.

2022-06-14 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17554034#comment-17554034 ] zhengchenyu commented on YARN-11154: [~slfan1989] Hi, I have submitted a draft patch first.

[jira] [Updated] (YARN-11154) Make router support proxy server.

2022-06-14 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11154: --- Attachment: YARN-11154.draft.patch > Make router support proxy server. >

[jira] [Comment Edited] (YARN-11183) Federation: Remove outdated ApplicationHomeSubCluster in federation state store.

2022-06-14 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17554030#comment-17554030 ] zhengchenyu edited comment on YARN-11183 at 6/14/22 10:49 AM: -

[jira] [Commented] (YARN-11183) Federation: Remove outdated ApplicationHomeSubCluster in federation state store.

2022-06-14 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17554030#comment-17554030 ] zhengchenyu commented on YARN-11183: In our first version, I remove ApplicationHomeSu

[jira] [Created] (YARN-11183) Federation: Remove outdated ApplicationHomeSubCluster in federation state store.

2022-06-14 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11183: -- Summary: Federation: Remove outdated ApplicationHomeSubCluster in federation state store. Key: YARN-11183 URL: https://issues.apache.org/jira/browse/YARN-11183 Project: H

[jira] [Updated] (YARN-11154) Make router support proxy server.

2022-06-14 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11154: --- Attachment: (was: YARN-11154.draft.patch) > Make router support proxy server. > -

[jira] [Updated] (YARN-11154) Make router support proxy server.

2022-06-14 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11154: --- Attachment: YARN-11154.draft.patch > Make router support proxy server. >

[jira] [Assigned] (YARN-11172) Fix testDelegationToken

2022-06-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu reassigned YARN-11172: -- Assignee: zhengchenyu > Fix testDelegationToken > --- > >

[jira] [Created] (YARN-11172) Fix testDelegationToken

2022-06-06 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11172: -- Summary: Fix testDelegationToken Key: YARN-11172 URL: https://issues.apache.org/jira/browse/YARN-11172 Project: Hadoop YARN Issue Type: Improvement R

[jira] [Commented] (YARN-10775) Federation: YARN running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-06-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17550401#comment-17550401 ] zhengchenyu commented on YARN-10775: [~slfan1989] In fact, the picture of chapter 3 h

[jira] [Commented] (YARN-10775) Federation: YARN running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-06-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17550363#comment-17550363 ] zhengchenyu commented on YARN-10775: Thanks for review and suggestion. Welcome contin

[jira] [Commented] (YARN-10775) Federation: YARN running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-06-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17550355#comment-17550355 ] zhengchenyu commented on YARN-10775: In our cluster, the final version, I don't regar

[jira] [Commented] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-06-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17550349#comment-17550349 ] zhengchenyu commented on YARN-11127: [~slfan1989] Thanks for review. In fact, fix an

[jira] [Comment Edited] (YARN-10775) Federation: YARN running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-06-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17550343#comment-17550343 ] zhengchenyu edited comment on YARN-10775 at 6/6/22 8:36 AM: [

[jira] [Commented] (YARN-10775) Federation: YARN running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-06-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17550343#comment-17550343 ] zhengchenyu commented on YARN-10775: I think you need to read my answer and document aga

[jira] [Commented] (YARN-10775) Federation: YARN running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-06-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17550333#comment-17550333 ] zhengchenyu commented on YARN-10775: [~slfan1989]  Answer 1: This operation impleme

[jira] [Updated] (YARN-11153) Make proxy server support yarn federation.

2022-06-05 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11153: --- Parent: YARN-10775 Issue Type: Sub-task (was: Improvement) > Make proxy server support yarn

[jira] [Updated] (YARN-11154) Make router support proxy server.

2022-06-05 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11154: --- Parent: YARN-10775 Issue Type: Sub-task (was: Improvement) > Make router support proxy serve

[jira] [Commented] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-19 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17539363#comment-17539363 ] zhengchenyu commented on YARN-11127: Thanks [~hexiaoqiao] . Maybe it is a low probabi

[jira] [Updated] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Updated] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Updated] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Comment Edited] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17537403#comment-17537403 ] zhengchenyu edited comment on YARN-10775 at 5/16/22 8:49 AM: -

[jira] [Comment Edited] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17532828#comment-17532828 ] zhengchenyu edited comment on YARN-11127 at 5/16/22 8:45 AM: -

[jira] [Commented] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17537403#comment-17537403 ] zhengchenyu commented on YARN-10775: [~inigoiri]  [~snemeth] [~ayushsaxena] [~bteke]

[jira] [Comment Edited] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17537403#comment-17537403 ] zhengchenyu edited comment on YARN-10775 at 5/16/22 8:44 AM: -

[jira] [Updated] (YARN-11153) Make proxy server support yarn federation.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11153: --- Description: For details, see: https://issues.apache.org/jira/browse/YARN-10775 and YARN-10775-des

[jira] [Updated] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Updated] (YARN-11154) Make router support proxy server.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11154: --- Description: For details, see: https://issues.apache.org/jira/browse/YARN-10775 and YARN-10775-des

[jira] [Updated] (YARN-11153) Make proxy server support yarn federation.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11153: --- Description: For details, see: https://issues.apache.org/jira/browse/YARN-10775 and  > Make proxy

[jira] [Commented] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17537392#comment-17537392 ] zhengchenyu commented on YARN-10775: YARN-10786 describes the same problem, but has two p

[jira] [Created] (YARN-11154) Make router support proxy server.

2022-05-16 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11154: -- Summary: Make router support proxy server. Key: YARN-11154 URL: https://issues.apache.org/jira/browse/YARN-11154 Project: Hadoop YARN Issue Type: Improvement

[jira] [Created] (YARN-11153) Make proxy server support yarn federation.

2022-05-16 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11153: -- Summary: Make proxy server support yarn federation. Key: YARN-11153 URL: https://issues.apache.org/jira/browse/YARN-11153 Project: Hadoop YARN Issue Type: Improv

[jira] [Updated] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Attachment: YARN-10775-design-doc.001.pdf > Federation: Yarn running app web can't be unable to conne

[jira] [Updated] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Updated] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Commented] (YARN-6539) Create SecureLogin inside Router

2022-05-16 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-6539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17537385#comment-17537385 ] zhengchenyu commented on YARN-6539: --- Any new progress on this? I have applied this patc

[jira] [Updated] (YARN-11148) In federation and security mode, nm recover may fail.

2022-05-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11148: --- Description: In a federated YARN cluster with security and NM recovery enabled, an NM restart may f

[jira] [Updated] (YARN-11148) In federation and security mode, nm recover may fail.

2022-05-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11148: --- Description: Exception stack {code:java} 2022-05-08 00:44:11,536 WARN org.apache.hadoop.ipc.Client: E

[jira] [Updated] (YARN-11148) In federation and security mode, nm recover may fail.

2022-05-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11148: --- Description: Exception stack {code:java} 2022-05-08 00:44:11,536 WARN org.apache.hadoop.ipc.Client: E

[jira] [Created] (YARN-11148) In federation and security mode, nm recover may fail.

2022-05-13 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11148: -- Summary: In federation and security mode, nm recover may fail. Key: YARN-11148 URL: https://issues.apache.org/jira/browse/YARN-11148 Project: Hadoop YARN Issue T

[jira] [Updated] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2022-05-13 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Description: I set up a YARN federation cluster. I can't connect to the running app web, but the complet

[jira] [Updated] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-07 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11127: --- Description: I found an RM deadlock in our cluster. It's a low-probability event. Some critical jstack

[jira] [Commented] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17533204#comment-17533204 ] zhengchenyu commented on YARN-11127: Another problem is that when the dispatcher thread s

[jira] [Commented] (YARN-11132) RM failover may fail when Dispatcher stuck.

2022-05-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17533203#comment-17533203 ] zhengchenyu commented on YARN-11132: I think we could watch the head element of event

[jira] [Created] (YARN-11132) RM failover may fail when Dispatcher stuck.

2022-05-06 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11132: -- Summary: RM failover may fail when Dispatcher stuck. Key: YARN-11132 URL: https://issues.apache.org/jira/browse/YARN-11132 Project: Hadoop YARN Issue Type: Impro

[jira] [Commented] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17532828#comment-17532828 ] zhengchenyu commented on YARN-11127: [~vinodkv] [~bteke] [~pbacsko] [~bilwa_st] [~z

[jira] [Comment Edited] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17532824#comment-17532824 ] zhengchenyu edited comment on YARN-11127 at 5/6/22 12:07 PM: -

[jira] [Commented] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17532824#comment-17532824 ] zhengchenyu commented on YARN-11127: aggregateLogReport introduce by YARN-1376 then t

[jira] [Updated] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11127: --- Description: I found rm deadlock in our cluster. It's a low probability event. some critical jstack

[jira] [Updated] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11127: --- Description: I found rm deadlock in our cluster. It's a low probability event. some critical jstack

[jira] [Updated] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-06 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-11127: --- Description: I found rm deadlock in our cluster. It's a low probability event. some critical jstack

[jira] [Created] (YARN-11127) Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention.

2022-05-06 Thread zhengchenyu (Jira)
zhengchenyu created YARN-11127: -- Summary: Potential deadlock in AsyncDispatcher caused by RMNodeImpl, SchedulerApplicationAttempt and RMAppImpl's lock contention. Key: YARN-11127 URL: https://issues.apache.org/jira/b

[jira] [Issue Comment Deleted] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2021-05-31 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu updated YARN-10775: --- Comment: was deleted (was: I think maybe we need to construct a proxy server in nm to proxy am's web

[jira] [Commented] (YARN-10786) Federation:We can't access the AM page while using federation

2021-05-31 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17354292#comment-17354292 ] zhengchenyu commented on YARN-10786: I don't think it's a good way to solve this prob

[jira] [Commented] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2021-05-31 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17354290#comment-17354290 ] zhengchenyu commented on YARN-10775: I think maybe we need to construct a proxy serve

[jira] [Created] (YARN-10776) Make ConfiguredRMFailoverProxyProvider select ResourceManager or Router randomly

2021-05-18 Thread zhengchenyu (Jira)
zhengchenyu created YARN-10776: -- Summary: Make ConfiguredRMFailoverProxyProvider select ResourceManager or Router randomly Key: YARN-10776 URL: https://issues.apache.org/jira/browse/YARN-10776 Project: H

[jira] [Assigned] (YARN-10776) Make ConfiguredRMFailoverProxyProvider select ResourceManager or Router randomly

2021-05-18 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu reassigned YARN-10776: -- Assignee: zhengchenyu > Make ConfiguredRMFailoverProxyProvider select ResourceManager or Route

[jira] [Assigned] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2021-05-18 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengchenyu reassigned YARN-10775: -- Assignee: zhengchenyu > Federation: Yarn running app web can't be unable to connect, because

[jira] [Created] (YARN-10775) Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address.

2021-05-18 Thread zhengchenyu (Jira)
zhengchenyu created YARN-10775: -- Summary: Federation: Yarn running app web can't be unable to connect, because AppMaster can't redirect to the right address. Key: YARN-10775 URL: https://issues.apache.org/jira/brows

[jira] [Commented] (YARN-6202) Configuration item Dispatcher.DISPATCHER_EXIT_ON_ERROR_KEY is disregarded

2021-03-29 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-6202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17310564#comment-17310564 ] zhengchenyu commented on YARN-6202: --- [~yufeigu] I agree that exitOnDispatchException sho

[jira] [Comment Edited] (YARN-10642) Race condition: AsyncDispatcher can get stuck by the changes introduced in YARN-8995

2021-03-05 Thread zhengchenyu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17296075#comment-17296075 ] zhengchenyu edited comment on YARN-10642 at 3/5/21, 4:01 PM: -
