[jira] [Commented] (YARN-10398) Every NM will try to upload Jar/Archives/Files/Resources to Yarn Shared Cache Manager Like DDOS

2020-08-24 Thread Wanqiang Ji (Jira)


[ 
https://issues.apache.org/jira/browse/YARN-10398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17183395#comment-17183395
 ] 

Wanqiang Ji commented on YARN-10398:


Thanks [~wzzdreamer] for the work. As far as I know, the YARN Shared Cache is 
available to all YARN applications, but this PR is related to MapReduce, so I 
think we should move it to the MAPREDUCE project.

> Every NM will try to upload Jar/Archives/Files/Resources to Yarn Shared Cache 
> Manager Like DDOS
> ---
>
> Key: YARN-10398
> URL: https://issues.apache.org/jira/browse/YARN-10398
> Project: Hadoop YARN
>  Issue Type: Bug
>  Components: yarn
>Affects Versions: 2.9.0, 3.0.0, 3.1.0, 2.9.1, 3.0.1, 3.0.2, 3.2.0, 3.1.1, 
> 2.9.2, 3.0.3, 3.0.4, 3.1.2, 3.3.0, 3.2.1, 2.9.3, 3.1.3, 3.2.2, 3.1.4, 3.4.0, 
> 3.3.1, 3.1.5
>Reporter: zhenzhao wang
>Assignee: zhenzhao wang
>Priority: Major
>
> The yarn shared cache manager is designed so that only the application master 
> uploads the jar/files/resources. However, there has been a bug in the code 
> since 2.9.0: every node manager that takes a job task will try to upload the 
> jar/resources. Let's say one job has 5000 tasks. Then up to 5000 NMs will try 
> to upload the jar. This is like a DDoS and creates a snowball effect. It ends 
> up making the yarn shared cache manager unavailable, which causes timeouts in 
> localization and leads to job failure.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-10398) Every NM will try to upload Jar/Archives/Files/Resources to Yarn Shared Cache Manager Like DDOS

2020-08-23 Thread zhenzhao wang (Jira)


[ 
https://issues.apache.org/jira/browse/YARN-10398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182949#comment-17182949
 ] 

zhenzhao wang commented on YARN-10398:
--

[~jiwq] I double-checked and confirmed that the PR is the fix for the problem. 
The reason a non-application master tries to upload is that the clear code 
didn't work. The code and the bug are in YARN; MR uses the YARN shared cache. 
I'm not sure we should move it to the MR project. Thanks.
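
To make the failure mode discussed in this thread concrete, here is a minimal, self-contained Java sketch. It is a toy model and not Hadoop code: the class, field, and method names are all hypothetical, and it assumes only what the thread states, namely that the application master is the one meant to upload and that some "clear" step is supposed to stop later containers from uploading again.

```java
/** Illustrative model only; none of these names come from the Hadoop code base. */
public class SharedCacheUploadSketch {

    /** Hypothetical stand-in for a job resource carrying an "upload to shared cache" flag. */
    static final class Resource {
        final String name;
        boolean uploadToSharedCache;

        Resource(String name, boolean uploadToSharedCache) {
            this.name = name;
            this.uploadToSharedCache = uploadToSharedCache;
        }
    }

    /**
     * Counts the upload attempts that reach the shared cache manager for one job.
     *
     * @param taskContainers   number of task containers launched after the application master
     * @param clearFlagAfterAm whether the (assumed) clear step works once the AM has run
     */
    public static int countUploadAttempts(int taskContainers, boolean clearFlagAfterAm) {
        Resource jar = new Resource("job.jar", true);
        int attempts = 0;

        // The application master container sees the resource first.
        if (jar.uploadToSharedCache) {
            attempts++;
        }
        // Intended behavior: clear the flag so that task containers do not upload.
        if (clearFlagAfterAm) {
            jar.uploadToSharedCache = false;
        }
        // Every task container whose resource still carries the flag tries to upload too.
        for (int i = 0; i < taskContainers; i++) {
            if (jar.uploadToSharedCache) {
                attempts++;
            }
        }
        return attempts;
    }

    public static void main(String[] args) {
        System.out.println("clear step broken:  " + countUploadAttempts(5000, false));
        System.out.println("clear step working: " + countUploadAttempts(5000, true));
    }
}
```

In this model a job with 5000 task containers produces 5001 upload attempts when the clear step is broken and a single attempt when it works, which is the gap between the intended design and the behavior reported in the issue.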




[jira] [Commented] (YARN-10398) Every NM will try to upload Jar/Archives/Files/Resources to Yarn Shared Cache Manager Like DDOS

2020-08-22 Thread Wanqiang Ji (Jira)


[ 
https://issues.apache.org/jira/browse/YARN-10398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182402#comment-17182402
 ] 

Wanqiang Ji commented on YARN-10398:


Hi [~wzzdreamer], I found that this topic and the PR are not related, and I 
have left a comment on the PR.




[jira] [Commented] (YARN-10398) Every NM will try to upload Jar/Archives/Files/Resources to Yarn Shared Cache Manager Like DDOS

2020-08-22 Thread Wanqiang Ji (Jira)


[ 
https://issues.apache.org/jira/browse/YARN-10398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182398#comment-17182398
 ] 

Wanqiang Ji commented on YARN-10398:


Hi [~wzzdreamer], thanks for your report. I think we should move this ticket to 
the MAPREDUCE project.




[jira] [Commented] (YARN-10398) Every NM will try to upload Jar/Archives/Files/Resources to Yarn Shared Cache Manager Like DDOS

2020-08-12 Thread zhenzhao wang (Jira)


[ 
https://issues.apache.org/jira/browse/YARN-10398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176758#comment-17176758
 ] 

zhenzhao wang commented on YARN-10398:
--

[~templedf] Could you please help review this patch? Thanks!
