[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-02-02 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: h2. Background GPU topology affects performance. There's been a discussion in

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-02-02 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-02-02 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: h2. Background GPU topology affects performance. There's been a discussion in

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-02-02 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: h2. Background GPU topology affects performance. There's been a discussion in

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-02-02 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: h2. Background GPU topology affects performance. There's been a discussion in

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.017.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Commented] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758368#comment-16758368 ] Zhankun Tang commented on YARN-9060: [~sunilg] , Thanks. And there's indeed a potential issue that it

[jira] [Comment Edited] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758357#comment-16758357 ] Zhankun Tang edited comment on YARN-9060 at 2/1/19 2:30 PM: [~sunilg] , Thanks

[jira] [Comment Edited] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758368#comment-16758368 ] Zhankun Tang edited comment on YARN-9060 at 2/1/19 2:39 PM: [~sunilg] , 

[jira] [Comment Edited] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758357#comment-16758357 ] Zhankun Tang edited comment on YARN-9060 at 2/1/19 2:29 PM: [~sunilg] , Thanks

[jira] [Commented] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758357#comment-16758357 ] Zhankun Tang commented on YARN-9060: [~sunilg] , Thanks for mentioning this good question. >From what

[jira] [Commented] (YARN-9265) FPGA plugin fails to recognize Intel Processing Accelerator Card

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758312#comment-16758312 ] Zhankun Tang commented on YARN-9265: [~pbacsko] , Thanks for following up the FPGA features. I think

[jira] [Updated] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9190: --- Priority: Major (was: Minor) > [Submarine] Submarine job will fail to run as a first job on a new

[jira] [Commented] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758129#comment-16758129 ] Zhankun Tang commented on YARN-9190: [~sunilg], [~billie.rinaldi], [~leftnoteasy] . I just encountered

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-02-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Attachment: YARN-8821-trunk.005.patch > GPU hierarchy/topology scheduling support >

[jira] [Commented] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757952#comment-16757952 ] Zhankun Tang commented on YARN-9060: [~sunilg] , Thanks for the review! {quote}In below comments, it's

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.016.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.015.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.015.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: (was: YARN-9060-trunk.015.patch) > [YARN-8851] Phase 1 - Support device isolation and

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-30 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Attachment: YARN-8821-trunk.004.patch > GPU hierarchy/topology scheduling support >

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-30 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.014.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Commented] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-30 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16756238#comment-16756238 ] Zhankun Tang commented on YARN-9060: [~cheersyang] , Thanks for the review! For point 1, added the

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-30 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.013.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Attachment: YARN-8821-trunk.003.patch > GPU hierarchy/topology scheduling support >

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Attachment: YARN-8821-trunk.002.patch > GPU hierarchy/topology scheduling support >

[jira] [Comment Edited] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16754592#comment-16754592 ] Zhankun Tang edited comment on YARN-8821 at 1/29/19 4:43 AM: - [~leftnoteasy] ,

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Comment Edited] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16754592#comment-16754592 ] Zhankun Tang edited comment on YARN-8821 at 1/29/19 4:41 AM: - [~leftnoteasy] ,

[jira] [Commented] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16754592#comment-16754592 ] Zhankun Tang commented on YARN-8821: [~leftnoteasy] , [~cheersyang] , [~sunilg] . Please help to

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Commented] (YARN-9099) GpuResourceAllocator.getReleasingGpus calculates number of GPUs in a wrong way

2019-01-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16754568#comment-16754568 ] Zhankun Tang commented on YARN-9099: [~pbacsko], [~snemeth] . Since it's a quite straight forward fix.

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-27 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Summary: [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-27 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Description: Due to the cgroups v1 implementation policy in linux kernel, we cannot update the value 

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-26 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Attachment: YARN-8821-trunk.001.patch > GPU hierarchy/topology scheduling support >

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-26 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Description: GPU topology affects performance dramatically. There's been a discussion in YARN-7481.

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-26 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.012.patch > [YARN-8851] Phase 1 - Support device isolation in native

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-24 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.011.patch > [YARN-8851] Phase 1 - Support device isolation in native

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-24 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.010.patch > [YARN-8851] Phase 1 - Support device isolation in native

[jira] [Comment Edited] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-24 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16751008#comment-16751008 ] Zhankun Tang edited comment on YARN-9060 at 1/24/19 10:56 AM: -- [~cheersyang]

[jira] [Commented] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-24 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16751008#comment-16751008 ] Zhankun Tang commented on YARN-9060: [~cheersyang] , [~sunilg] , The patch consists below key things:

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-24 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.009.patch > [YARN-8851] Phase 1 - Support device isolation in native

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-23 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749802#comment-16749802 ] Zhankun Tang commented on YARN-9205: [~cheersyang], [~sunilg], [~leftnoteasy]. Thanks for the review!

[jira] [Updated] (YARN-9218) When register to MockRM, MockNM spend 30s on InetAddress.getByName(name).getHostAddress();

2019-01-23 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9218: --- Attachment: YARN-9218-trunk.001.patch > When register to MockRM, MockNM spend 30s on >

[jira] [Assigned] (YARN-9218) When register to MockRM, MockNM spend 30s on InetAddress.getByName(name).getHostAddress();

2019-01-23 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang reassigned YARN-9218: -- Assignee: Zhankun Tang > When register to MockRM, MockNM spend 30s on >

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749632#comment-16749632 ] Zhankun Tang commented on YARN-9205: [~cheersyang], [~sunilg] . Thanks for the review. The 3.1/3.2

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-branch-3.1.001.patch > When using custom resource type, application will fail

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-branch-3.2.001.patch > When using custom resource type, application will fail

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749540#comment-16749540 ] Zhankun Tang commented on YARN-9205: [~sunilg], [~cheersyang] . The v09 patch seems ok now.  > When

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.009.patch > When using custom resource type, application will fail to run

[jira] [Updated] (YARN-8821) GPU hierarchy/topology scheduling support

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8821: --- Summary: GPU hierarchy/topology scheduling support (was: GPU hierarchy scheduling support) > GPU

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16748839#comment-16748839 ] Zhankun Tang commented on YARN-9205: [~sunilg], [~leftnoteasy] , The v08.patch is the latest patch

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.008.patch > When using custom resource type, application will fail to run

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.007.patch > When using custom resource type, application will fail to run

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16748738#comment-16748738 ] Zhankun Tang commented on YARN-9205: [~sunilg] , Even we use the prior version of changes, the added

[jira] [Updated] (YARN-9218) When register to MockRM, MockNM spend 30s on InetAddress.getByName(name).getHostAddress();

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9218: --- Description: In a test case, reproduce the issue with below code. And you'll see that this three

[jira] [Updated] (YARN-9218) When register to MockRM, MockNM spend 30s on InetAddress.getByName(name).getHostAddress();

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9218: --- Priority: Major (was: Minor) > When register to MockRM, MockNM spend 30s on >

[jira] [Created] (YARN-9218) When register to MockRM, MockNM spend 30s on InetAddress.getByName(name).getHostAddress();

2019-01-22 Thread Zhankun Tang (JIRA)
Zhankun Tang created YARN-9218: -- Summary: When register to MockRM, MockNM spend 30s on InetAddress.getByName(name).getHostAddress(); Key: YARN-9218 URL: https://issues.apache.org/jira/browse/YARN-9218

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-22 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.006.patch > When using custom resource type, application will fail to run

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-21 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.005.patch > When using custom resource type, application will fail to run

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-21 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16748397#comment-16748397 ] Zhankun Tang commented on YARN-9205: [~leftnoteasy] , Yeah. Please review. Added two test cases for

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-21 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.004.patch > When using custom resource type, application will fail to run

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-21 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.003.patch > When using custom resource type, application will fail to run

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-19 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16747333#comment-16747333 ] Zhankun Tang commented on YARN-9205: Double-checked that reinit method doesn't have this issue.

[jira] [Comment Edited] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-19 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746895#comment-16746895 ] Zhankun Tang edited comment on YARN-9205 at 1/20/19 3:54 AM: - [~leftnoteasy] ,

[jira] [Comment Edited] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746895#comment-16746895 ] Zhankun Tang edited comment on YARN-9205 at 1/19/19 1:52 AM: - [~leftnoteasy] ,

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746895#comment-16746895 ] Zhankun Tang commented on YARN-9205: [~leftnoteasy] , Thanks for review this. It's possible that any

[jira] [Comment Edited] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746429#comment-16746429 ] Zhankun Tang edited comment on YARN-9205 at 1/18/19 4:05 PM: - [~leftnoteasy],

[jira] [Comment Edited] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746429#comment-16746429 ] Zhankun Tang edited comment on YARN-9205 at 1/18/19 4:08 PM: - [~leftnoteasy],

[jira] [Comment Edited] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746429#comment-16746429 ] Zhankun Tang edited comment on YARN-9205 at 1/18/19 4:06 PM: - [~leftnoteasy],

[jira] [Comment Edited] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746429#comment-16746429 ] Zhankun Tang edited comment on YARN-9205 at 1/18/19 4:02 PM: - [~leftnoteasy],

[jira] [Commented] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746429#comment-16746429 ] Zhankun Tang commented on YARN-9205: [~leftnoteasy], [~yuan_zac] . The root cause of this issue seems

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.002.patch > When using custom resource type, application will fail to run

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-18 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Description: In a non-secure cluster. Reproduce it as follows: # Set capacity scheduler in

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-17 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Attachment: YARN-9205-trunk.001.patch > When using custom resource type, application will fail to run

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-17 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Description: In a non-secure cluster. Reproduce it as follows: # Set capacity scheduler in

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-17 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Description: In a non-secure cluster. Reproduce it as follows: # Set capacity scheduler in

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-17 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Description: In a non-secure cluster. Reproduce it as follows: # Set capacity scheduler in

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-17 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Description: In a non-secure cluster. Reproduce it as follows: # Set capacity scheduler in

[jira] [Updated] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-17 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9205: --- Affects Version/s: 3.3.0 > When using custom resource type, application will fail to run due to the

[jira] [Assigned] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-17 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang reassigned YARN-9205: -- Assignee: Zhankun Tang > When using custom resource type, application will fail to run due to

[jira] [Created] (YARN-9205) When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION)

2019-01-17 Thread Zhankun Tang (JIRA)
Zhankun Tang created YARN-9205: -- Summary: When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION) Key: YARN-9205

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-16 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.008.patch > [YARN-8851] Phase 1 - Support device isolation in native

[jira] [Comment Edited] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-01-16 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16743774#comment-16743774 ] Zhankun Tang edited comment on YARN-9190 at 1/16/19 9:04 AM: - [~sunilg] , as

[jira] [Comment Edited] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-01-16 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16743774#comment-16743774 ] Zhankun Tang edited comment on YARN-9190 at 1/16/19 9:08 AM: - [~sunilg] , as

[jira] [Commented] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-01-16 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16743774#comment-16743774 ] Zhankun Tang commented on YARN-9190: [~sunilg] , as [~billie.rinaldi] mentioned, "yarn app

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-15 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.007.patch > [YARN-8851] Phase 1 - Support device isolation in native

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2019-01-15 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.006.patch > [YARN-8851] Phase 1 - Support device isolation in native

[jira] [Commented] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-01-14 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16742654#comment-16742654 ] Zhankun Tang commented on YARN-9190: Yeah. And I did below steps to double-check: 3.2 submarine and

[jira] [Commented] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-01-13 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16741752#comment-16741752 ] Zhankun Tang commented on YARN-9190: [~billie.rinaldi] , Thanks for the reply! One thing I forget to

[jira] [Commented] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-01-10 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16739899#comment-16739899 ] Zhankun Tang commented on YARN-9190: [~billie.rinaldi] , [~eyang] , [~csingh] . Do you know which

[jira] [Updated] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-01-10 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9190: --- Description: This issue was found when verifying submarine in Hadoop 3.2.0 RC1 planning. The

[jira] [Updated] (YARN-9190) [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1 cluster

2019-01-10 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9190: --- Summary: [Submarine] Submarine job will fail to run as a first job on a new created Hadoop 3.2.0 RC1

<    1   2   3   4   5   6   7   8   9   10   >