[jira] [Assigned] (YARN-9832) YARN UI has decommissioned nodemanager links

2019-09-16 Thread Prabhu Joseph (Jira)
[ https://issues.apache.org/jira/browse/YARN-9832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph reassigned YARN-9832: --- Assignee: Tarun Parimi (was: Prabhu Joseph) > YARN UI has decommissioned nodemanager links

[jira] [Commented] (YARN-9834) Allow using a pool of local users to run Yarn Secure Container in secure mode

2019-09-16 Thread Eric Yang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930716#comment-16930716 ] Eric Yang commented on YARN-9834: - [~shanyu] SSSD does not mirror all users, and it only caches users on

[jira] [Created] (YARN-9836) General usability improvements in showSimulationTrace.html

2019-09-16 Thread Adam Antal (Jira)
Adam Antal created YARN-9836: Summary: General usability improvements in showSimulationTrace.html Key: YARN-9836 URL: https://issues.apache.org/jira/browse/YARN-9836 Project: Hadoop YARN Issue

[jira] [Updated] (YARN-9837) YARN Service fails to fetch status for Stopped apps with bigger spec files

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9837: --- Attachment: YARN-9837.001.patch > YARN Service fails to fetch status for Stopped apps with bigger

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930656#comment-16930656 ] Hadoop QA commented on YARN-9011: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Created] (YARN-9837) YARN Service fails to fetch status for Stopped apps with bigger spec files

2019-09-16 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-9837: -- Summary: YARN Service fails to fetch status for Stopped apps with bigger spec files Key: YARN-9837 URL: https://issues.apache.org/jira/browse/YARN-9837 Project: Hadoop

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-004.patch > Race condition during decommissioning >

[jira] [Commented] (YARN-9837) YARN Service fails to fetch status for Stopped apps with bigger spec files

2019-09-16 Thread Eric Yang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930770#comment-16930770 ] Eric Yang commented on YARN-9837: - [~tarunparimi] Thank you for the patch. Patch 001 looks good to me,

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930774#comment-16930774 ] Hadoop QA commented on YARN-9011: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9837) YARN Service fails to fetch status for Stopped apps with bigger spec files

2019-09-16 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930775#comment-16930775 ] Hadoop QA commented on YARN-9837: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-9834) Allow using a pool of local users to run Yarn Secure Container in secure mode

2019-09-16 Thread shanyu zhao (Jira)
[ https://issues.apache.org/jira/browse/YARN-9834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shanyu zhao updated YARN-9834: -- Description: Yarn Secure Container in secure mode allows separation of different user's local files

[jira] [Updated] (YARN-9834) Allow using a pool of local users to run Yarn Secure Container in secure mode

2019-09-16 Thread shanyu zhao (Jira)
[ https://issues.apache.org/jira/browse/YARN-9834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shanyu zhao updated YARN-9834: -- Description: Yarn Secure Container in secure mode allows separation of different user's local files

[jira] [Commented] (YARN-9834) Allow using a pool of local users to run Yarn Secure Container in secure mode

2019-09-16 Thread Eric Yang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930980#comment-16930980 ] Eric Yang commented on YARN-9834: - [~shanyu] {quote}I forgot to mention that for Winbind/SSSD to work the

[jira] [Commented] (YARN-5913) Consolidate "resource" and "amResourceRequest" in ApplicationSubmissionContext

2019-09-16 Thread Daniel Templeton (Jira)
[ https://issues.apache.org/jira/browse/YARN-5913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16931077#comment-16931077 ] Daniel Templeton commented on YARN-5913: The issue at hand is that in

[jira] [Commented] (YARN-9834) Allow using a pool of local users to run Yarn Secure Container in secure mode

2019-09-16 Thread shanyu zhao (Jira)
[ https://issues.apache.org/jira/browse/YARN-9834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930998#comment-16930998 ] shanyu zhao commented on YARN-9834: --- [~eyang], You are talking about Docker container executor. What I

[jira] [Commented] (YARN-9834) Allow using a pool of local users to run Yarn Secure Container in secure mode

2019-09-16 Thread shanyu zhao (Jira)
[ https://issues.apache.org/jira/browse/YARN-9834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930945#comment-16930945 ] shanyu zhao commented on YARN-9834: --- Thanks [~eyang]! I forgot to mention that for Winbind/SSSD to work

[jira] [Commented] (YARN-9837) YARN Service fails to fetch status for Stopped apps with bigger spec files

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16931109#comment-16931109 ] Tarun Parimi commented on YARN-9837: Thanks for the review [~eyang] . > YARN Service fails to fetch

[jira] [Commented] (YARN-9814) JobHistoryServer can't delete aggregated files, if remote app root directory is created by NodeManager

2019-09-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930269#comment-16930269 ] Adam Antal commented on YARN-9814: -- Thanks for the review [~Prabhu Joseph]. Indeed, you're right about

[jira] [Updated] (YARN-9814) JobHistoryServer can't delete aggregated files, if remote app root directory is created by NodeManager

2019-09-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-9814: - Attachment: YARN-9814.004.patch > JobHistoryServer can't delete aggregated files, if remote app root

[jira] [Commented] (YARN-9814) JobHistoryServer can't delete aggregated files, if remote app root directory is created by NodeManager

2019-09-16 Thread Sunil Govindan (Jira)
[ https://issues.apache.org/jira/browse/YARN-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930281#comment-16930281 ] Sunil Govindan commented on YARN-9814: -- Thanks [~adam.antal]. This approach looks fine to me.

[jira] [Commented] (YARN-9814) JobHistoryServer can't delete aggregated files, if remote app root directory is created by NodeManager

2019-09-16 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930335#comment-16930335 ] Hadoop QA commented on YARN-9814: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-9814) JobHistoryServer can't delete aggregated files, if remote app root directory is created by NodeManager

2019-09-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-9814: - Attachment: YARN-9814.005.patch > JobHistoryServer can't delete aggregated files, if remote app root

[jira] [Commented] (YARN-9833) Race condition when DirectoryCollection.checkDirs() runs during container launch

2019-09-16 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930476#comment-16930476 ] Hadoop QA commented on YARN-9833: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9833) Race condition when DirectoryCollection.checkDirs() runs during container launch

2019-09-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930481#comment-16930481 ] Adam Antal commented on YARN-9833: -- +1 (non-binding). > Race condition when

[jira] [Commented] (YARN-9833) Race condition when DirectoryCollection.checkDirs() runs during container launch

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930498#comment-16930498 ] Tarun Parimi commented on YARN-9833: Great find. Recently came across this issue in a production

[jira] [Commented] (YARN-9794) RM crashes due to runtime errors in TimelineServiceV2Publisher

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930503#comment-16930503 ] Tarun Parimi commented on YARN-9794: Thanks [~abmodi],[~Prabhu Joseph] for the reviews and commit. >

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-003.patch > Race condition during decommissioning >

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: (was: YARN-9011-003.patch) > Race condition during decommissioning >

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-003.patch > Race condition during decommissioning >

[jira] [Commented] (YARN-9814) JobHistoryServer can't delete aggregated files, if remote app root directory is created by NodeManager

2019-09-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930361#comment-16930361 ] Adam Antal commented on YARN-9814: -- Thanks for the review [~sunilg]. - The extra debug logging seems a

[jira] [Updated] (YARN-9733) Method getCpuUsagePercent in Class ProcfsBasedProcessTree return 0 when subprocess of container dead

2019-09-16 Thread qian han (Jira)
[ https://issues.apache.org/jira/browse/YARN-9733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] qian han updated YARN-9733: --- Attachment: YARN-9733.001.patch > Method getCpuUsagePercent in Class ProcfsBasedProcessTree return 0 when >

[jira] [Updated] (YARN-9733) Method getCpuUsagePercent in Class ProcfsBasedProcessTree return 0 when subprocess of container dead

2019-09-16 Thread qian han (Jira)
[ https://issues.apache.org/jira/browse/YARN-9733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] qian han updated YARN-9733: --- Attachment: (was: YARN-9733.001.patch) > Method getCpuUsagePercent in Class ProcfsBasedProcessTree return

[jira] [Updated] (YARN-9833) Race condition when DirectoryCollection.checkDirs() runs during container launch

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9833: --- Attachment: YARN-9833-001.patch > Race condition when DirectoryCollection.checkDirs() runs during

[jira] [Commented] (YARN-9814) JobHistoryServer can't delete aggregated files, if remote app root directory is created by NodeManager

2019-09-16 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930429#comment-16930429 ] Hadoop QA commented on YARN-9814: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9766) YARN CapacityScheduler QueueMetrics has missing metrics for parent queues having same name

2019-09-16 Thread Prabhu Joseph (Jira)
[ https://issues.apache.org/jira/browse/YARN-9766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930516#comment-16930516 ] Prabhu Joseph commented on YARN-9766: - [~tarunparimi] The patch looks good. +1 (non-binding)

[jira] [Commented] (YARN-9772) CapacitySchedulerQueueManager has incorrect list of queues

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930521#comment-16930521 ] Tarun Parimi commented on YARN-9772: bq. Should we extend the duplicates check (as of now, it does