[ 
https://issues.apache.org/jira/browse/BEAM-7109?focusedWorklogId=230399&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-230399
 ]

ASF GitHub Bot logged work on BEAM-7109:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 20/Apr/19 07:08
            Start Date: 20/Apr/19 07:08
    Worklog Time Spent: 10m 
      Work Description: angoenka commented on pull request #8367: [BEAM-7109] 
Do not reconnect logging at termination
URL: https://github.com/apache/beam/pull/8367
 
 
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 230399)
    Time Spent: 50m  (was: 40m)

> Thread leaking in Portable Python Precommit 
> --------------------------------------------
>
>                 Key: BEAM-7109
>                 URL: https://issues.apache.org/jira/browse/BEAM-7109
>             Project: Beam
>          Issue Type: Bug
>          Components: testing
>            Reporter: yifan zou
>            Assignee: Ankur Goenka
>            Priority: Critical
>         Attachments: threadDump.txt, thread_dump.txt
>
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> Beam Jenkins constantly break due to some weird errors such as "Unable to 
> create new native thread". The recent build worker failure happened on 
> [apache-beam-jenkins-8] 
> ([https://builds.apache.org/computer/apache-beam-jenkins-8/builds]). Checking 
> the thread number on that VM shows: 
> Thread limit: kernel.pid_max = 32768 
> Actual used: 32411
>  
> Dumping the thread usage (see [^threadDump.txt]) exposed thread leaking on 
> some Python tests. And based on the execution history of the jenkins-8, the 
> [beam_PreCommit_Portable_Python_Commit] 
> ([https://builds.apache.org/job/beam_PreCommit_Portable_Python_Commit]) is 
> suspicious. We ran this test multiple times on a plain node and observed that 
> some thread started by +_apache_beam.runners.worker.sdk_worker_main_+ were 
> not tear down after tests complete. The stale threads finally accumulated and 
> ate the VM kernel thread quota. 
>  
> cc: [~alanmyrvold], [~jasonkuster], [~altay]
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to