[
https://issues.apache.org/jira/browse/BEAM-7109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16821459#comment-16821459
]
yifan zou edited comment on BEAM-7109 at 4/18/19 8:04 PM:
----------------------------------------------------------
I reproduce it on a non-deploy node: jenkins-14. You can use the commands below
to ssh to the VM and run the test. I will not launch the Jenkins on that node
for now for the investigation purpose.
*gcloud auth login*
*gcloud compute --project "apache-beam-testing" ssh --zone "us-central1-b"
"apache-beam-jenkins-14"*
*sudo su jenkins*
*cd /home/jenkins/jenkins-slave/workspace/testspace/beam // I already clone
the git repo*
*./gradlew --continue --max-workers=12 -Dorg.gradle.jvmargs=-Xms2g
-Dorg.gradle.jvmargs=-Xmx4g :portablePythonPreCommit*
// Other commands I used:
sysctl -a | grep kernel.pid_max (get kernel.pid_max)
ps -eLf (list active threads)
ps -eLf | wc -l (count active threads)
top -u jenkins (list running process)
was (Author: yifanzou):
I reproduce it on a non-deploy node: jenkins-14. You can use the commands below
to ssh to the VM and run the test. I will not launch the Jenkins on that node
for now for the investigation purpose.
*gcloud auth login*
*gcloud compute --project "apache-beam-testing" ssh --zone "us-central1-b"
"apache-beam-jenkins-14"*
*sudo su jenkins*
*cd /home/jenkins/jenkins-slave/workspace/testspace/beam // I already clone
the git repo*
*./gradlew --continue --max-workers=12 -Dorg.gradle.jvmargs=-Xms2g
-Dorg.gradle.jvmargs=-Xmx4g :portablePythonPreCommit*
// Other commands I used:
sysctl -a | grep kernel.pid_max (get kernel.pid_max)
ps -eLf (list active thread)
ps -eLf | wc -l (count active thread)
top -u jenkins (list running process)
> Thread leaking in Portable Python Precommit
> --------------------------------------------
>
> Key: BEAM-7109
> URL: https://issues.apache.org/jira/browse/BEAM-7109
> Project: Beam
> Issue Type: Bug
> Components: testing
> Reporter: yifan zou
> Assignee: Ankur Goenka
> Priority: Critical
> Attachments: threadDump.txt
>
>
> Beam Jenkins constantly break due to some weird errors such as "Unable to
> create new native thread". The recent build worker failure happened on
> [apache-beam-jenkins-8]
> ([https://builds.apache.org/computer/apache-beam-jenkins-8/builds]). Checking
> the thread number on that VM shows:
> Thread limit: kernel.pid_max = 32768
> Actual used: 32411
>
> Dumping the thread usage (see [^threadDump.txt]) exposed thread leaking on
> some Python tests. And based on the execution history of the jenkins-8, the
> [beam_PreCommit_Portable_Python_Commit]
> ([https://builds.apache.org/job/beam_PreCommit_Portable_Python_Commit]) is
> suspicious. We ran this test multiple times on a plain node and observed that
> some thread started by +_apache_beam.runners.worker.sdk_worker_main_+ were
> not tear down after tests complete. The stale threads finally accumulated and
> ate the VM kernel thread quota.
>
> cc: [~alanmyrvold], [~jasonkuster], [~altay]
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)