This might not be the appropriate place to ask this, so forgive me if I should be asking this somewhere else. Any advice on what to look for in our thread dump would be very helpful!
In the past couple of weeks, we have started to see our Jenkins server get into the following state: 1) It's CPU usage on the server is pegged at about 250% 2) The web interface will usually load, although sometimes it is slow. 3) The status of the nodes printed in the left-hand column is very out of date. It appears that all of our nodes are busy, but when you click on the jobs that the column reports they are working on, you discover that the job actually finished many minutes ago (sometimes 30+ minutes). I've just upgraded to the latest version, but shortly after it started, it got into this state again. The CPU usage has been pegged at 250%+ for about an hour, and now the web UI isn't responding (but luckily I was able to navigate to /threadDump before it stopped responding) You can see it here: https://gist.github.com/Taytay/a88fdc72baf745481b86 I am not sure what I should be looking for to be honest. I assume that there is something pretty obvious/abnormal here to explain the high CPU usage, but I don't know what it is. I should mention that our host node does _very_ little. It can only run a single short-lived job that is responsible for kicking off other downstream jobs on the actual nodes themselves. So, I wouldn't expect the usage to be much more than is necessary to drive the engine and web UI. Many thanks for any help you can provide. Taylor -- You received this message because you are subscribed to the Google Groups "Jenkins Users" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/jenkinsci-users/89dc5adb-2df0-4074-8e6d-c173d54e802c%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
