Damn good catch, Peter.

Christian.

On 5-Sep-08, at 11:49 , Peter Janes wrote:

After having this happen a third time I took a closer look at the database and saw that one of the entries in the BUILDRESULT table had an END_TIME of 0. When I changed that field to a current time and restarted the server Continuum started to work again.

I was able to reproduce the problem by setting the END_TIME of the last BUILDRESULT to 0, starting the server, and waiting for that project to be queued. It was resolved again when I reset END_TIME to its original value.

I've reported this as http://jira.codehaus.org/browse/CONTINUUM-1871 and attached the thread dump.

Thanks,
Peter J.

Brett Porter wrote:
I certainly haven't seen something like this - perhaps something in the project change caused an infinite loop that might be a bug in the Maven
project handling.
Are you able to get a thread dump from Continuum when it occurs?
Thanks,
Brett
2008/9/3 Peter Janes <[EMAIL PROTECTED]>
Since adding a set of projects to a Continuum 1.2 instance last Thursday, the server has stopped performing any builds on any projects. The server had performed hundreds of builds without issue over a period of several
months.

The initial problem that appeared was that projects would be queued but never started; as soon as the first job entered the queue Continuum would start using 100% CPU and never start building anything. Even when I removed all of the projects from the queue it remained at 100% CPU. Restarting the server didn't help: as soon as a job was queued the CPU would spike and no
further progress would be made.

Thinking that somehow the (MySQL) database might be corrupt, I removed it and started fresh. I added the projects that appeared to cause the problem
and they built successfully, so I started re-adding the rest of the
projects. Unfortunately, at some point while I was doing this builds
stopped being executed again: this time they get queued up and are
apparently started, but I don't see any build processes being invoked. Even
the simplest pom-only job times out.  The timeouts are shown in the
Continuum log but aren't being recorded in the database as failures or errors, and restarting doesn't make a difference, which suggests to me that
perhaps the database has been corrupted again.

I don't see anything in Jira or the list archives; has anyone seen
behaviours like this? More importantly, has anyone been able to solve them?

Environment: RHEL 5, Continuum 1.2 (built in May), Java 1.6.0_03, MySQL
5.0.45, Maven 2.0.9

Peter J.
--
Sometimes the Universe needs a change of perspective.
--J. Michael Straczynski



--
Sometimes the Universe needs a change of perspective.
 --J. Michael Straczynski


Reply via email to