Test failures - concurrency bugs or Socket problems?

Peter Firmstone Tue, 19 Feb 2013 02:14:57 -0800

Before I get into the details, let me just say that I'm unable toreproduce the test failures seen on OSX and Solaris x64 on localhardware, I don't have access to a debugger or thread dumps.

The tests that fail on OSX and Solaris x64 (the tests pass on sparc),are practically identical. The basic problem is discovery event's areeither not received or only some discovery events are received. Thetests allow very long time frames for these events to be received, onother OS's these tests pass rapidly.

Increasing debugging output has the effect of increasing the number ofevents received.


The tests and their details can be viewed on Jenkins.

Over the last few months I've been inspecting code manually and fixingsynchronization issues.

River has a large legacy codebase, there are many examples of inadequatesynchronization.

Ironically some of the changes I've made, although reducing testfailures on Linux and Windows has exacerbated test failures on OSX andSolaris x64.

Is there anyone on this list with access to this hardware who canreproduce these bugs?


Regards,

Peter.

Test failures - concurrency bugs or Socket problems?

Reply via email to