Hi Walt,

Just to follow up, I valgrind the server occasionally. It works fairly well for finding memory errors (it found that early free with BMI_memfree we had last week). I usually get a couple errors when the server starts that can be ignored, IIRC they're epoll related. Its especially nice for mem leak checking...RobL and I have talked about adding a nightly set of server tests where the server runs in valgrind to check for mem leaks and other errors valgrind may find, but talk is as far as we've gotten.

As a side note, I've also tried to use valgrind to find some race conditions (with the helgrind tool) in the client api that appear when using multiple threads. The issue is that the latest version of valgrind doesn't support the helgrind tool, so you have to go back and use 2.2.0, which turns out to work fairly well, although I was getting a lot of false positives.

-sam

On Aug 29, 2006, at 9:55 AM, Walter B. Ligon III wrote:

I've got RHEL3 and I'm running valgrind 3.1.1 - seems to be working OK now, and its really not too bad. I think I'm homing in on it, though any hints for making it easier are welcome!

Right now it looks like I'm calling job_null with a bad smcb - things are a little weird ...

Usually, when we start a new SM we immediately begin executing it until it blocks, then we return to what we were doing. The test I'm running is weird because the little SM I'm running concurrently in the middle of a get_config doesn't DO anything, so it never blocks, so it actually runs to completion before returning to the caller.

Now, I considered altering this so that the new SMs are NOT started immediately, but scheduled to run, but after talking to Pete decided not to do that - after all, REAL SMs will block, and this might reduce latency.

So, what I'm getting it is, I might be calling job_null on a SM that later completes and gets freed. I'm not sure, I'm trying to figure that out and decide on a way to cleanly deal with that.

(you may recall I said back in July that termination was the real bug-a-boo to this thing)

Walt

Rob Ross wrote:
hey walt,
i've had mixed results. what os and distro are you trying?
rob
Walter B. Ligon III wrote:
has anyone run the server with valgrind?
I have a nasty memory bug and I'm trying to use it to find the problem, but the server won't run with it - tells me some of the db functions don't work????

Any experience?

Walt

--
Dr. Walter B. Ligon III
Associate Professor
ECE Department
Clemson University
_______________________________________________
Pvfs2-developers mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers


_______________________________________________
Pvfs2-developers mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers

Reply via email to