Bala,

This is a known problem with the 1.1 series. The bad news is that I know of no fix for it, though many people work around it by running a cleanup script after each unclean run. The good news is that the 1.2 series is MUCH better, though still not perfect. I would suggest trying 1.2 to see whether it works for you.
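
For what it's worth, here is a rough sketch of what such a cleanup script might look like. It is not an official Open MPI tool. It rests on one assumption that you should check on your own cluster: an "orted" that no longer has any child processes is a leftover from an interrupted run, while an "orted" that still has children belongs to a live job. The file name cleanup_orted.py and all function names are made up for illustration.

```python
#!/usr/bin/env python3
"""Sketch: kill "orted" daemons that have no child processes left.

ASSUMPTION (not from Open MPI documentation): an orted with no children
is a leftover from an aborted run. A daemon that has only just started
could also match, so run this only between jobs.
"""
import os
import signal
import subprocess
import sys


def parse_ps(ps_output):
    """Parse `ps -eo pid,ppid,comm` output into (pid, ppid, comm) tuples."""
    procs = []
    for line in ps_output.splitlines():
        parts = line.split(None, 2)
        if len(parts) != 3:
            continue
        try:
            procs.append((int(parts[0]), int(parts[1]), parts[2].strip()))
        except ValueError:
            # Header row ("PID PPID COMMAND") or other non-numeric line.
            continue
    return procs


def find_leftover_orteds(procs):
    """Return PIDs of processes named "orted" that are nobody's parent."""
    parents = {ppid for _, ppid, _ in procs}
    return [pid for pid, _, comm in procs
            if comm == "orted" and pid not in parents]


def kill_leftover_orteds():
    """List processes on this node and SIGKILL each childless orted."""
    out = subprocess.run(["ps", "-eo", "pid,ppid,comm"],
                         capture_output=True, text=True, check=True).stdout
    for pid in find_leftover_orteds(parse_ps(out)):
        try:
            os.kill(pid, signal.SIGKILL)
            print("killed leftover orted", pid)
        except ProcessLookupError:
            pass  # it exited on its own in the meantime


if __name__ == "__main__":
    # Only act when explicitly asked to.
    if "--kill" in sys.argv[1:]:
        kill_leftover_orteds()
    else:
        print("usage: cleanup_orted.py --kill")
```

You would have to run it on every node (for example over ssh) after an unclean run. Checking for children, rather than killing every "orted", is meant to leave the daemons of jobs that are still running alone.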

Hope this helps,

Tim

On Mar 17, 2007, at 9:58 AM, Bala wrote:

Hi All,
       We have installed a 16-node Intel x86_64 cluster
(blade servers, dual CPU, dual core) with OFED-1.1,
which installs OpenMPI as well.

We are able to run some sample programs. However,
sometimes when we run a sample and press Ctrl+C to
stop it, we notice that some "orted" process is still
running and is using 100% CPU.

1. Why does this "orted" process sometimes not stop,
   and how can we avoid this?

2. We can kill it with the -9 option, but the problem is
   that while running various OpenMPI programs we see
   one "orted" for each, and we don't know which
   process is idle and can be killed.

regards,
Bala.

