Re: [OMPI users] HRM problem

2012-04-24 Thread Syed Ahsan Ali
I am not familiar with attaching a debugger to the processes. The other things you asked are as follows: Is this the first time you've run it (with Open MPI? with any MPI?) *No. We have been running this and other models, but this problem has arisen now.* How many processes is the job using? Are you

Re: [OMPI users] HRM problem

2012-04-24 Thread TERRY DONTJE
To determine whether an MPI process is waiting for a message, do what Rayson suggested: attach a debugger to the processes and see if any of them are stuck in MPI, either internally in an MPI_Recv or MPI_Wait call or looping on an MPI_Test call. Other things to consider: Is this the first time

Re: [OMPI users] HRM problem

2012-04-24 Thread Syed Ahsan Ali
Dear Rayson, It is a numerical model written by the national weather service of a country. The logs of the model show every detail of the simulation's progress. I have checked on the remote nodes as well: the application binary is running, but the logs show no progress; it is just waiting at

Re: [OMPI users] HRM problem

2012-04-24 Thread Rayson Ho
Seems like there's a bug in the application. Did you or someone else write it, or did you get it from an ISV? You can log onto one of the nodes, attach a debugger, and see if the MPI task is waiting for a message (looping in one of the MPI receive functions)... Rayson
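
For anyone unfamiliar with attaching a debugger, the check suggested in this thread can be sketched roughly as follows. This is only an illustration, not a tested procedure for this cluster: it assumes gdb is installed on the compute node and that you know the PID of one stuck rank. The helper names and the sample backtrace are made up for the example; the MPI call names are the ones mentioned in the thread.

```python
import re

# MPI calls named in this thread as places where a hung rank might be sitting.
BLOCKING_CALLS = ("MPI_Recv", "MPI_Wait", "MPI_Test")


def gdb_backtrace_command(pid):
    """Build a gdb command line that attaches to one process, prints the
    backtrace of every thread, and exits (hypothetical helper)."""
    return ["gdb", "-p", str(pid), "-batch", "-ex", "thread apply all bt"]


def mpi_calls_in_backtrace(backtrace):
    """Return which of BLOCKING_CALLS appear as whole symbols in the text.

    The optional leading 'P' also accepts the PMPI_ profiling names.
    """
    found = []
    for name in BLOCKING_CALLS:
        if re.search(r"\bP?" + name + r"\b", backtrace):
            found.append(name)
    return found


# Made-up backtrace text, for illustration only (not real output from HRM).
SAMPLE_BACKTRACE = """\
#0  0x0000000000000001 in opal_progress () from libopen-pal.so
#1  0x0000000000000002 in PMPI_Recv () from libmpi.so
#2  0x0000000000000003 in main ()
"""

if __name__ == "__main__":
    print(" ".join(gdb_backtrace_command(12345)))
    print(mpi_calls_in_backtrace(SAMPLE_BACKTRACE))
```

If the same MPI call shows up in the backtrace of a rank every time you look, that rank is probably waiting for a message that never arrives. Checking which rank is not inside MPI at all may then point to where the application itself has stalled.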

[OMPI users] HRM problem

2012-04-24 Thread Syed Ahsan Ali
Dear All, I am having a problem running an application on a Dell cluster. The model starts well but shows no further progress; it just gets stuck. I have checked the systems and there is no apparent hardware error. Other Open MPI applications are running well on the same cluster. I have tried