If you srun a job, then there is no "mpirun" to provide a proc_table. So 
running a  job directly via srun means you cannot run TV on it.


On Feb 10, 2011, at 8:34 AM, Nikolay Piskun wrote:

>  
>    Hi,
> I am trying to use Totalview with srun and hit interesting problem. Looks 
> like if OMPI is started by “srun   –mpi=ompi ”, mpi job is hang in 
> ompi_wait_for_debugger() subroutine. What happen, I think is ompi was 
> compiled without ORTE_DISABLE_FULL_SUPPORT and as result rank 0 is waiting 
> for message from HNP (by the way what is HNP?)  that was supposed to be send 
> by orterun. The problem is that orterun was never invoked because MPI was 
> initiated by srun, not orterun.  So what is the solution? Should we always 
> compile OMPI with  ORTE_DISABLE_FULL_SUPPORT=true for anything that uses 
> different starters like srun from SLURM?
> Thanks
> Nikolay
>  
> Nikolay Piskun | Director of Continuing Engineering | Totalview Technologies |
> Rogue Wave Software Inc  |  24 Prime Parkway, Natick, MA 01760 | p 
> 508-652-7739|
> nikolay.pis...@roguewave.com
> www.roguewave.com
>  
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel

Reply via email to