Hi~^^, I'm trying to run SIESTA on two nodes (one node has three
process and the other 6 process).



I constructed diskless cluster which has main node (white) and on
slave node (n0000).

and also using mpich.



When i use only one or another node, siesta work well.



But the error occurs when i use both processes of nodes

Below are the command and host file i used.

============================

mpirun -np 6 -f host_file siesta < h2o.fdf | tee h2o.out


[host_file]

white:3

n0000:3

============================



Below is the error message

============================

Fatal error in PMPI_Comm_split: Other MPI error, error stack:
PMPI_Comm_split(418)..............: MPI_Comm_split(MPI_COMM_WORLD, color=2,
key=3, new_comm=0xd7e5a0) failed PMPI_Comm_split(400)..............:
MPIR_Comm_split_impl(88)..........: MPIR_Allgather_impl(744)..........:
MPIR_Allgather(705)...............: MPIR_Allgather_intra(177).........:
MPIC_Sendrecv(186)................: MPIC_Wait(528)....................:
MPIDI_CH3I_Progress(335)..........: MPID_nem_mpich2_blocking_recv(891):
MPID_nem_tcp_connpoll(1830).......: Communication error with rank 2:

============================



I think the broadcast or slit method not working well for each node.

What's the matter? and how can i solve the problem?

Responder a