David,

Your message bounced because it matched one of the spam filters.

The problem appears to be either a bad network configuration in slurm.conf or an internal firewall issue. There is a troubleshooting guide online that should help you:
https://computing.llnl.gov/linux/slurm/troubleshoot.html#nodes
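As a rough first check of the network or firewall possibility, the sketch below tests whether the Slurm daemons' TCP ports can be reached at all. It assumes the default ports (6817 for slurmctld, 6818 for slurmd); if your slurm.conf sets SlurmctldPort or SlurmdPort differently, use those values instead. The function name port_open is only an illustration and is not part of Slurm.

```python
import socket

# Assumed defaults for SlurmctldPort and SlurmdPort; override them if
# slurm.conf on your cluster sets different values.
SLURMCTLD_PORT = 6817
SLURMD_PORT = 6818


def port_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

Run from a node shown as down*, port_open("head-node", SLURMCTLD_PORT) tests the path to the controller; run from the controller, port_open("compute-node-01", SLURMD_PORT) tests the path back to the node. Both host names are placeholders. A False result in either direction points toward a firewall, routing, or host-name problem rather than toward Slurm itself.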
Thanks,

I installed Slurm on another cluster, and when I run sinfo some nodes are listed with state down* (with an asterisk). When I log in to such a node and run sinfo there, I get the following:

sinfo: error: slurm_receive_msg: zero bytes were transmitted or received
slurm_load_partitions: zero bytes were transmitted or received

I tried power_up on the node; its state changes to idle*, and after 2-3 minutes it goes back to down* (again with the asterisk). I uninstalled and reinstalled Slurm, but the result is still the same.

How can I fix it?

Thanks,
_________________________________________________
Dr. David Touati
Modeling & Simulation Director of Computational Ballistics & Simulation Group
Central Laboratory Division
IMI, Israel xilitary Industries
P.O.B 1044, Ramat Hahsharon
Israel, 47100
Tel: + 972 3 5486717, +972 52 3678427
Fax: + 972 3 5486523
_________________________________________________
