Hello all, I am experiencing an odd issue where my head node can see the compute node, but the compute node cannot see the head node. If I run “sinfo” on the head node, I see both nodes in the idle state, but I can’t run sinfo on the compute node. If I look at the head node’s logs, I see no issues, and I see entries like “node_did_resp compute0”. But if I look at the compute node’s log, I see “slurm connect failed: no route to host”. I am using the IP addresses that I assigned to the nodes in my IPoIB config, and I know these IPs work normally (I can ssh, scp, and ping with them), but for some reason the compute node does not see the head node.
Does anyone have any idea what the issue might be? Thanks, Trevor
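
Since ping and ssh succeed while Slurm reports "no route to host", one way to narrow this down might be to test a plain TCP connection from the compute node to the port slurmctld listens on. The sketch below is only an illustration under stated assumptions: the address `192.0.2.10` is a placeholder for the head node's IPoIB address, and 6817 is Slurm's default `SlurmctldPort`, which may differ from the value in your slurm.conf. The helper name `check_tcp` is hypothetical.

```python
import errno
import socket

# Assumptions (not from the original post): both values are placeholders.
# 6817 is Slurm's default SlurmctldPort; check slurm.conf for the real one.
HEAD_NODE_ADDR = "192.0.2.10"  # hypothetical IPoIB address of the head node
SLURMCTLD_PORT = 6817


def check_tcp(host: str, port: int, timeout: float = 3.0) -> str:
    """Attempt a TCP connection and return a short description of the result."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except socket.timeout:
        # No answer at all within the timeout (packets may be silently dropped).
        return "timeout"
    except ConnectionRefusedError:
        # The host answered, but nothing is listening on that port.
        return "refused"
    except OSError as exc:
        if exc.errno == errno.EHOSTUNREACH:
            # Same error text that appears in the compute node's log.
            return "no route to host"
        return f"error: {exc}"


# Example call, to be run on the compute node:
# print(check_tcp(HEAD_NODE_ADDR, SLURMCTLD_PORT))
```

If this returns "no route to host" for the Slurm port while the same call against port 22 returns "open", the routing itself is probably fine and something on the head node, such as a firewall rule that rejects the connection, may be blocking that specific port. That is only one possible explanation.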
