Hi, [Its my first mail to the slurm-dev mailing list.]
I am getting the following error while submitting the job to the Slurm - Error : This is observed on both - master as well as slave -------------------------------------------------------- mybin: bind: resource busy (Address already in use) srun: error: machine1: tasks 0-1,3: Exited with exit code 1 ---------------------------------------------------- Parameters of job submitted to SLURM - -------------------------------------- #SBATCH --time=00:02:00 #SBATCH --exclusive #SBATCH --ntasks=4 #SBATCH --cpus-per-task=1 #SBATCH --nodes=4 #SBATCH --ntasks-per-node=1 #SBATCH --mem-per-cpu=128 srun -p partition1 --nodelist=machine1 --exclusive ./mybin slave <IP address> <port no.> & srun -p partition1 --nodelist=machine2 --exclusive ./mybin master <IP address> <port no.> ----------------------------------- Where, mybin = haskell binary. Program - mybin - In Cloud haskell, communication between nodes achieved by using master-slave model (Other models are also there). Here, out of 4 allocated nodes, one will behave as master and remaining as slave and do further computation. Please let me know, if you need further details. Thanks in advance, Jaya
