On 06/07/16 04:11, Andrew J. Prout wrote:

> [2016-07-05T13:28:55.175] batch_stepd_step_rec_create() failed: Group ID
> not found on host

Looking at the code, that error appears to be triggered by the C library function getgrgid() not finding the group in question. Do these same groups exist on the compute nodes too?

It should be possible to add errno to that error message to find out why getgrgid() failed.

Best of luck!
Chris
-- 
Christopher Samuel
Senior Systems Administrator
VLSCI - Victorian Life Sciences Computation Initiative
Email: sam...@unimelb.edu.au
Phone: +61 (0)3 903 55545
http://www.vlsci.org.au/
http://twitter.com/vlsci
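
As a rough illustration of the errno suggestion above, here is a minimal standalone sketch. It is not Slurm's code: the helper name describe_gid_lookup() and its message wording are made up for this example. It clears errno before calling getgrgid() and then reports whether the lookup found the group, found nothing, or failed with an error. Note that some C libraries set errno (for example ENOENT) even for a plain "not found", so the value is best treated as a hint rather than a definitive answer.

```c
#include <errno.h>
#include <grp.h>
#include <stdio.h>
#include <string.h>
#include <sys/types.h>

/*
 * Hypothetical helper, not part of Slurm: look up gid and write a
 * diagnostic message into buf.
 * Returns 0 if the group exists, 1 if getgrgid() returned NULL with
 * errno left at 0 (no matching entry), -1 if errno was set.
 */
int describe_gid_lookup(gid_t gid, char *buf, size_t len)
{
    struct group *gr;
    int saved_errno;

    errno = 0;              /* so a stale value is not mistaken for a new error */
    gr = getgrgid(gid);
    saved_errno = errno;    /* keep it before any other library call runs */

    if (gr != NULL) {
        snprintf(buf, len, "gid %lu is group %s",
                 (unsigned long)gid, gr->gr_name);
        return 0;
    }
    if (saved_errno == 0) {
        snprintf(buf, len, "gid %lu: no matching group entry on this host",
                 (unsigned long)gid);
        return 1;
    }
    snprintf(buf, len, "gid %lu: lookup failed: %s",
             (unsigned long)gid, strerror(saved_errno));
    return -1;
}
```

Calling this on a compute node with the job's GID and printing buf should show whether the group is simply missing there or whether the lookup itself is failing (for instance because a directory service is unreachable).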