Hi all, I was wondering if it was possible to get GE to output an error message to the stderr file in response to a job being killed due to it exceeding a resource request?
Currently, we have an open doors policy on runtime (ie default h_rt=INFINITY) which is playing havoc with a) long jobs filling up the cluster and precluding short jobs from running (alleviated inefficiently with the introduction of a 'short' queue), and b) preventing efficient resource reservation for parallel SMP jobs. I'd therefore like to change the default time to 30mins, and have users explicitly request more time if they need it. However, I'm worried that the default position of killing jobs with a SIGKILL will confuse users. PBS Pro prints out a message to stderr to tell you why your job was killed (memory, time, io etc exceeded request): is there anything like this in GE I can use? Thanks, Chris _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
