I have a modified version of moses-parallel.pl that uses the qsub -t flag to submit child jobs as array jobs. I've verified that I get identical results using the modified version and the current version from trunk. Before I check this in, I would appreciate it if other could do a small test run to verify that the modified version works the same on their systems. Suzy, I'm especially interested in your feedback, since you're running Torque instead of SGE.
>From your perspective as a user, there is no change in how you call moses-parallel.pl. The changes that you should expect to see are: * When you look at your child jobs using qstat or qmon, they will all share the same job-ID, but will each have a unique ja-task-ID * Child jobs will all show up with the name MOSES, instead of MOSES-aa, MOSES-ab, etc. I tried to find a way to maintain the old naming format, but AFAIK there's no way to do that with array job submission * The temporary out.job* and err.job* files created during the run will end with numeric suffixes (corresponding to the child ja-task-ID) instead of the current alphabetic (-aa, -ab, -ac,...) suffixes. Again, I tried but was unable to maintain the old naming scheme. Thanks, Lane On Tue, Dec 14, 2010 at 4:07 PM, Lane Schwartz <[email protected]> wrote: > I was wondering if any consideration has been given to using qsub's job > array functionality in moses-parallel.pl. > > Using the qsub -t flag, jobs can be tied together, so that if the parent > job is killed via qdel, all of the children are also killed. Currently, if a > parallel job needs to be killed, the children must be manually deleted. This > is OK if you only have one parallel job running, but if you have many, and > you haven't overridden the default job name, things become hairier. > > I would potentially be willing to make the change, but I wanted to hear > people's thoughts on the matter first. > > Cheers, > Lane > -- When a place gets crowded enough to require ID's, social collapse is not far away. It is time to go elsewhere. The best thing about space travel is that it made it possible to go elsewhere. -- R.A. Heinlein, "Time Enough For Love"
moses-parallel.pl
Description: Binary data
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
