I have a modified version of moses-parallel.pl that uses the qsub -t flag to
submit child jobs as array jobs. I've verified that I get identical results
using the modified version and the current version from trunk.
Before I check this in, I would appreciate it if other could do a small test
run to verify that the modified version works the same on their systems.
Suzy, I'm especially interested in your feedback, since you're running
Torque instead of SGE.

>From your perspective as a user, there is no change in how you call
moses-parallel.pl. The changes that you should expect to see are:


* When you look at your child jobs using qstat or qmon, they will all share
the same job-ID, but will each have a unique ja-task-ID

* Child jobs will all show up with the name MOSES, instead of MOSES-aa,
MOSES-ab, etc. I tried to find a way to maintain the old naming format, but
AFAIK there's no way to do that with array job submission

* The temporary out.job* and err.job* files created during the run will end
with numeric suffixes (corresponding to the child ja-task-ID) instead of the
current alphabetic (-aa, -ab, -ac,...) suffixes. Again, I tried but was
unable to maintain the old naming scheme.


Thanks,
Lane


On Tue, Dec 14, 2010 at 4:07 PM, Lane Schwartz <[email protected]> wrote:

> I was wondering if any consideration has been given to using qsub's job
> array functionality in moses-parallel.pl.
>
> Using the qsub -t flag, jobs can be tied together, so that if the parent
> job is killed via qdel, all of the children are also killed. Currently, if a
> parallel job needs to be killed, the children must be manually deleted. This
> is OK if you only have one parallel job running, but if you have many, and
> you haven't overridden the default job name, things become hairier.
>
> I would potentially be willing to make the change, but I wanted to hear
> people's thoughts on the matter first.
>
> Cheers,
> Lane
>



-- 
When a place gets crowded enough to require ID's, social collapse is not
far away.  It is time to go elsewhere.  The best thing about space travel
is that it made it possible to go elsewhere.
                -- R.A. Heinlein, "Time Enough For Love"

Attachment: moses-parallel.pl
Description: Binary data

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to