On 28/10/2015 14:20, Vincent Nguyen wrote:
> Hi there,
>
> I need some clarification before screwing up some files.
> I just set up an SGE cluster with a master + 2 nodes.
>
> To make it clear, let's say my cluster name is "default", my master
> headnode is "master", and my 2 other nodes are "node1" and "node2".
>
>
> for EMS :
>
> I opened the default experiment.machines file and I see :
>
> cluster: townhill seville hermes lion seville sannox lutzow frontend
> multicore-4: freddie
> multicore-8: tyr thor odin crom
> multicore-16: saxnot vali vili freyja bragi hoenir
> multicore-24: syn hel skaol saga buri loki sif magni
> multicore-32: gna snotra lofn thrud
>
> townhill and others are what ? name machines / nodes ? name of several
> clusters ?
> should I just put "default" or "master node1 node2" ?
I think you put 'default'. townhill, seville etc. were the names of the
master nodes in Edinburgh.
Using Moses with SGE across multiple nodes hasn't been done for a long
time, so you may encounter problems. Philipp Koehn may have started
using it again.
I also used SGE extensively a few months ago, but it runs on 1 (big) node
from start to finish. The script for it is here if you want to take a look:
scripts/ems/support/submit-grid.perl
It has hardcoded initialisation for the machine I was running on. You're
welcome to generalise it.
>
> multicore-X: should I put machine names here
> if my 3 machines are 8 cores each
> multicore-8: master node1 node2
> right ?
I think this is for running on 1 machine, rather than the cluster. I'm
not sure what the purpose of multicore-X is.
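
For reference, the experiment.machines sample quoted above looks like a
set of "label: name name ..." lines. Below is a minimal Python sketch for
reading that layout and checking which label a given name is listed under.
It is only an illustration: the function names parse_machines and
labels_for are made up here, the exact-match lookup is an assumption, and
nothing in this thread confirms that EMS itself reads the file this way.

```python
import socket

# First lines of the sample file quoted in this thread.
SAMPLE = """\
cluster: townhill seville hermes lion seville sannox lutzow frontend
multicore-4: freddie
multicore-8: tyr thor odin crom
"""

def parse_machines(text):
    """Map each label (e.g. 'cluster', 'multicore-8') to its list of names."""
    groups = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines, comments and anything without a 'label:' part.
        if not line or line.startswith("#") or ":" not in line:
            continue
        label, names = line.split(":", 1)
        groups[label.strip()] = names.split()
    return groups

def labels_for(hostname, groups):
    """Return every label whose name list contains hostname (exact match)."""
    return [label for label, names in groups.items() if hostname in names]

groups = parse_machines(SAMPLE)
print(labels_for("odin", groups))                 # ['multicore-8']
print(labels_for(socket.gethostname(), groups))   # depends on your machine
```

Running labels_for with your own host name against your own file is a
quick way to see which line, if any, a machine falls under.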
>
>
> then in the config file for EMS:
>
> #generic-parallelizer =
> $moses-script-dir/ems/support/generic-parallelizer.perl
> #generic-parallelizer =
> $moses-script-dir/ems/support/generic-multicore-parallelizer.perl
>
> Which one should I take if my nodes are multicore? Still the first one?
>
>
> ### cluster settings (if run on a cluster machine)
> # number of jobs to be submitted in parallel
> #
> #jobs = 10
> should I count approx 1 job per core on the total cores of my 3 machines ?
>
> # arguments to qsub when scheduling a job
> #qsub-settings = ""
> can this stay empty ?
>
> # project for priviledges and usage accounting
> #qsub-project = iccs_smt
> standard value ?
>
> # memory and time
> #qsub-memory = 4
> #qsub-hours = 48
> 4 what ? GB ?
>
> ### multi-core settings
> # when the generic parallelizer is used, the number of cores
> # specified here
> cores = 4
> is this ignored if generic-parallelizer.perl is chosen ?
>
>
> is there a way to put more load on one specific node ?
>
> Many thanks,
> V.
>
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
--
Hieu Hoang
http://www.hoang.co.uk/hieu