Dear Vineet
one more thing about parallelization
1) with moses-parallel.pl you can produce "in parallel"
the translations of the many input sentences
and the corresponding nbest, as explained by Marcello.
At the moment you can NOT generate with parallelization
search and word-graphs, but I am working for this.
best,
Nicola
________________________________________
From: [EMAIL PROTECTED] [EMAIL PROTECTED] On Behalf Of Marcello Federico [EMAIL
PROTECTED]
Sent: Sunday, April 20, 2008 11:47 AM
To: Vineet Kashyap; [email protected]
Subject: [Moses-support] R: Running Moses In Parallel
Hi Vineet,
think are much simpler than you can immagine.
1,2. Parallel minimum error training and parallel test scripts basically
split the test set into batches that are translated on different
machines with the same run time code. Hence, there is no
parallelism at the level of single sentences.
7. Parallel word-alignment training uses two distinct processes to training
source-to-target
and targe-to-source alignments.
3,4 It would be interesting to study some parallel implementation of the
search algorithm, but I would suggest to start with some simpler model
to check how much you can gain.
5,6 yes, you just need a sample of human made translations, the larger the
better.
The amount of data actually depends on the difficulty of the task, namely
distance of
languages and vocabulary size. Limited domain tasks, like traveling expressions
(see the IWSLT tasks http://www.slc.atr.jp/IWSLT2008) can be approached with
parallel corpora of 40K sentence pairs. Translation of Europarl or Chinese
news require working with several millions of sentence pairs.
Best,
Marcello
________________________________________
Da: [EMAIL PROTECTED] [EMAIL PROTECTED] per conto di Vineet Kashyap [EMAIL
PROTECTED]
Inviato: sabato 19 aprile 2008 16.29
A: [email protected]
Oggetto: [Moses-support] Running Moses In Parallel
Hello users
I am new to moses and will be using it for my research.
Out of curiosity i needed answers to the following questions:
1. Which parts are made parallel when moses is run on 'n' processors?
2. What happens to the input sentence and what does each processor do
in terms of computation? searching, assigning probability weights ?
3. Can we modify moses-parallel.pl to improve parallelization?
4. Can we use MPICH for parallel implementation?
5. How big the parallel corpora should be to get accurate results? how many
sentences/words?
6. Parallel Corpora consists of the text in source language along with
the translation in the target language. Is that all you need ?
7. Also, while training large data the --parallel option can be used.
Again can we use mpich and which parts are made parallel?
I know these are lot of questions.But it would be highly appreciated
if some one takes the time to answer these.
Thanks in advance.
Regards
Vineet
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support