[Moses-support] questions

Roberto Rios Wed, 26 Jan 2011 07:51:48 -0800

hello..I finished installing giza++ and moses...it runs good. now i am
proceeding to install mgiza for multithread.....I am having a file issues.


1. in "http://www.statmt.org/wmt07/baseline.html";


   - Copy GIZA++ and mkcls to a bin location for Moses Scripts
    mkdir -p bin
   cp GIZA++-v2/GIZA++ bin/
   cp GIZA++-v2/snt2cooc.out bin/
   cp mkcls-v2/mkcls bin/

      1.1)  where is it i have to copy mgiza, mkcls, mergealignment.py an
snt2cooc?
      1.2) Do i have to replace the old mkcls and snt2cooc.out for the new
ones comming with mgiza?
      1.3) Is there a difference between snt2cooc and snt2cooc.out?

2. the corpus that is been tokenized for the LM; is it the same corpus as
the english corpus?

3. Does tunning takes longer than training?..it took my server a couple of
days for tunning and 4 hours of training....would the time for tunning get
better after the first run?

4. how do i feed directories of corpuses into my system?. I am able to run
the tutorial already mention, but that is only one corpus,, i have a lot of
corpuses organized in directories...trying to do one by one would be a
killer.

5. if i get anew corpus do I need to run training and tunning all again...it
seems that training uses old trained and merges the new corpus into it..is
that correct?

6. I have the last version of moses....the only script i have is
train-model.perl...but for what i read is better to do
train-factored-phrase-model.perl...i
do not have it oin my moses or scripts/2011....../training

thank you
Roberto Rios

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

[Moses-support] questions

Reply via email to