hello..I finished installing giza++ and moses...it runs good. now i am proceeding to install mgiza for multithread.....I am having a file issues.
1. in "http://www.statmt.org/wmt07/baseline.html" - Copy GIZA++ and mkcls to a bin location for Moses Scripts mkdir -p bin cp GIZA++-v2/GIZA++ bin/ cp GIZA++-v2/snt2cooc.out bin/ cp mkcls-v2/mkcls bin/ 1.1) where is it i have to copy mgiza, mkcls, mergealignment.py an snt2cooc? 1.2) Do i have to replace the old mkcls and snt2cooc.out for the new ones comming with mgiza? 1.3) Is there a difference between snt2cooc and snt2cooc.out? 2. the corpus that is been tokenized for the LM; is it the same corpus as the english corpus? 3. Does tunning takes longer than training?..it took my server a couple of days for tunning and 4 hours of training....would the time for tunning get better after the first run? 4. how do i feed directories of corpuses into my system?. I am able to run the tutorial already mention, but that is only one corpus,, i have a lot of corpuses organized in directories...trying to do one by one would be a killer. 5. if i get anew corpus do I need to run training and tunning all again...it seems that training uses old trained and merges the new corpus into it..is that correct? 6. I have the last version of moses....the only script i have is train-model.perl...but for what i read is better to do train-factored-phrase-model.perl...i do not have it oin my moses or scripts/2011....../training thank you Roberto Rios
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
