Hi all,
I trained the alignment model on the same data with the same parameters using
GIZA and MGIZA respectively. The training corpus includes 200K sentences. My
server has an Intel Quad CPU i4790K which has 4 cores and each core has 2
threads. It costs 2905 seconds for GIZA. But it costs 5259 seconds for MGIZA
with 3 threads. I think MGIZA is much faster than GIZA. But I got bad result. I
do not know the reason is the compile way or others.
Does anyone has relative experience? Thanks.
The following is the training command for MGIZA. And the training data is the
FBIS zh-en data. But I can not public the data because of copyright.
${mosesScript}/training/train-model.perl \
--external-bin-dir "${binDir}" \
--root-dir "${trainDir}" \
--corpus train \
--f src \
--e ref \
--alignment grow-diag-final-and \
--parallel \
--first-step 1 \
--last-step 3 \
--mgiza --mgiza-cpus 3
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support