can you provide the training corpus so I can verify your results?

On 16 January 2015 at 15:53, Li Xiang <[email protected]> wrote:

> Hi all,
>
> I trained the alignment model on the same data with the same parameters
> using GIZA and MGIZA respectively. The training corpus includes 200K
> sentences. My server has an Intel Quad CPU i4790K which has 4 cores and
> each core has 2 threads. It costs 2905 seconds for GIZA. But it costs 5259
> seconds for MGIZA with 3 threads. I think MGIZA is much faster than GIZA.
> But I got bad result. I do not know the reason is the compile way or others.
>
> Does anyone has relative experience? Thanks.
>
> The following is the training command for MGIZA. And the training data is
> the FBIS zh-en data. But I can not public the data because of copyright.
>
>
> ${mosesScript}/training/train-model.perl \
>  --external-bin-dir "${binDir}" \
>  --root-dir "${trainDir}"  \
>  --corpus train \
>  --f src \
>  --e ref \
>  --alignment grow-diag-final-and \
>  --parallel \
>  --first-step 1 \
>  --last-step 3 \
>  --mgiza --mgiza-cpus 3
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>



-- 
Hieu Hoang
Research Associate
University of Edinburgh
http://www.hoang.co.uk/hieu
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to