Hi ,

Please suggest me the Dataset for production quality translations.
I have tried with Europarl-en-fr-(V7) data (~400MB) with Trigram-KenLM
model i got BLEU score of 24.7.

But results are not satisfactory.

My questions are,
a) Which data set i should use for production quality translation or What
should be size of dataset ?
b) Do i need to consider other approaches over phrase based decoding such
as *hierarchical or factored ?*

I really appreciate if you can give me a head start.

Thanks & Regards,
Alind Billore

-- 
Regards,
Alind Billore
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to