Is it because the first word is uppercased?
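
One rough way to test the casing idea, as a sketch only: list the tokens that are missing from the source-side vocabulary as written but present once lowercased. The `vocab` set below is a hypothetical stand-in for the source words in your phrase table, and `case_mismatches` is an illustrative helper, not part of Moses.

```python
def case_mismatches(tokens, vocab):
    """Return tokens absent from vocab as written but present once lowercased."""
    return [t for t in tokens if t not in vocab and t.lower() in vocab]


# Hypothetical vocabulary, assuming the model was trained on lowercased text.
vocab = {"think", "and", "respond", "to", "the", "questions", ".", "i"}

sentence = "Think and respond to the questions ."
print(case_mismatches(sentence.split(), vocab))
```

If the sentence-initial words show up in that list, the input probably needs the same lowercasing or truecasing that was applied to the training data.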

Are you sure your input test set doesn't contain the Windows carriage-return character (0x0D, decimal 13)? You can get rid of it by running dos2unix on the file.
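
If dos2unix is not at hand, a minimal Python sketch of the same check and clean-up could look like the following. It works on bytes so that nothing gets normalised behind your back; the function names are illustrative, and the sample data is made up.

```python
def count_crlf(data: bytes) -> int:
    """Count Windows line endings (CR LF) in the raw bytes."""
    return data.count(b"\r\n")


def strip_crlf(data: bytes) -> bytes:
    """Convert CR LF line endings to plain LF, roughly what dos2unix does."""
    return data.replace(b"\r\n", b"\n")


# Made-up sample; for a real file, read it with open(path, "rb").read().
sample = b"Think and respond to the questions .\r\nI think .\r\n"
print(count_crlf(sample))
print(strip_crlf(sample))
```

A stray carriage return stays attached to the last token on the line, so that token may no longer match its phrase-table entry.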

On 22/01/15 17:11, CHATZITHEODOROU Konstantinos wrote:

Hello Moses,

I have trained an EN-IT phrase-based MT engine with a very high BLEU score. I have also checked the test output and the MT quality is pretty good.

Now I am trying to translate another document, different from the test set, but I realised that a large number of words are left untranslated even though they have their own entries in the phrase translation table (phrase length = 1). Moreover, I observed that most of the time the first word of each sentence is left untranslated. I run the Moses decoder with the -output-unknowns option.

Here are some examples:


Think and respond to the questions . -> Think e rispondere alle domande .

I think . -> Io penso .


One more issue: the word "real" is translated as "rear" (posteriore) in Italian. Again, I checked the phrase translation table entries and there is no entry that pairs these two words. The only match is in the reordering table.


Do you have any idea what is going wrong and what I can do?


Thank you for your time.


Best,

Konstantinos



_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
