is it 'cos it's uppercased?
are you sure your input test set doesn't have the Windows return
character (0x13)? Get rid of it by doing dos2unix
On 22/01/15 17:11, CHATZITHEODOROU Konstantinos wrote:
Hello Moses,
I have trained an EN-IT phrase based MT Engine with a very high BLEU
score. I have also checked the test output and the MT quality is
pretty good.
Now, I am trying to translate another document, different from the
test set but I realised that a large number of words are left
untranslated even if in the phrase translation table these words
appear having their own entries (phrase length =1). Moreover, I
observed that most of the times, the first word of each sentence is
untranslated. I run moses decoder with -output-unknowns option.
Here are some examples:
Think and respond to the questions . - > Think e
rispondere alle domande .
I think . -> Io penso .
One more issue. The word "real" is translated as "rear" (posteriore)
in Italian. Again, I checked the phrase translation table entries and
there is no matching between these two words. The only match is in
the reordering table.
Do you have any idea what is going wrong and what I can do?
Thank you for your time.
Best,
Konstantinos
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support