Hello,

Has anyone tried to correlate the sentence score returned by Moses (ex: 
[total=-15.451]) with the post-editing effort?

Anything that I have tried so far is a dead end: the Moses score correlates 
with none of: edit time, edit distance, keystrokes, as recorded on actual 
post-editing work performed by professional translators. The only thing that 
shows some sort of linear correlation, although badly (R^2 = 0.5), is the 
sentence length, but that we know (the longer the sentence, the lower the 
score).

Using the sentence score as a confidence indicator is a tempting idea, but if 
it does not correlate with anything in relation with the "value" of the 
machine-translated sentence (in a context where m-translations are reviewed by 
human post-editors), I wonder if this indication can be useful at all.

Did someone already looked at those things?
What do you guys think?


François Masselot
Global Content - Machine Translation

CMS & Language Technologies
Localization Services
Autodesk Neuchatel
+41 32 723 94 60

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to