Hi all,

Sorry if this question is answered somewhere, but I can't seem to find what
I need in the archives or in the moses documentation (or in my perusal of
the relevant files of the code).

What I want to know is what the scores in the lexicalized reordering tables
correspond to (precisely, I mean -- I know they are smoothed MSD relative
frequencies).

What I mean is, does:

blah ||| blee ||| 0.1 0.2 0.3 0.4 0.5 0.6

mean that blah<->blee is 0.1 likely to remain monotone wrt the previous
phrase (backward monotone relative freq), or wrt the next phrase (forward
monotone relative freq)?  Does 0.5 refer to a swap relative frequency wrt
the previous phrase or the next?  Is 0.6 'other' (discontiguous) wrt the
previous or next phrase, etc., etc.?  It seems to me (from actually
computing relative frequencies) that the scores are:

blah ||| blee ||| <mono-backward> <swap-backward> <other-backward>
<other-forward> <swap-forward> <mono-forward>

but that doesn't make much sense. E.g., why are they reversed? Anyhow, I'd
like to know what they are so that I can plug in my own hand-rolled
reordering tables.

Oh, and while I'm on it, does anyone have an opinion on whether
"word-based" reordering extraction works better or worse or is generally
indistinguishable from "phrase-based" reordering extraction.

Best,
Dennis
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to