Hi all, Sorry if this question is answered somewhere, but I can't seem to find what I need in the archives or in the moses documentation (or in my perusal of the relevant files of the code).
What I want to know is what the scores in the lexicalized reordering tables correspond to (precisely, I mean -- I know they are smoothed MSD relative frequencies). What I mean is, does: blah ||| blee ||| 0.1 0.2 0.3 0.4 0.5 0.6 mean that blah<->blee is 0.1 likely to remain monotone wrt the previous phrase (backward monotone relative freq), or wrt the next phrase (forward monotone relative freq)? Does 0.5 refer to a swap relative frequency wrt the previous phrase or the next? Is 0.6 'other' (discontiguous) wrt the previous or next phrase, etc., etc.? It seems to me (from actually computing relative frequencies) that the scores are: blah ||| blee ||| <mono-backward> <swap-backward> <other-backward> <other-forward> <swap-forward> <mono-forward> but that doesn't make much sense. E.g., why are they reversed? Anyhow, I'd like to know what they are so that I can plug in my own hand-rolled reordering tables. Oh, and while I'm on it, does anyone have an opinion on whether "word-based" reordering extraction works better or worse or is generally indistinguishable from "phrase-based" reordering extraction. Best, Dennis
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
