Hi Joe

You could also look at the entropy of the distribution. I'll leave Matt to post the one-liner for that one,

cheers - Barry

On 13/05/16 15:10, Matt Post wrote:
gzip -cd model/phrase-table.gz | cut -d\| -f1 | sort | uniq -c | sort -nr | head -n5

(according to one definition of "ambiguous")

On May 11, 2016, at 2:53 AM, Joe Jean <[email protected] <mailto:[email protected]>> wrote:

Hello,

How would you go about finding the top 5 most ambiguous words in a translation system just by looking at the phrase table and the lexical translation tables? Thanks.

<http://www.facebook.com/joe.jean3>
_______________________________________________
Moses-support mailing list
[email protected] <mailto:[email protected]>
http://mailman.mit.edu/mailman/listinfo/moses-support



_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to