Hi Marcin, I got a false positive with a compact phrase table (Friday's git revision 4813df). I can't find any hint in the documentation about the format being probabilistic, so I'm assuming it's a bug. Here's how I compacted the model:
~/mosesdecoder/bin/processPhraseTableMin -in /path/to/model/phrase-table.gz -out /path/to/model/phrase-table -threads 12 -alignment-info -encoding None I don't know if you'll be able to reproduce it (with my compacted model, it's reproducible, but I don't know if cmph is deterministic), but here's the model (2MB): https://docs.google.com/open?id=0B5nFI-wykGgndFF6aFQycEZmVnc sennrich@home ~$ echo "to the European Community" | mosesdecoder/bin/queryPhraseTableMin -t phrase-table to the European Community ||| rechts von Ersetzung ||| 0.5 0.00872443 1 0.000228467 2.718 "to the European Community" does not exist in the phrase table, and this phrase pair seems to be used: to the right of Replacement ||| rechts von Ersetzung ||| 0.5 0.00872443 1 0.000228467 2.718 ||| 1-0 2-0 3-1 4-2 ||| 2 1 1 best wishes, Rico _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
