thanks Germán
i know this is just warning, but the problem after this warning is conversion 
process stoped.
the size of the phrasetable is 350MB, whereas the binary-phrasetable is 170MB?


Well... this *may* be normal... is the original phrasetable filtered according to the test data? (i.e. using the filter-model-given-input.pl script) I am not much of an expert at binarizing the tables myself, but I would guess that, if the binarizer detects out-of-vocabulary words, it might also be able to do some filtering. If nobody else answers to this question, try using the filter-model-given-input.pl command, and then binarize the resulting phrase table and see if that covers the size mismatch to some extent. Note that such two commands would be the same as using the filter-and-binarize-model-given-input.pl script, which would do both at the same time, but we would not be able to check the sizes.

Best,

Germán Sanchis-Trilles
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to