Hi Nick

The best solution is to use the compact phrase table. For this, just add

ttable-binarizer = $moses-bin-dir/processPhraseTableMin

to the [GENERAL] section of your EMS config.

If you need to use the on-disk phrase table (for sparse features, properties, etc.), then replace the above with

ttable-binarizer = "$moses-bin-dir/CreateOnDiskPt 1 1 4 100 2"

where the "4" is the number of dense features in your phrase table.
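
To make the two options concrete, here is roughly how they might sit in an EMS config. This is only a sketch: the moses-bin-dir path is a placeholder, and the meaning of the other CreateOnDiskPt arguments is given from memory, so check it against the tool's usage message before relying on it.

```
[GENERAL]
# placeholder path, adjust to your own installation
moses-bin-dir = /path/to/mosesdecoder/bin

# option 1: compact phrase table
ttable-binarizer = $moses-bin-dir/processPhraseTableMin

# option 2: on-disk phrase table (use instead of option 1, not alongside it)
# arguments, if I remember the order correctly: source factors, target factors,
# number of dense scores, table limit, index of the score to sort by
#ttable-binarizer = "$moses-bin-dir/CreateOnDiskPt 1 1 4 100 2"
```

Only one ttable-binarizer line should be active at a time. If your phrase table has a different number of dense features, change the "4" to match.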

It's a bit strange that binarize-all gives you PhraseDictionaryBitextSampling. Maybe that's the default these days? Anyway, if you just want to decode a dev and test set, then you don't want to set binarize-all.

cheers - Barry



On 25/11/15 01:44, Nicholas Ruiz wrote:
Hi all,

I'm a bit behind on my moses versions and I'm using EMS for the first time. I trained a toy model, which gave me a PhraseTableMemory translation table. I'd like to binarize the phrase table and reordering models. I'm still operating back in the PhraseTableBinary days, but obviously the codebase has changed quite a bit since then.

How do I binarize the phrase table as a PhraseDictionaryOnDisk? I tried uncommenting the binarize-all setting, but that gave me a PhraseDictionaryBitextSampling. However, I don't need to do incremental training -- and the tuning phase is failing anyway. I'd appreciate help with how to do a simple binarization in EMS.

Thanks!
Nick

zınɹ ʞɔıu


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.