Hi Nick
The best solution is to use the compact phrase table, and for this just add
ttable-binarizer = $moses-bin-dir/processPhraseTableMin
to the general section.
If you need to use the ondisk phrase table (sparse features, properties
etc.) then replace the above with
ttable-binarizer = "$moses-bin-dir/CreateOnDiskPt 1 1 4 100 2"
where the "4" is the number of dense features in your phrase table.
It's a bit strange that binarize-all gives you
PhraseDictionaryBitextSampling, maybe that's the default these days?
Anyway, if you just want to decode a dev & test set then you don't want
to set binarize-all
cheers - Barry
On 25/11/15 01:44, Nicholas Ruiz wrote:
Hi all,
I'm a bit behind on my moses versions and I'm using EMS for the first
time. I trained a toy model, which gave me a PhraseTableMemory
translation table. I'd like to binarize the phrase table and
reordering models. I'm still operating back in the PhraseTableBinary
days, but obviously the codebase has changed quite a bit since then.
How do I binarize the phrase table as a PhraseDictionaryOnDisk? I had
tried uncommenting the binarize-all setting, but that gave me
a PhraseDictionaryBitextSampling. However, I don't need to do
incremental training -- and the tuning phase is failing anyway. Help
would be appreciated about how to do a simple binarization in EMS.
Thanks!
Nick
zınɹ ʞɔıu
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support