Hi,

does anyone know what step 1 of the moses training script does other  
than produce the dictionaries and the numerical sentences that enable  
GIZA++ to do its job. The reason I ask is that on my machine step 1  
takes just over 70 mins for en-fr Europarl corpus.

My optimised version of data preparation and EM IBM Model 1 completes  
is 121 seconds for five iterations of EM, that's just over 2 minutes.  
Before publishing these results I just wanted to make sure there's  
nothing I've missed about step 1 of the training process. Does it do  
anything at all that influences GIZA++ other than preparing the  
digital sentences?

James

-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to