In the past I've never been able to get the training script to run to 
completion without rigorously following the instructions here 
http://www.statmt.org/moses/?n=moses.baseline



1) Tokenise

2) Train truecaser

3) Truecase

4) Clean


What if somebody wants to just tokenize and clean without truecasing or just 
clean without tokenizing? Why should the script bomb out? Is this something to 
do with formats required by early stages of the training process?


James


NOTE: This is not an open invitation to discuss why somebody would want to 
train models without tokenzing or truecasing. This is nothing more than a 
request for technical assistance.
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to