Hi,

I'm trying to run a system through the EMS where all of the preprocessing (tokenization, lowercasing) has already been done for all of the training, tuning and evaluation data. The intermediate files are not available, so I only provide the final lowercased data. In my config file I have, for example:

    [CORPUS:combined]
    lowercased-stem = $wmt10preproc-data/training/lowercased

where the directory $wmt10preproc-data/training contains two files, lowercased.de and lowercased.en. The variables raw-stem, tokenized-stem and clean-stem are not set.

However, when I run the system, it looks like it's still trying to run the get-corpus/tokenize/clean steps - it produces files like steps/1/CORPUS_combined_get-corpus.1* which contain error messages about not being able to find files.

What am I missing?

Thanks,
Suzy

-- 
Suzy Howlett
http://web.science.mq.edu.au/~showlett/
