When I ran the tokenisation command for the Baseline System from page 71 of the Moses SMT manual, the file "news-commentary-v7.fr-en.tok.en" could not be found in the corpus directory afterwards. I am not sure whether I used the command exactly as the manual specifies for baseline system tokenisation, and I am wondering whether there is a step I have left undone.
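One way to confirm whether the tokenisation step actually wrote its output is sketched below. This is only an illustration: `check_output` is a hypothetical helper, not part of Moses, and the corpus path is assumed from the error message quoted further down. An empty file is reported separately because, if the step writes its output through shell redirection (an assumption here), a failed run can still leave a zero-byte file behind.

```shell
#!/bin/sh
# Hypothetical helper (not part of Moses): report whether a pipeline step
# produced its output file, and whether that file has any content.
check_output() {
    f="$1"
    if [ ! -e "$f" ]; then
        echo "missing: $f"
        return 1
    elif [ ! -s "$f" ]; then
        # The file exists but is zero bytes long.
        echo "empty: $f"
        return 2
    fi
    echo "ok: $f"
}

# Directory assumed from the error message in this thread; override
# CORPUS_DIR if your corpus lives elsewhere.
CORPUS_DIR="${CORPUS_DIR:-$HOME/corpus}"
check_output "$CORPUS_DIR/news-commentary-v7.fr-en.tok.en" || true
```

If this reports "missing" or "empty", the tokenisation command probably needs to be rerun and its error output read before moving on to truecaser training.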
Clement Odoje
Department of Linguistics and African Languages
University of Ibadan, Ibadan, Nigeria
+2348032387999
What you do today becomes history tomorrow, what will you be remembered for?

--- On Fri, 23/11/12, Philipp Koehn <[email protected]> wrote:

From: Philipp Koehn <[email protected]>
Subject: Re: [Moses-support] Training error
To: "OYELEKE ODOJE" <[email protected]>
Cc: [email protected]
Date: Friday, 23 November, 2012, 1:43

Hi,

> the following error happens:
> ERROR: could not open '/home/mrodoje/corpus/news-commentary-v7.fr-en.tok.en'
> at /home/mrodoje/mosesdecoder/scripts/recaser/train-truecaser.perl line 24.

Please check the step that should have produced the output file
/home/mrodoje/corpus/news-commentary-v7.fr-en.tok.en
for what went wrong here.

> I think Moses manual 2012 about corpus preparation unzip command
>
> tar zxvf tar zxvf training-parallel.tgz (page 70)
>
> should read
>
> tar zxvf training-parallel.tgz.
>
> I found out that tar zxvf tar zxvf training-parallel.tgz will give error when
> executed.

Thanks, I fixed this in the manual.

-phi
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
