You can try using the KenLM estimation tool instead:
http://www.statmt.org/moses/?n=FactoredTraining.BuildingLanguageModel#ntoc20
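As a rough sketch of what that could look like: KenLM's estimator is lmplz, which reads tokenized text on stdin and writes an ARPA file to stdout, and build_binary converts the ARPA file into a binary format that loads faster. The location of the binaries (MOSES_BIN), the corpus file name (CORPUS) and the output names below are assumptions, not details from this thread, so adjust them to your setup.

```shell
#!/bin/sh
# Sketch only: MOSES_BIN, CORPUS, ARPA and BINARY are assumed names, adjust as needed.
MOSES_BIN="${MOSES_BIN:-$HOME/mosesdecoder/bin}"
CORPUS="${CORPUS:-corpus.tok.txt}"
ARPA="BanglaFinal.arpa"
BINARY="BanglaFinal.blm"
ORDER=3

if [ -x "$MOSES_BIN/lmplz" ] && [ -r "$CORPUS" ]; then
    # Estimate an n-gram model of the given order, then binarize it.
    "$MOSES_BIN/lmplz" -o "$ORDER" < "$CORPUS" > "$ARPA" &&
    "$MOSES_BIN/build_binary" "$ARPA" "$BINARY" &&
    LM_STATUS="built" || LM_STATUS="failed"
else
    # Nothing is run if the tool or the corpus cannot be found.
    echo "lmplz or corpus not found; check MOSES_BIN and CORPUS" >&2
    LM_STATUS="skipped"
fi
echo "language model step: $LM_STATUS"
```

This route does not go through the dictionary-splitting step that is failing in your log.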
On 21/09/2015 13:38, Shubham Tripathi wrote:
I have been following your tutorial on building a language model and
I am having difficulty with it. The web page I am referring to is
http://www.statmt.org/moses/?n=Moses.Baseline
When building the final model, the following error occurs:
Cleaning temporary directory /tmp
Warning: some temporary files could not be removed
Extracting dictionary from training corpus
Splitting dictionary into 3 lists
Extracting n-gram statistics for each word list
Important: dictionary must be ordered according to order of appearance of words in data
used to generate n-gram blocks, so that sub language model blocks results ordered too
dict.*
$bin/ngt -i="$inpfile" -n=$order -gooout=y -o="$gzip -c > $tmpdir/ngram.${sdict}.gz" -fd="$tmpdir/$sdict" $dictionary -iknstat="$tmpdir/ikn.stat.$sdict" >> $logfile 2>&1
Estimating language models for each word list
ls: cannot access /tmp/dict.*: No such file or directory
Merging language models into BanglaFinal.txt
Cleaning temporary directory /tmp
The output file is not created in any of the directories. I am unable to
understand the "cannot access /tmp/dict.*: No such file or directory"
error. Also, on checking the directory 'tmp', I find no subdirectory
named 'dict'.
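
A note on the error text itself: dict.* is a shell glob for files whose names start with "dict." (presumably the split dictionary lists from the earlier step), not a subdirectory called dict. When no file matches, the shell passes the pattern to ls unchanged and ls reports exactly this message. A minimal sketch of that behaviour, using a scratch directory and a made-up file name dict.000:

```shell
#!/bin/sh
# Scratch directory so nothing real is touched; dict.000 is a hypothetical file name.
workdir=$(mktemp -d)

# No file matches the pattern: ls receives the literal "dict.*" and fails.
if ls "$workdir"/dict.* >/dev/null 2>&1; then before="found"; else before="missing"; fi

# Once a file whose name starts with "dict." exists, the same pattern matches.
: > "$workdir/dict.000"
if ls "$workdir"/dict.* >/dev/null 2>&1; then after="found"; else after="missing"; fi

echo "before: $before, after: $after"
rm -rf "$workdir"
```

So the absence of a 'dict' subdirectory is expected; what seems to be missing are the dict.* files that the splitting step should have written to /tmp.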
Regards,
Shubham Tripathi
Pre-Final Year, Electrical Engineering Department
National Institute of Technology, Jaipur, India - 302017
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
--
Hieu Hoang
http://www.hoang.co.uk/hieu