[Moses-support] creating LM with IRST toolkit

2011-11-30 Thread Hieu Hoang
hi all

can anyone tell me if creating LM with the IRST toolkit is integrated into
the EMS yet?

if not, is this the entirety of what has to be run?
  cat $CORPUSFILE | $IRSTLM/bin/add-start-end.sh | gzip -c 
temp/monolingual.setagged.gz
  $IRSTLM/bin/build-lm.sh -t stat4 -i gunzip -c
temp/monolingual.setagged.gz -n 5 -p -o temp/iarpa.gz -k 10
  $IRSTLM/bin/compile-lm temp/iarpa.gz --text yes /dev/stdout | gzip -c 
$LMFILE
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] creating LM with IRST toolkit

2011-11-30 Thread Nicola Bertoldi
Hi Hieu

On Dec 1, 2011, at 8:34 AM, Hieu Hoang wrote:

 hi all
 
 can anyone tell me if creating LM with the IRST toolkit is integrated into 
 the EMS yet?
 

I let anyone else to answer this point.

 if not, is this the entirety of what has to be run?
   cat $CORPUSFILE | $IRSTLM/bin/add-start-end.sh | gzip -c  
 temp/monolingual.setagged.gz 
   $IRSTLM/bin/build-lm.sh -t stat4 -i gunzip -c 
 temp/monolingual.setagged.gz -n 5 -p -o temp/iarpa.gz -k 10 
   $IRSTLM/bin/compile-lm temp/iarpa.gz --text yes /dev/stdout | gzip -c  
 $LMFILE
 

yes, this is the procedure to train a LM with IRSTLM.
If your corpus is not too big and fits in the memory, you
can use the tlm command to esimate the LM  and directly
store it in binary format (skipping the compile-lm step).
Please, see the IRSTLM manual for details on its usage,
and send further questions directly to the irstlm mailing list:
user-irs...@list.fbk.eu


best
Nicola

 
 ___
 Moses-support mailing list
 Moses-support@mit.edu
 http://mailman.mit.edu/mailman/listinfo/moses-support


___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support