A couple of points:

-- You are asking ngram for perplexity scores, but Moses reports log
probabilities.
-- Moses appends the <s> and </s> pseudo-words to the start and end of
a sentence; this will change the probabilities.
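
As a quick sanity check on the numbers quoted below, here is a small Python sketch. It assumes that ngram prints its logprob in base 10 while the Moses lm: feature is a natural log, and that ngram's ppl is normalised over the words plus one end-of-sentence token per sentence (ppl1 over the words only). The variable names are only illustrative; the values are copied from the quoted message.

```python
import math

# Values copied from the quoted message (ngram -ppl output and n-best entry).
srilm_logprob_base10 = -7.40966   # "logprob=" printed by ngram
moses_lm_score = -17.0614         # "lm:" feature in the Moses n-best list
words, sentences, oovs = 4, 1, 0  # counts printed by ngram

# Assumption: ngram uses log base 10, Moses uses natural log.
# Multiplying by ln(10) converts a base-10 log into a natural log.
converted = srilm_logprob_base10 * math.log(10)

# Assumption: ppl counts one end-of-sentence token per sentence,
# while ppl1 counts the words only.
ppl = 10 ** (-srilm_logprob_base10 / (words - oovs + sentences))
ppl1 = 10 ** (-srilm_logprob_base10 / (words - oovs))

print(f"ngram logprob as natural log: {converted:.4f}")
print(f"Moses lm feature:             {moses_lm_score:.4f}")
print(f"ppl = {ppl:.4f}, ppl1 = {ppl1:.4f}")
```

If the converted value lines up with the lm: feature, the gap between the two scores is mostly a matter of log base rather than a different model score.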

Miles

2009/3/5 Carlos Henriquez <[email protected]>:
>
> Hi all.
>
>
> I'm running some tests extracting the n-best list from Moses (-n-best-list 
> option) with all model weights set to 1, and I don't understand how the 
> final LM score is computed. I'm using SRILM.
>
> For instance, my best translation from Chinese to English on sentence 9 was
>
> 9 ||| after three hours .  ||| d: 0 lm: -17.0614 tm: -7.41812 -0.944461 -4.79107 -2.87243 w: -4 ||| -37.0874
>
> but if I run ngram alone with the same output sentence
>
> echo "after three hours ." | ngram -order 5 -lm ../marie/lm/train.tok.en.lm -ppl -
>
> the result is very different
>
> file -: 1 sentences, 4 words, 0 OOVs
> 0 zeroprobs, logprob= -7.40966 ppl= 30.3341 ppl1= 71.1892
>
> I tried with some other values from my nbest list and I always found a big 
> difference between the two scores.
>
> If my initial weight is 1, why are the scores so different? I suppose I am 
> misunderstanding something.
>
> The moses command to obtain the n-best-list was
>
> moses -f moses.ini -i ../../corpus/dev.zh -d 1 -tm 1 1 1 1 -lm 1 -w 1 -n-best-list devout.moses.nbest 10 -include-alignment-in-n-best true > devout.moses 2> /dev/null
>
> (yep, I'm not using the last tm weight) and the moses.ini file does not have 
> any weights.
>
> --
> Carlos A. Henríquez Q.
> [email protected]
>
>
>
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>



-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
