I've been using IRSTLM's build-lm.sh to build an LM. Then converted
from iARPA to ARPA with compile-lm --text yes. On a recent data set, I
encountered the following exception with KenLM's build_binary on the
resulting ARPA file:
Reading
cased.irstlm.arpa
----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100
***********************terminate
called after throwing an instance of 'lm::FormatLoadException'
what():
Positive probability 4.00956e-08 in the -gram at byte 46366584 in file
cased.irstlm.arpa
Aborted
build_binary is successful with almost
exactly the same data set in lower case.
Any ideas?
Tom
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support