Re: [Moses-support] KenLM distributed with Moses

2010-10-26 Thread support
Hi Ken, I'm created an iARPA file with IRSTLM using the options -n 3 (2 grams), -b (include the s sentence boundary) and -d (subdictionary for ngrams). Then, I used IRSTLM's compile-lm with --text yes to convert to ARPA format. Finally, I ran build_binary to binarize the ARPA format for KenLM.

Re: [Moses-support] KenLM distributed with Moses

2010-10-26 Thread Nicola Bertoldi
the empty line after each ngram-block is not mandatory in the ARPA format (see http://www.speech.sri.com/projects/srilm/manpages/ngram-format. 5.html) and IRSTLM does not produce it. best regards, Nicola Bertoldi On Oct 26, 2010, at 9:42 AM, supp...@precisiontranslationtools.com

[Moses-support] PBSMT estimation

2010-10-26 Thread sanne
Hi, I have a question about PBSMT estimation. If I understand it correctly this is done in the following manner: - first IBM alignments in both directions - then an aligment heuristic such as grow-diag final - from this we create all possible phrase pairs, with some restrictions (nothing going

[Moses-support] Fwd: PBSMT estimation

2010-10-26 Thread Adam Lopez
On Tue, Oct 26, 2010 at 9:43 AM,  sa...@kortec.nl wrote: Hi, I have a question about PBSMT estimation. If I understand it correctly this is done in the following manner: - first IBM alignments in both directions - then an aligment heuristic such as grow-diag final - from this we create all

Re: [Moses-support] KenLM distributed with Moses

2010-10-26 Thread Kenneth Heafield
I've fixed this in revision 3657 and tested that it works with a toy IRSTLM example. Sorry about that, Kenneth P.S. a faster version is under code review and coming soon. On 10/26/10 03:57, Nicola Bertoldi wrote: the empty line after each ngram-block is not mandatory in the ARPA format (see

[Moses-support] inputtype for syntax-directed tuning

2010-10-26 Thread John Morgan
-- Regards, John J Morgan Hello, I'm trying to get an English to Pashto system working with syntax-directed models in the ems framework. Tuning with mert fails. I think I've tracked down the problem to the commandline option inputtype = 0. inputtype is set to 3 in the moses.ini, but the

[Moses-support] Malformed input at (blank)

2010-10-26 Thread Mehmet Tatlıcıoğlu
Hi, As far as I remember, there was some people around telling that the decoder, sometimes, dies with the following error message, [ERROR] Malformed input at Expected input to have words composed of 1 factor(s) (form FAC1|FAC2|...) but instead received input with 0 factor(s). Aborted Exit

Re: [Moses-support] inputtype for syntax-directed tuning

2010-10-26 Thread Hieu Hoang
hi john do you know what the error is? It's usually found in the log file TUNING_tune.??.STDERR can you send me an example of your input a few files of the translation model. using inputtype=0 (normal sentence) should be ok if it's a hiero model or a string-to-tree model. you only need

Re: [Moses-support] KenLM distributed with Moses

2010-10-26 Thread support
Thank you, Ken. I'll update my svn revision. Tom On Tue, 26 Oct 2010 10:18:17 -0400, Kenneth Heafield mo...@kheafield.com wrote: I've fixed this in revision 3657 and tested that it works with a toy IRSTLM example. Sorry about that, Kenneth P.S. a faster version is under code review

Re: [Moses-support] KenLM distributed with Moses

2010-10-26 Thread support
Thanks Ken. I tested it and it works. FYI, on my first attempt there was a different error. Something about the s token (word?) was missing. I added the s/s tags and re-ran irstlm's build-lm.sh script with option -b (Include sentence boundary n-grams) and the error disappeared. It's pretty fast

Re: [Moses-support] KenLM distributed with Moses

2010-10-26 Thread Kenneth Heafield
Yes, I require s and /s to appear in your ARPA. These tags are important from an output quality perspective (BLEU etc). I'll put that in the documentation when I get around to writing it, but personally think IRST should include them by default. Kenneth On 10/26/10 12:30,

Re: [Moses-support] Malformed input at (blank)

2010-10-26 Thread Mehmet Tatlıcıoğlu
No luck with either way. This error message shows up at just the beginning of the decoding phase after loading phrase table before decoding the first input. -- Mehmet Tatlıcıoğlu Cell Phone: +90 532 201 85 64 On 26 October 2010 17:57, Ondrej Bojar bo...@ufal.mff.cuni.cz wrote: Hi, one more

[Moses-support] inputtype for tre2string tuning

2010-10-26 Thread John Morgan
Hieu, According to the TUNING_tune.?.STDERR file it looks like tuning succeeded and a moses.ini.? file was generated under the tuning directory. I had hierarchical-rule-set = true set in my configuration file. I guess this is the cause of my problems, right? I'm trying to build a tree to string

Re: [Moses-support] inputtype for tre2string tuning

2010-10-26 Thread Philipp Koehn
Hi, the tree-to-string has not been properly integrated into EMS, but that should be pretty straight-forward. We will look into this. -phi On Tue, Oct 26, 2010 at 8:45 PM, Hieu Hoang h...@hoang.co.uk wrote: Hi john, I use the EMS but i don't think it know how to do tree-to-string models.