Hi Ken,
I'm created an iARPA file with IRSTLM using the options -n 3 (2 grams), -b
(include the s sentence boundary) and -d (subdictionary for ngrams).
Then, I used IRSTLM's compile-lm with --text yes to convert to ARPA format.
Finally, I ran build_binary to binarize the ARPA format for KenLM.
the empty line after each ngram-block is not mandatory in the ARPA
format
(see http://www.speech.sri.com/projects/srilm/manpages/ngram-format.
5.html)
and IRSTLM does not produce it.
best regards,
Nicola Bertoldi
On Oct 26, 2010, at 9:42 AM, supp...@precisiontranslationtools.com
Hi,
I have a question about PBSMT estimation. If I understand it correctly
this is done in the following manner:
- first IBM alignments in both directions
- then an aligment heuristic such as grow-diag final
- from this we create all possible phrase pairs, with some restrictions
(nothing going
On Tue, Oct 26, 2010 at 9:43 AM, sa...@kortec.nl wrote:
Hi,
I have a question about PBSMT estimation. If I understand it correctly
this is done in the following manner:
- first IBM alignments in both directions
- then an aligment heuristic such as grow-diag final
- from this we create all
I've fixed this in revision 3657 and tested that it works with a toy
IRSTLM example.
Sorry about that,
Kenneth
P.S. a faster version is under code review and coming soon.
On 10/26/10 03:57, Nicola Bertoldi wrote:
the empty line after each ngram-block is not mandatory in the ARPA format
(see
--
Regards,
John J Morgan
Hello,
I'm trying to get an English to Pashto system working with
syntax-directed models in the ems framework.
Tuning with mert fails.
I think I've tracked down the problem to the commandline option
inputtype = 0. inputtype is set to 3 in the moses.ini, but the
Hi,
As far as I remember, there was some people around telling that the decoder,
sometimes, dies with the following error message,
[ERROR] Malformed input at
Expected input to have words composed of 1 factor(s) (form FAC1|FAC2|...)
but instead received input with 0 factor(s).
Aborted
Exit
hi john
do you know what the error is? It's usually found in the log file
TUNING_tune.??.STDERR
can you send me an example of your input a few files of the translation
model.
using inputtype=0 (normal sentence) should be ok if it's a hiero model or a
string-to-tree model. you only need
Thank you, Ken. I'll update my svn revision.
Tom
On Tue, 26 Oct 2010 10:18:17 -0400, Kenneth Heafield mo...@kheafield.com
wrote:
I've fixed this in revision 3657 and tested that it works with a toy
IRSTLM example.
Sorry about that,
Kenneth
P.S. a faster version is under code review
Thanks Ken. I tested it and it works.
FYI, on my first attempt there was a different error. Something about the
s token (word?) was missing. I added the s/s tags and re-ran irstlm's
build-lm.sh script with option -b (Include sentence boundary n-grams) and
the error disappeared.
It's pretty fast
Yes, I require s and /s to appear in your ARPA. These tags are
important from an output quality perspective (BLEU etc). I'll put that
in the documentation when I get around to writing it, but personally
think IRST should include them by default.
Kenneth
On 10/26/10 12:30,
No luck with either way. This error message shows up at just the beginning
of the decoding phase after loading phrase table before decoding the first
input.
--
Mehmet Tatlıcıoğlu
Cell Phone: +90 532 201 85 64
On 26 October 2010 17:57, Ondrej Bojar bo...@ufal.mff.cuni.cz wrote:
Hi,
one more
Hieu,
According to the TUNING_tune.?.STDERR file it looks like tuning
succeeded and a moses.ini.? file was generated under the tuning
directory.
I had hierarchical-rule-set = true set in my configuration file. I
guess this is the cause of my problems, right?
I'm trying to build a tree to string
Hi,
the tree-to-string has not been properly integrated into EMS,
but that should be pretty straight-forward. We will look into
this.
-phi
On Tue, Oct 26, 2010 at 8:45 PM, Hieu Hoang h...@hoang.co.uk wrote:
Hi john,
I use the EMS but i don't think it know how to do tree-to-string models.
14 matches
Mail list logo