> the maximum allowed limit for a source word fertility
> source length = 1 target length = 28 ratio 28 ferility limit : 9
While training with GIZA++, we got the warning quoted above (see the attached
.txt file). The mailing list archive post below explains what the warning
means and why it appears, so I understand that part.
https://www.mail-archive.com/moses-support%40mit.edu/msg00603.html
Q1) Can we change the preset value "9" to something else?
Q2) Why is the initial value set to "9"?
Q3) I think "ferility limit" in the message should be "fertility limit".
How do I report this? (Or is there a repository that accepts pull
requests?)
Regards,
Masataka
Executing: (path)bin/GIZA++ -CoocurrenceFile
(path_to_train)train/align/trg-src.cooc -c
(path_to_train)train/align/trg-src.snt -m1 5 -m2 0 -m3 3 -m4 3
-model1dumpfrequency 1 -model4smoothfactor 0.4 -nodumps 1 -nsmooth 4 -o
(path_to_train)train/align/trg-src.giza -onlyaldumps 1 -p0 0.999 -s
(path_to_train)train/align/trg.vcb -t (path_to_train)train/align/src.vcb
Executing: (path)bin/GIZA++ -CoocurrenceFile
(path_to_train)train/align/src-trg.cooc -c
(path_to_train)train/align/src-trg.snt -m1 5 -m2 0 -m3 3 -m4 3
-model1dumpfrequency 1 -model4smoothfactor 0.4 -nodumps 1 -nsmooth 4 -o
(path_to_train)train/align/src-trg.giza -onlyaldumps 1 -p0 0.999 -s
(path_to_train)train/align/src.vcb -t (path_to_train)train/align/trg.vcb
Reading vocabulary file from:(path_to_train)train/align/src.vcb
Reading vocabulary file from:(path_to_train)train/align/trg.vcb
Reading vocabulary file from:(path_to_train)train/align/src.vcb
Reading vocabulary file from:(path_to_train)train/align/trg.vcb
{WARNING:(b)truncated sentence 1099}{WARNING:(b)truncated sentence
1137}{WARNING:(a)truncated sentence 1099}{WARNING:(a)truncated sentence
1137}{WARNING:(b)truncated sentence 2768}{WARNING:(a)truncated sentence
2768}{WARNING:(a)truncated sentence 4875}{WARNING:(b)truncated sentence
4875}{WARNING:(a)truncated sentence 8930}{WARNING:(b)truncated sentence
8930}{WARNING:(a)truncated sentence 12557}{WARNING:(b)truncated sentence
12557}WARNING: The following sentence pair has source/target sentence length
ration more than
the maximum allowed limit for a source word fertility
source length = 1 target length = 28 ratio 28 ferility limit : 9
Shortening sentence
Sent No: 12864 , No. Occurrences: 1
0 1605
47199 39626 13909 29030 760 13920 5649 26337 4446 22689 5649 13729 34683 4600
20668 14153 5649 15533 46156 4531 7238 7182 8179 46156 4570 45769 50363 261
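
For reference, here is how I read the check behind the warning, as a minimal
Python sketch. It is not GIZA++ code. The names FERTILITY_LIMIT,
exceeds_fertility_limit and filter_pairs are hypothetical, and I am assuming
that the ratio is target length divided by source length (as in the log,
28 / 1 = 28) and that a pair is flagged when that ratio is greater than the
limit.

```python
# Hypothetical sketch of the length-ratio check; not taken from GIZA++.
FERTILITY_LIMIT = 9  # the value printed in the warning


def exceeds_fertility_limit(src_len, trg_len, limit=FERTILITY_LIMIT):
    """Return True if trg_len / src_len is greater than the limit.

    Assumption: an empty source side is always flagged, because the
    ratio is undefined in that case.
    """
    if src_len <= 0:
        return True
    return trg_len / src_len > limit


def filter_pairs(pairs, limit=FERTILITY_LIMIT):
    """Keep only the (source, target) pairs that pass the ratio check.

    Sentences are whitespace-tokenized strings.
    """
    kept = []
    for src, trg in pairs:
        if not exceeds_fertility_limit(len(src.split()), len(trg.split()), limit):
            kept.append((src, trg))
    return kept
```

With the pair from the log (source length 1, target length 28),
exceeds_fertility_limit(1, 28) is true, so a pre-filter like filter_pairs
would drop that pair before training instead of leaving it to be shortened.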