Hi all, I train MOSES on the task of a little bit noisy. It means around 90%-95% pairs are bilingual pairs. I have to face with quite disturbing errors.
When I use GIZA++, I'm stuck with the errors, such as: *alignment point (28,29) out of range (0-14,0-9) in line 468517, ignoring* *alignment point (30,31) out of range (0-14,0-9) in line 468517, ignoring* *alignment point (31,31) out of range (0-14,0-9) in line 468517, ignoring* *alignment point (31,32) out of range (0-14,0-9) in line 468517, ignoring* and the next results, for example: *WARNING: sentence 415878 has alignment point (6, 7) out of bounds (7, 7)* *T: øksnes là một đô_thị ở hạt nordland* *S: øksnes is a municipality in nordland county* Otherwise, when I also use MGIZA++ instead of GIZA++, i am stuck in the errors such as: *Use of uninitialized value in substitution ...* *Use of uninitialized value $a in split* or *Use of uninitialized value $a in scalar chomp at scripts/training/LexicalTranslationModel.pm* If you can, please give me any suggestion for this problem? Tks and best regards, C. Hoang -- Best Regards, C. Hoang SMT Nerd
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
