Hi Arefeh,

Have you been able to resolve that issue? Maybe one of your GIZA
alignments is flawed, for instance because the GIZA process was
terminated before it finished. Did you check that both the standard and
the inverse alignment files have the same number of lines?

Check it like this:

$ zcat training/giza.1/de-en.A3.final.gz | wc -l; \
  zcat training/giza-inverse.1/en-de.A3.final.gz | wc -l
900000
501713

If the two counts differ, as in this example, there is a problem and
you'd have to rerun GIZA in the inverse direction. If you get the same
number of lines and it matches what you expect from your corpus, then
it's a different issue and you have to investigate further.
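
If you'd rather script the check, something along these lines might
do. It is only a sketch: count_lines and compare_alignments are helper
names I made up for illustration, and it uses "gzip -dc" in place of
zcat because some systems ship a zcat that only handles .Z files.

```shell
#!/bin/sh
# Sketch: compare the line counts of two gzipped GIZA alignment files.
# count_lines and compare_alignments are hypothetical helper names.

count_lines() {
    # Decompress to stdout and count lines; tr strips the padding
    # that some wc implementations put around the number.
    gzip -dc "$1" | wc -l | tr -d ' '
}

compare_alignments() {
    # $1 = standard alignment file, $2 = inverse alignment file
    std=$(count_lines "$1")
    inv=$(count_lines "$2")
    echo "standard: $std"
    echo "inverse:  $inv"
    if [ "$std" -eq "$inv" ]; then
        echo "OK: line counts match"
    else
        echo "MISMATCH: the file with fewer lines is probably incomplete"
        return 1
    fi
}
```

With the paths from the example above (adjust them to your own
training directory) you would call it as:

compare_alignments training/giza.1/de-en.A3.final.gz training/giza-inverse.1/en-de.A3.final.gz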

Cheers,
Matthias


On Mon, 2014-10-06 at 03:13 -0700, Arefeh Kazemi wrote:
> Hi
> I have re-installed moses on my system but I have  a problem with giza
> - symmetrize step.
> it gets some errors of this type:
> Sentence mismatch error! Line #501714
> Sentence mismatch error! Line #501715
> .
> .
> .
> Sentence mismatch error! Line #900000
> 
> 
> all of my data files are in utf8 format and I have run moses
> successfully on these files before.
> 
> 
> any suggestion to fix the problem would be appreciated.
> 
> 
> Regards
> Arefeh
> 
> 
> 
>  
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support



-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
