Hi Davood

It isn't normal to get such large differences in phrase table size or quality on the same data set, although small variations are possible. You should check carefully that you used exactly the same settings in each run, and check whether anything went wrong during training (errors in the log file).
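
One rough way to quantify how far two runs diverge is to compare the phrase pairs in the two tables directly. The sketch below is only an illustration, not part of Moses: it assumes each phrase table is a gzipped text file with one entry per line and fields separated by " ||| " (source phrase first, target phrase second), and the function names are made up for this example.

```python
import gzip


def load_phrase_pairs(path):
    """Return the set of (source, target) phrase pairs in a gzipped phrase table.

    Assumes one entry per line with fields separated by ' ||| ',
    source phrase first and target phrase second.
    """
    pairs = set()
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            fields = line.rstrip("\n").split(" ||| ")
            if len(fields) >= 2:
                pairs.add((fields[0], fields[1]))
    return pairs


def compare_phrase_tables(path_a, path_b):
    """Count phrase pairs unique to each table and pairs present in both."""
    pairs_a = load_phrase_pairs(path_a)
    pairs_b = load_phrase_pairs(path_b)
    return {
        "only_in_a": len(pairs_a - pairs_b),
        "only_in_b": len(pairs_b - pairs_a),
        "shared": len(pairs_a & pairs_b),
    }
```

Calling compare_phrase_tables on the phrase-table.gz files from the two runs (the paths depend on your setup) gives a quick picture: if most pairs are shared, the runs differ mainly in scores; if many pairs appear in only one table, the word alignment or extraction step probably behaved differently. Both sets are held in memory, so this is only practical for tables of moderate size.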

cheers - Barry

On 16/06/15 12:00, Davood Mohammadifar wrote:
Hello everyone

I used Moses 3 to train on my parallel corpus. I got different BLEU scores (18.5-22.5), so I tried to find the reason. Eventually I found that the phrase tables differ from each other. I trained on 50000 parallel sentences, and the phrase table was about 39MB (gz format) the first time and about 59MB (gz format) the second time. The contents of the phrase tables are also somewhat different (in scores and entries).

I used MGIZA and followed the instructions for the baseline system in the Moses manual. The problem remained when I used GIZA++, too.

The problem also remained when I trained on 150000 sentences.

Is it normal for phrase tables to differ in size like this?

Thank you


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
