Hello, I have duplicate entries in my rule table, extract.sorted.gz files. I am training using a data set of of size around 1000,000 lines I am translating from english to arabic. Is this normal ? Will removing duplicates affect my decoder ?
Regards, Ayah
_______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support