Dear all, I am playing around with Moses for mere mortals (0.991). I have a very large EN>ES TM. When I try to extract the corpus with the Windows add-in, it turns out EN file has 255 559 lines as opposed to 262 858 in the ES one. This will cause problems in the training phase. Any idea how to tackle it? I thought the Windows add-in would provide ready-to-use corpora... Thanks in advance for your help. Cheers,
-- Rubén de la Fuente EN/FR>ES translator [email protected] +34 686 33 59 97 skype: rudelafuente twitter: rubendelafuente <http://twitter.com/rubendelafuente> <http://twitter.com/rubendelafuente> http://es.linkedin.com/in/rubendelafuente www.wordbonds.es Wordbonds in Facebook<http://www.facebook.com/#!/pages/wordbonds/127510957570>
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
