[Moses-support] [moses]handling different with different amount of lines

Rubén de la Fuente Thu, 14 Jul 2011 07:40:38 -0700

Dear all,

I am playing around with Moses for mere mortals (0.991). I have a very large
EN>ES TM. When I try to extract the corpus with the Windows add-in, it turns
out EN file has 255 559 lines as opposed to 262 858 in the ES one. This will
cause problems in the training phase. Any idea how to tackle it? I thought
the Windows add-in would provide ready-to-use corpora... Thanks in advance
for your help. Cheers,


-- 
Rubén de la Fuente
EN/FR>ES translator
[email protected]
+34 686 33 59 97
skype: rudelafuente
twitter: rubendelafuente <http://twitter.com/rubendelafuente>
<http://twitter.com/rubendelafuente>
http://es.linkedin.com/in/rubendelafuente
www.wordbonds.es
Wordbonds in Facebook<http://www.facebook.com/#!/pages/wordbonds/127510957570>

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

[Moses-support] [moses]handling different with different amount of lines

Reply via email to