Hi all,

I just made a fresh checkout of mosesdecoder, as described in the 
instruction of the website.
I noticed that somewhere in the repository, there is a folder named:

     
mosesdecoder/scripts/moses-for-mere-mortals/data-files/corpora_for_training/ 


which contains several sets with size 200k sentences each. I don't know 
how useful these corpora are, but apparently because of them, it takes 
really long to check out moses and sourceforge has a really hard time. 
Commiting training data in an SVN repository is not much of a standard 
practice.
Can't there be a solution so that these corpora are downloaded by only 
the ones that need them?

regards
Eleftherios



-- 
MSc. Inf. Eleftherios Avramidis
DFKI GmbH, Alt-Moabit 91c, 10559 Berlin
Tel. +49-30-3949-1827
Fax. +49-30-3949-1810
-------------------------------------------------------------------------------------------
Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH
Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern

Geschaeftsfuehrung:
Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
Dr. Walter Olthoff

Vorsitzender des Aufsichtsrats:
Prof. Dr. h.c. Hans A. Aukes

Amtsgericht Kaiserslautern, HRB 2313
-------------------------------------------------------------------------------------------

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to