Hi all,
I just made a fresh checkout of mosesdecoder, as described in the
instruction of the website.
I noticed that somewhere in the repository, there is a folder named:
mosesdecoder/scripts/moses-for-mere-mortals/data-files/corpora_for_training/
which contains several sets with size 200k sentences each. I don't know
how useful these corpora are, but apparently because of them, it takes
really long to check out moses and sourceforge has a really hard time.
Commiting training data in an SVN repository is not much of a standard
practice.
Can't there be a solution so that these corpora are downloaded by only
the ones that need them?
regards
Eleftherios
--
MSc. Inf. Eleftherios Avramidis
DFKI GmbH, Alt-Moabit 91c, 10559 Berlin
Tel. +49-30-3949-1827
Fax. +49-30-3949-1810
-------------------------------------------------------------------------------------------
Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH
Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern
Geschaeftsfuehrung:
Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
Dr. Walter Olthoff
Vorsitzender des Aufsichtsrats:
Prof. Dr. h.c. Hans A. Aukes
Amtsgericht Kaiserslautern, HRB 2313
-------------------------------------------------------------------------------------------
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support