We've put together a small Swahili-English corpus (15k training sentences, plus 2k each for dev and test), which you can get here: http://demo.clab.cs.cmu.edu/cdyer/gv.sw-en.tar.gz
It's roughly equivalent to the data used for the experiments reported in this paper: http://anthology.aclweb.org//D/D13/D13-1174.pdf

On Mon, Jun 30, 2014 at 5:38 PM, Adam Lopez <[email protected]> wrote:
> Hi -- Asking on behalf of a colleague: does anyone know of MT systems and/or
> parallel datasets for the languages of Uganda? (Swahili, Luganda, Soga,
> Karomojong, Alur, etc.)
> -Adam

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
