We've put together a small corpus (15k sentences training, 2k each of
dev/test) of Swahili-English which you can get here:
http://demo.clab.cs.cmu.edu/cdyer/gv.sw-en.tar.gz

It's roughly equivalent to the data used for the experiments reported
in this paper:
http://anthology.aclweb.org//D/D13/D13-1174.pdf

On Mon, Jun 30, 2014 at 5:38 PM, Adam Lopez <[email protected]> wrote:
> Hi -- Asking on behalf of a colleague: does anyone know of MT systems and/
> or parallel datasets for the languages of Uganda? (Swahili, Luganda, Soga,
> Karomojong, Alur, etc.)
> -Adam
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to