Hi,

There are many parallel corpora available from the LDC
(http://www.ldc.upenn.edu/). If paying membership fees is
not an option, you could crawl the UN website yourself;
it has large quantities of text in both English and
Arabic.
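
On the lowercasing point in Musa's reply quoted below, here is a
quick way to check it for yourself. This is only a sketch in plain
Python; it does not call the Moses scripts, and the sample strings
and the lowercase() helper are made up for illustration.

```python
# Assumption: these sample strings are illustrative only, not from any corpus.
arabic = "مرحبا بالعالم"
mixed = "Moses يترجم Arabic"


def lowercase(text: str) -> str:
    """Lowercase text the way a generic lowercasing step would."""
    return text.lower()


# Arabic letters have no upper/lower case, so the string is unchanged.
arabic_unchanged = lowercase(arabic) == arabic

# Latin-script tokens embedded in Arabic text are still affected.
mixed_lowered = lowercase(mixed)

print(arabic_unchanged)
print(mixed_lowered)
```

So lowercasing is harmless on pure Arabic text, but it still matters
for the English side and for any Latin-script tokens mixed into the
Arabic side.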

-phi

On Tue, Dec 16, 2008 at 10:06 AM, musa ghurab <[email protected]> wrote:
>
> Dear Marco
>
> I can answer the second part of your question.
> The tokenizer and lowercaser work well with Arabic text in UTF-8
> format, but note that Arabic has no letter case, which
> means there is no need to run the lowercaser script.
>
>
>
> Thanks,
> musa ghurab
>
>
>
>
> ________________________________
> Date: Tue, 16 Dec 2008 09:51:57 +0000
> From: [email protected]
> To: [email protected]
> Subject: [Moses-support] Moses on Arabic data
>
> Dear All,
> I'd like to use Moses to translate from Arabic to English. I have a few
> questions:
> 1) Do you know where I can find an Arabic-English parallel corpus?
> 2) Do all the Moses tools (tokenizer, lowercaser) work for Arabic?
>
> Thanks a lot
> Marco
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
