Hi,

there are a lot of parallel corpora available from the LDC (http://www.ldc.upenn.edu/). If paying membership fees is not an option, you could crawl the UN website yourself, which has large quantities of text in both English and Arabic.
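On the lowercaser point in the quoted message below: a minimal Python sketch of why lowercasing does nothing to Arabic script. It is only an illustration, not the Moses lowercaser script itself; the helper name `lowercase_line` and both sample strings are made up for the example.

```python
def lowercase_line(line: str) -> str:
    """Lowercase one line of text, as a generic lowercasing step would."""
    return line.lower()


# Made-up sample containing Arabic script only.
arabic = "مرحبا بالعالم"
# Made-up sample containing one Latin-script token next to Arabic text.
mixed = "Moses يترجم النص"

# Arabic letters have no upper/lower case, so the line comes back unchanged.
print(lowercase_line(arabic) == arabic)  # True

# The Latin-script token is still lowercased, so this line does change.
print(lowercase_line(mixed) == mixed)    # False
```

So skipping the lowercaser is safe for Arabic-only text, but if the Arabic side contains Latin-script tokens (names, URLs), lowercasing would still alter those.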
-phi

On Tue, Dec 16, 2008 at 10:06 AM, musa ghurab <[email protected]> wrote:
>
> Dear Marco
>
> I can answer the second part of your question.
> The tools (tokenizer, lowercaser) work well with Arabic text in UTF-8
> format, but note that Arabic has no lower case, which
> means there is no need to use the lowercaser script.
>
> thanks
> musa ghurab
>
> ________________________________
> Date: Tue, 16 Dec 2008 09:51:57 +0000
> From: [email protected]
> To: [email protected]
> Subject: [Moses-support] Moses on Arabic data
>
> Dear All,
> I'd like to use Moses to translate from Arabic to English. I have a few
> questions:
> 1) Do you know where I can find a parallel Arabic-English corpus?
> 2) Do all the Moses tools (tokenizer, lowercaser) work for the Arabic language?
>
> Thanks a lot
> Marco
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
