You could make use of the arTenTen12 (7 billion tokens) that we have in
Sketch Engine -- drop me an e-mail if you'd be interested.
Moreover, we have already built word embeddings from this corpus, if that is
what you are looking for. Have a look at
(where you can find more languages; I will send a separate mail to this
list about that).
CEO, Lexical Computing
Brno, CZ | Brighton UK
On 31 January 2018 at 20:02, Alia Bahanshal <a.bahans...@gmail.com> wrote:
> Are there any open source Arabic corpora I can use for deep learning
> research purposes?
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list