Hi, I have the same goal but for persian, although persian and arabic are different languages but they're using almost same orthography and I'm planning to develope a framework with basic modules for normalizing, stemming, POStagging, syntactic analysis, semantic/sentiment extraction and more. Actually we are a team of 6/7 students (less or more) and each one tries to develope one module as his/her own thesis. The whole effort should be a framework to use in text/audio engineering apps and more importantly for an IR system. Is this architecture suitable for such task and language? Thanks Davood Heidarpour
> Hi, > > at the level of the internal data representation, UIMA certainly supports > arabic. However, specific visualization tools or analysis components may > not support it. So if you want to program your own analysis with UIMA, you > should be ok. If you want to use UIMA out-of-the-box for Arabic or other > RTL languages, you might be hitting a wall. > > If you can explain in more detail what you plan to do, maybe we can give > some more specific pointers. > > Cheers, > > -- Richard > > On 05.08.2015, at 11:09, Khaled Zaki <[email protected]> wrote: > >> hi, >> this is khaled from Cairo University , and I'm using UIMA for the >> first >> time and I'm having a question considering the text mining , I was >> wondering if the UIMA support mining the Arabic language or not and if >> yes >> what should I do , as I have tried to browse an Arabic file but it >> failed >> regards >> Thank You in Advance. > >
