On 3/4/2013 5:30 AM, Diman Karagiozov wrote:
> Hi there,
>
> UIMA does not do out-of-the-box text extraction from various document formats.
> For this task you can use TIKA ( http://tika.apache.org/).
There is also a UIMA add-on annotator which wraps TIKA and enables it to run
inside a UIMA pipeline - perhaps useful if you're going to be combining other
analytics with this.

http://uima.apache.org/sandbox.html#tika.annotator

-Marshall
>
> In our project (ATLAS - http://www.atlasproject.eu/) we've developed a text
> extraction framework prior UIMA wrapped NLP tools for different languages. Do
> not hesitate to contact me if you need more information on this.
>
> greetings
> Diman
>
> On 03/04/2013 12:26 PM, Mehdi Alaoui Belghiti wrote:
>> Hi,
>> I was looking for a platform that can make me processing files written in
>> different formats (xml, owl, rdf,...) and extract relevant information. So
>> i found UIMA.
>> However, I found only examples for processing natural language.
>> Is UIMA limited to this, or it can allow me for example extracting classes
>> or attributes from an a Ecore file?
>>
>> Thank you for help! I would be happy to find examples of processing more
>> complex data.
>>
>
>

Reply via email to