Hi there,
UIMA does not extract text from different document formats out of the
box.
For this task you can use Apache Tika (http://tika.apache.org/).
In our project (ATLAS - http://www.atlasproject.eu/) we have developed a
text extraction framework that runs before the UIMA-wrapped NLP tools for
different languages. Do not hesitate to contact me if you need more
information on this.
Greetings,
Diman
On 03/04/2013 12:26 PM, Mehdi Alaoui Belghiti wrote:
Hi,
I was looking for a platform that would let me process files written in
different formats (XML, OWL, RDF, ...) and extract relevant information,
and I found UIMA.
However, I found only examples for processing natural language.
Is UIMA limited to this, or can it also, for example, extract classes or
attributes from an Ecore file?
Thank you for your help! I would be happy to find examples of processing
more complex data.
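
For the Ecore case asked about above, here is a minimal sketch that uses
only the standard Java XML parser, not UIMA or EMF. It assumes the usual
XMI serialization of an .ecore file, where classes appear as eClassifiers
elements with xsi:type="ecore:EClass" and attributes as
eStructuralFeatures elements with xsi:type="ecore:EAttribute". The class
name EcoreClassLister, the method listClasses, and the sample model are
hypothetical and only for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper: lists EClass names and their EAttribute names.
public class EcoreClassLister {
    static final String XSI_NS = "http://www.w3.org/2001/XMLSchema-instance";

    // Made-up sample model, used only to demonstrate the parsing.
    public static final String SAMPLE =
        "<ecore:EPackage xmi:version=\"2.0\""
        + " xmlns:xmi=\"http://www.omg.org/XMI\""
        + " xmlns:xsi=\"http://www.w3.org/2001/XMLSchema-instance\""
        + " xmlns:ecore=\"http://www.eclipse.org/emf/2002/Ecore\""
        + " name=\"library\">"
        + "<eClassifiers xsi:type=\"ecore:EClass\" name=\"Book\">"
        + "<eStructuralFeatures xsi:type=\"ecore:EAttribute\" name=\"title\"/>"
        + "<eStructuralFeatures xsi:type=\"ecore:EReference\" name=\"author\"/>"
        + "</eClassifiers>"
        + "<eClassifiers xsi:type=\"ecore:EClass\" name=\"Author\">"
        + "<eStructuralFeatures xsi:type=\"ecore:EAttribute\" name=\"name\"/>"
        + "</eClassifiers>"
        + "<eClassifiers xsi:type=\"ecore:EEnum\" name=\"Genre\"/>"
        + "</ecore:EPackage>";

    // Maps each EClass name to the names of its EAttributes, in file order.
    public static Map<String, List<String>> listClasses(InputStream in)
            throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // needed to read xsi:type by namespace
        Document doc = factory.newDocumentBuilder().parse(in);

        Map<String, List<String>> result = new LinkedHashMap<>();
        NodeList classifiers = doc.getElementsByTagName("eClassifiers");
        for (int i = 0; i < classifiers.getLength(); i++) {
            Element classifier = (Element) classifiers.item(i);
            // Assumption: the conventional "ecore" prefix is used in xsi:type.
            if (!"ecore:EClass".equals(classifier.getAttributeNS(XSI_NS, "type"))) {
                continue; // skip enums and data types
            }
            List<String> attributes = new ArrayList<>();
            NodeList features = classifier.getElementsByTagName("eStructuralFeatures");
            for (int j = 0; j < features.getLength(); j++) {
                Element feature = (Element) features.item(j);
                if ("ecore:EAttribute".equals(feature.getAttributeNS(XSI_NS, "type"))) {
                    attributes.add(feature.getAttribute("name"));
                }
            }
            result.put(classifier.getAttribute("name"), attributes);
        }
        return result;
    }

    public static void main(String[] args) throws Exception {
        InputStream in = new ByteArrayInputStream(
            SAMPLE.getBytes(StandardCharsets.UTF_8));
        System.out.println(listClasses(in));
    }
}
```

If the result needs to feed into a UIMA pipeline, the extracted names
could be placed in the CAS by a custom reader or annotator. A model that
uses a different prefix or cross-file references would be handled more
reliably by loading it through EMF itself.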