On 3/4/2013 5:30 AM, Diman Karagiozov wrote:
> Hi there,
>
> UIMA does not do out-of-the-box text extraction from various document formats.
> For this task you can use TIKA (http://tika.apache.org/).

There is also a UIMA add-on annotator which wraps TIKA and enables it to run inside a UIMA pipeline - perhaps useful if you're going to be combining other analytics with this.

http://uima.apache.org/sandbox.html#tika.annotator

-Marshall

>
> In our project (ATLAS - http://www.atlasproject.eu/) we've developed a text
> extraction framework that runs prior to the UIMA-wrapped NLP tools for
> different languages. Do not hesitate to contact me if you need more
> information on this.
>
> greetings
> Diman
>
> On 03/04/2013 12:26 PM, Mehdi Alaoui Belghiti wrote:
>> Hi,
>> I was looking for a platform that lets me process files written in
>> different formats (xml, owl, rdf,...) and extract relevant information. So
>> I found UIMA.
>> However, I found only examples for processing natural language.
>> Is UIMA limited to this, or can it allow me, for example, to extract classes
>> or attributes from an Ecore file?
>>
>> Thank you for your help! I would be happy to find examples of processing more
>> complex data.
>>
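On the Ecore part of the question: an .ecore file is serialized as XMI, i.e. plain XML, so class and attribute names can be read with an ordinary XML parser before (or inside) any pipeline component. The following is only a minimal sketch in Python using the standard library, not a UIMA or TIKA API. The sample model, the function name extract_classes, and the assumption that the file uses the conventional "ecore" namespace prefix in its xsi:type values are all illustrative.

```python
import xml.etree.ElementTree as ET

# Fully qualified name of the xsi:type attribute as ElementTree exposes it.
XSI_TYPE = "{http://www.w3.org/2001/XMLSchema-instance}type"

# Hypothetical sample model, made up for illustration only.
SAMPLE_ECORE = """<ecore:EPackage xmi:version="2.0"
    xmlns:xmi="http://www.omg.org/XMI"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xmlns:ecore="http://www.eclipse.org/emf/2002/Ecore"
    name="library" nsURI="http://example.org/library" nsPrefix="lib">
  <eClassifiers xsi:type="ecore:EClass" name="Book">
    <eStructuralFeatures xsi:type="ecore:EAttribute" name="title"/>
    <eStructuralFeatures xsi:type="ecore:EAttribute" name="pages"/>
    <eStructuralFeatures xsi:type="ecore:EReference" name="author"/>
  </eClassifiers>
  <eClassifiers xsi:type="ecore:EClass" name="Author">
    <eStructuralFeatures xsi:type="ecore:EAttribute" name="name"/>
  </eClassifiers>
  <eClassifiers xsi:type="ecore:EEnum" name="Genre"/>
</ecore:EPackage>
"""


def extract_classes(ecore_xml):
    """Map each EClass name to the list of its EAttribute names.

    Assumes xsi:type values are written with the usual "ecore:" prefix;
    the prefix is compared as a literal string, not resolved.
    """
    root = ET.fromstring(ecore_xml)
    classes = {}
    # iter() also descends into nested sub-packages.
    for classifier in root.iter("eClassifiers"):
        if classifier.get(XSI_TYPE) != "ecore:EClass":
            continue  # skip enums and data types
        classes[classifier.get("name")] = [
            feature.get("name")
            for feature in classifier.findall("eStructuralFeatures")
            if feature.get(XSI_TYPE) == "ecore:EAttribute"
        ]
    return classes


if __name__ == "__main__":
    for class_name, attributes in extract_classes(SAMPLE_ECORE).items():
        print(class_name, attributes)
```

References (EReference) and enums are skipped on purpose here; if they are needed, the same loop can collect them by checking for other xsi:type values.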
