Hi

(First of all a BIG THANKS to ALL open source developers at UIMA and the other projects I mention below whom I am now relying on :-) ).

I am looking at researching a particular knowledge base extraction task using UIMA components as part of the solution. To do this work I need UIMA output persistence and to be able to visualise this output as multiple annotation layers on the same text. Ultimately I want my automated annotations and visualisations to be web based and allow me to make additional manual annotations if required. Once I have my multiple annotations made on a text I will then be able to apply my new knowledge extraction logic.

I have looked at webanno (which incorporates Brat for its UI) and U-Compare as well as Argo (See https://code.google.com/p/webanno/, http://brat.nlplab.org/, http://u-compare.org/, http://nactem.ac.u/ucompare/downloads/, http://argo.nactem.ac.uk/about-argo/). I had hoped that I could use webanno for this task however webanno does not allow the direct import of UIMA components or UIMA output. I found that I could get U-Compare to work as I wanted and it shows promise however if I get my any configuration wrong between any UIMA components it crashes out. I got the software to work for me after I spent more time reading the manual. I found I needed to manually configure the input types for each component in the pipeline. The software recognises subsequent pipeline component compatibility when a new component is added to a work flow. My initial errors came as I had initially expected subsequent U-Compare components to automatically pick up their input from the output from previous workflow components. Whilst the U-compare software does support the saving of previous session data the software is not fully open source so I do not have easy access to this data. I have not looked at the webservice pipeline generation fetaures of U-Compare as yet; this might hold promise if it gives me a download configuration rather than a hosted solution. When I looked at the argo tool I had similar problems with a lack of output. I would assume for the same reasons. Again Argo is not fully open source so I cannot work on modifying this tool to my own ends. Are there any other better tools available that support web based UIMA layered visualisation and output persistence?

Currently I plan to continue to experiment with UIMA components using U-compare however I am looking to implement persistence and visualisation in a production tool. If someone already has a good open source implementation of this need I would prefer not to spend time reinventing this particular wheel.

I would be very happy if the U-compare and webanno teams would work together and get their software integrated. I will pass this mail onto these teams as a suggestion.

The particular data extraction task I am interested in is different to the current popular research shared task (http://www.nist.gov/tac/2014/KBP/) and one which I plan to share once have made some progress.

Thanks in advance.

Further information about me and my project can be found at www.hemseye.org

James Kitching

Reply via email to