Thanks for your help, everyone. I think that I will first explore using GATE as Julien suggests below. However, if anyone had any native UIMA tool for doing manual annotations, it would be much appreciated. Thilo, would your tool work as a temporary solution?
It seems to me that having some sort of solution here would be an important part of offering a complete UIMA toolset. In our organization, we are planning on working with both existing corpora and corpora which are more specific to the domain on which we are working. There is also the problem of testing our NLP solution on documents of interest to us. So there will be many scenarios in which it won't be sufficient to simply use standard corpora and we will need to do some annotation ourselves. Thanks again, Andrew Borthwick On 6/1/07, Julien Nioche <[EMAIL PROTECTED]> wrote:
GATE (http://gate.ac.uk) is open source and allows to create annotations manually. The interface is tightly bound to the GATE API so porting it to UIMA would be a relatively costly operation. It would certainly be easier to write a new annotation tool from scratch. However GATE could be used in the meantime to annotate documents and save them as XML, which could be loaded by UIMA at a later stage. There is also a UIMA plugin for GATE which allows to call UIMA processes from GATE and vice versa; but I am not sure it works with the Apache version of UIMA. That could help using existing UIMA resources for pre-annotating documents. Hope that helps Julien Thilo Goetz wrote: BTW, I have recently hacked UIMA's CAS Visual Debugger for a colleague to allow creating manual annotations. That was a one-off, though, and I haven't fed it back into the main code base. If people are interested in that kind of functionality, let me know. We wouldn't want to compete with a dedicated annotation tool, though. I would like to second Andrew Borthwick's original request for a UIMA-savvy annotation tool. Adding it to a full-featured annotator would probably be great, but having an open source option would offer the most potential upside for UIMA. Alembic and its replacement Callisto are free, but not open source, so I believe MITRE would have to add support for UIMA themselves. Are there any open source annotators people would recommend for integrating with UIMA? -j -- Joe Andrieu SwitchBook Software http://[EMAIL PROTECTED] +1 (805) 705-8651
