Peter Klügl wrote:
Hi,

What is the current status of comparing two CASes? Is there a usable solution yet (with or without documentation)?

I am developing a rule-based system (with scripting functionality), especially for complex information and text extraction tasks. The IDE is DLTK-based, and UIMA descriptors (for a generic implementation) are generated automatically. Currently I am improving an information extraction application with a test-driven approach. The test cases are, of course, CAS XMI files, and the comparison (of two CASes) works but is not yet satisfactory. I am especially interested in annotations for the false positives and false negatives (overlapping or not overlapping).

Back to my question:
How do you all compare two CAS?
Is there a reusable implementation?

I don't know of one.
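
For illustration only, the kind of comparison described above can be sketched independently of UIMA. This is a minimal Python sketch, assuming each annotation has already been reduced to a (type, begin, end) triple; the names `Span`, `overlaps` and `compare_spans` are hypothetical and not part of any UIMA API. It separates exact matches from false positives and false negatives, and splits each of those into overlapping and non-overlapping cases.

```python
from typing import NamedTuple


class Span(NamedTuple):
    """Hypothetical stand-in for an annotation: type name plus offsets."""
    type: str
    begin: int
    end: int


def overlaps(a, b):
    """True if two spans of the same type share at least one character."""
    return a.type == b.type and a.begin < b.end and b.begin < a.end


def compare_spans(gold, predicted):
    """Classify spans into exact matches, false positives and false negatives.

    A false positive/negative counts as "overlapping" if a span of the
    same type on the other side overlaps it without matching exactly.
    """
    gold_set, pred_set = set(gold), set(predicted)
    result = {
        "true_positive": sorted(gold_set & pred_set),
        "false_positive_overlapping": [],
        "false_positive_other": [],
        "false_negative_overlapping": [],
        "false_negative_other": [],
    }
    # Predicted spans with no exact counterpart in the gold data.
    for p in sorted(pred_set - gold_set):
        hit = any(overlaps(p, g) for g in gold_set)
        key = "false_positive_overlapping" if hit else "false_positive_other"
        result[key].append(p)
    # Gold spans with no exact counterpart in the prediction.
    for g in sorted(gold_set - pred_set):
        hit = any(overlaps(g, p) for p in pred_set)
        key = "false_negative_overlapping" if hit else "false_negative_other"
        result[key].append(g)
    return result
```

Whether an overlapping mismatch should count as a partial hit or as a plain error is a design choice that depends on the evaluation; the sketch only reports the two cases separately.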



Peter


Katrin Tomanek wrote:
Hi,


Depends what your favorite tooling story is.  If you prefer
the eclipse tooling, it should go into eclipse.  I know
people who would use this kind of functionality if it was
in CVD :-)


And shouldn't the differences be kept as new annotation types so the viewers don't need to be changed?

Somehow I don't see that.  The tooling could be made a lot
nicer if it knows it's displaying differences.  And I wouldn't
want to add annotations to my data just for display purposes.
Or maybe I misunderstood?

Hm, not sure. This is probably data that is only used in evaluation scenarios, so I don't see a big problem with it. Well, in our first version we now just add new types. Works OK for us so far. However, it's really just a first version...
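
The "just add new types" idea could look roughly like the following. This is a minimal Python sketch, assuming annotations are plain dicts in a list standing in for a CAS; the type names `eval.FalsePositive` and `eval.FalseNegative` and the function `add_diff_annotations` are hypothetical and not taken from UIMA or from our implementation. Differences are stored as extra annotations, so an unmodified viewer can display them like any other type.

```python
def add_diff_annotations(gold, predicted):
    """Return the predicted annotations plus difference markers.

    Each annotation is a dict with "type", "begin" and "end" keys.
    Only exact matches are treated as correct in this sketch.
    """
    gold_keys = {(a["type"], a["begin"], a["end"]) for a in gold}
    pred_keys = {(a["type"], a["begin"], a["end"]) for a in predicted}

    merged = list(predicted)  # leave the input list untouched
    # Predicted but not in gold: mark as false positive.
    for type_name, begin, end in sorted(pred_keys - gold_keys):
        merged.append({"type": "eval.FalsePositive", "begin": begin,
                       "end": end, "original": type_name})
    # In gold but not predicted: mark as false negative.
    for type_name, begin, end in sorted(gold_keys - pred_keys):
        merged.append({"type": "eval.FalseNegative", "begin": begin,
                       "end": end, "original": type_name})
    return merged
```

Returning a new list rather than modifying the input keeps the evaluation markers out of the original data, which addresses part of the concern about adding annotations only for display.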

Katrin

