Peter Klügl wrote:
Hi,
What is the current state of comparing two CAS? Is there
a usable solution yet (with or without documentation)?
I am developing a rule-based system (with scripting functionalities)
especially for complex information and text extraction tasks. The IDE is
DLTK-based and UIMA descriptors (for a generic implementation) are
generated automatically. Currently I am improving an information
extraction application with a test-driven approach. The test cases are,
of course, CAS XMI files, and the comparison (of two CAS) works, but is
not yet satisfactory. I am especially interested in annotations for the
false positives and false negatives (overlapping or not overlapping).
Back to my question:
How do you all compare two CAS?
Is there a reusable implementation?
I don't know of one.
Peter
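
For reference, the kind of comparison asked about above (false positives and false negatives, each split into overlapping and non-overlapping) could be sketched roughly as follows. This is not UIMA API and not anyone's actual implementation from this thread: `Span`, `overlaps` and `compare` are hypothetical names, and annotations are assumed to have already been read from the two CAS as plain (type, begin, end) records with half-open offsets.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """Hypothetical stand-in for an annotation: type name plus character offsets."""
    type: str
    begin: int
    end: int


def overlaps(a: Span, b: Span) -> bool:
    # Same type, and the half-open ranges [begin, end) share at least one character.
    return a.type == b.type and a.begin < b.end and b.begin < a.end


def _by_offset(span: Span):
    return (span.begin, span.end, span.type)


def compare(gold, predicted):
    """Sort annotations into exact matches and four kinds of errors."""
    gold_set, pred_set = set(gold), set(predicted)
    exact = gold_set & pred_set
    result = {
        "true_positive": sorted(exact, key=_by_offset),
        "fp_overlapping": [],
        "fp_non_overlapping": [],
        "fn_overlapping": [],
        "fn_non_overlapping": [],
    }
    # Predicted but not in gold: false positives.
    for p in sorted(pred_set - exact, key=_by_offset):
        partial = any(overlaps(p, g) for g in gold_set)
        result["fp_overlapping" if partial else "fp_non_overlapping"].append(p)
    # In gold but not predicted: false negatives.
    for g in sorted(gold_set - exact, key=_by_offset):
        partial = any(overlaps(g, p) for p in pred_set)
        result["fn_overlapping" if partial else "fn_non_overlapping"].append(g)
    return result


gold = [Span("Person", 0, 5), Span("Person", 10, 15), Span("Date", 20, 30)]
predicted = [Span("Person", 0, 5), Span("Person", 11, 15), Span("Date", 40, 50)]
result = compare(gold, predicted)
```

Here an exact match requires the same type and the same offsets. Whether a partial overlap should count as an error or as a (partial) hit is a design decision that depends on the evaluation scenario.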
Katrin Tomanek wrote:
Hi,
Depends what your favorite tooling story is. If you prefer
the eclipse tooling, it should go into eclipse. I know
people who would use this kind of functionality if it was
in CVD :-)
And shouldn't the differences be kept as new annotation types so the
viewers don't need to be changed?
Somehow I don't see that. The tooling could be made a lot
nicer if it knows it's displaying differences. And I wouldn't
want to add annotations to my data just for display purposes.
Or maybe I misunderstood?
Hmm, not sure. This is probably data that is only used in evaluation
scenarios, so I don't see a big problem with it.
Well, in our first version we now just add new types. Works OK for us
so far. However, it's really just a first version...
Katrin
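
The "differences as new annotation types" idea discussed above might look roughly like the sketch below. It is only an illustration under stated assumptions, not the implementation mentioned in the thread: annotations are plain (type, begin, end) tuples rather than UIMA feature structures, only exact offset matches count as equal, and the type names `FalsePositive` and `FalseNegative` are made up here. In a real UIMA setup such types would also have to be declared in the type system.

```python
def difference_annotations(gold, predicted):
    """Return the differences as new annotations with hypothetical type names."""
    gold_set, pred_set = set(gold), set(predicted)
    # Predicted but not in gold: mark the span as a false positive.
    diffs = [("FalsePositive", begin, end) for (_, begin, end) in pred_set - gold_set]
    # In gold but not predicted: mark the span as a false negative.
    diffs += [("FalseNegative", begin, end) for (_, begin, end) in gold_set - pred_set]
    # Order by offsets so the result is stable and easy to read.
    return sorted(diffs, key=lambda d: (d[1], d[2], d[0]))


gold = [("Person", 0, 5), ("Date", 20, 30)]
predicted = [("Person", 0, 5), ("Date", 22, 30)]
diffs = difference_annotations(gold, predicted)

# The differences sit next to the ordinary annotations, so a viewer that
# lists all annotations would show them without modification.
merged = predicted + diffs
```

The trade-off raised in the thread still applies: this keeps the viewers unchanged, but it adds evaluation-only annotations to the data.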