Hi Manuel, how about storing the individual documents in XCAS or XMI format? That's probably the simplest and best supported way, though not very compact.
--Thilo Manuel Fiorelli wrote:
Hi, I know that the UIMA architecture provides the CAS (common analysis system) to share analysis data about a single artifact. Is there a standard way to store an annotated corpus, which could be used, for example, to train an AE? Regards, Manuel Fiorelli
