2009/5/19 Greg Holmberg <[email protected]>:
> I sketched a possible solution to this on the wiki
> (http://cwiki.apache.org/UIMA/uima-sandbox-components.html, see "Document
> model") back in 2007, but it didn't generate much interest. There's also a
> proposal for document properties, beyond the simple
> SourceDocumentInformation class.
I'm happy to see I am not the only one who finds this feature useful. I saw that in your model every node is an annotation, which makes it easy to implement the property "textContent", which returns the text contained in an Element. Support for PDF (and other document formats) would also be an important addition...

Manuel Fiorelli
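
To illustrate the "textContent" point above, here is a minimal sketch in plain Java, not code from the wiki proposal. It assumes each node stores begin/end offsets into the document text, the way an annotation does, so the text of an element is just a substring. The class ElementSketch and its methods are hypothetical names made up for this example; in UIMA itself the covered text of an annotation is available through getCoveredText().

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for an annotation-backed document node; not a UIMA class.
final class ElementSketch {
    private final String documentText; // full text of the document being annotated
    private final String name;         // element name, e.g. "title" or "p"
    private final int begin;           // start offset, inclusive
    private final int end;             // end offset, exclusive
    private final List<ElementSketch> children = new ArrayList<>();

    ElementSketch(String documentText, String name, int begin, int end) {
        if (begin < 0 || end < begin || end > documentText.length()) {
            throw new IllegalArgumentException("offsets outside document text");
        }
        this.documentText = documentText;
        this.name = name;
        this.begin = begin;
        this.end = end;
    }

    // Adds a child node whose span must lie inside this node's span.
    ElementSketch addChild(String childName, int childBegin, int childEnd) {
        if (childBegin < begin || childEnd > end) {
            throw new IllegalArgumentException("child span outside parent span");
        }
        ElementSketch child = new ElementSketch(documentText, childName, childBegin, childEnd);
        children.add(child);
        return child;
    }

    String name() {
        return name;
    }

    List<ElementSketch> children() {
        return children;
    }

    // Because the node is just a span over the text, no tree walk is needed.
    String textContent() {
        return documentText.substring(begin, end);
    }

    public static void main(String[] args) {
        String text = "Title\nFirst paragraph.";
        ElementSketch doc = new ElementSketch(text, "document", 0, text.length());
        ElementSketch title = doc.addChild("title", 0, 5);
        ElementSketch para = doc.addChild("p", 6, text.length());
        System.out.println(title.name() + ": " + title.textContent());
        System.out.println(para.name() + ": " + para.textContent());
    }
}
```

With a DOM-style tree you would have to concatenate the text of all descendant nodes; with offsets, the parent's text is available directly even when it has many children.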
