Re: Discussion of next UIMA release

Tommaso Teofili Wed, 20 May 2009 00:18:07 -0700

2009/5/19 Manuel Fiorelli <[email protected]>

> I would like to see a well-established way to analyze semi-structured
> documents, such as (X)HTML pages. UIMA shouldn't provide its own
> parser, but at least a type system (like uima.cas) to represent a DOM
> Document within a CAS instance (the simplest solution is to represent
> element nodes as feature structures and text nodes as annotations on
> the plain text, but I suspect there are more convenient solutions).
>


I do agree with this.
Tommaso

Re: Discussion of next UIMA release

Reply via email to