Dear UIMA users,
I am interested in using html files in the UIMA pipeline in a way that I
can keep track of found named entities in the files. In other words: I
do not want to convert the html to text and process these files but use
the original html tags e.g. for visualization enriched with found named
entities.
My plan is to use an html parser to find the text snippets of interest
in html files but I am not sure about the integration in UIMA. Did
anyone implement something like that already? In which way?
Thanks in advance,
Roman
--
Roman Klinger
Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI)
Schloss Birlinghoven
D-53754 Sankt Augustin
Tel.: +49-2241-14-2360
Fax.: +49-2241-14-4-2360
email: [EMAIL PROTECTED]