Dear UIMA users,

I would like to process HTML files in a UIMA pipeline in such a way that I can keep track of where named entities were found in the original files. In other words, I do not want to convert the HTML to plain text and discard the markup; I want to keep the original HTML tags, e.g. to visualize the pages enriched with the named entities that were found.

My plan is to use an HTML parser to find the text snippets of interest in the HTML files, but I am not sure how to integrate this into UIMA. Has anyone already implemented something like this? If so, how?
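
To make the offset-tracking idea concrete, here is a minimal sketch outside of UIMA, in plain Python with the standard library's html.parser. The class name OffsetTextExtractor and the function extract_snippets are made up for this illustration and are not part of UIMA or any other library. The sketch only records, for each non-empty text node, where it starts and ends in the raw HTML; it does not decode character references such as &amp;, and it also reports text inside script or style elements.

```python
from html.parser import HTMLParser


class OffsetTextExtractor(HTMLParser):
    """Hypothetical helper: collects text nodes with their offsets in the raw HTML."""

    def __init__(self, source):
        # convert_charrefs=False keeps each data chunk identical to the raw source text,
        # so len(data) is also its length in the original HTML.
        super().__init__(convert_charrefs=False)
        # Absolute offset at which each line starts, to convert getpos() results.
        self.line_starts = [0]
        for i, ch in enumerate(source):
            if ch == "\n":
                self.line_starts.append(i + 1)
        self.snippets = []  # list of (start, end, text)

    def handle_data(self, data):
        # getpos() returns the 1-based line and 0-based column where this chunk starts.
        line, col = self.getpos()
        start = self.line_starts[line - 1] + col
        if data.strip():  # skip whitespace-only chunks between tags
            self.snippets.append((start, start + len(data), data))


def extract_snippets(html):
    """Return (start, end, text) triples for all non-empty text nodes in html."""
    parser = OffsetTextExtractor(html)
    parser.feed(html)
    parser.close()
    return parser.snippets


if __name__ == "__main__":
    page = "<p>Hello <b>World</b></p>"
    for start, end, text in extract_snippets(page):
        # Each snippet can be located again in the untouched HTML via its offsets.
        print(start, end, repr(text), page[start:end] == text)
```

An entity found at position k inside a snippet would then sit at start + k in the original HTML. Assuming the raw HTML is kept as the document text, such offsets could presumably be used as begin/end values of annotations, so the markup stays available for visualization.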

Thanks in advance,
Roman

--
Roman Klinger
Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI)
Schloss Birlinghoven
D-53754 Sankt Augustin
Tel.: +49-2241-14-2360
Fax.: +49-2241-14-4-2360
email: [EMAIL PROTECTED]
