Dear Sam,

Sam Fisher wrote:
I'm probably not using Jericho correctly, because the output of the parser is the same as what went in (not stripped down to only the text content).


I also think so ;-). I experimented with Jericho in UIMA and did not have any problems.

Has anyone had success using jericho with uima?


How did you use Jericho?

I did not have any problems with

new Source(new StringReader("<html>Te<b>s</b>t<html>")).getTextExtractor();

or in UIMA with

new Source(new StringReader(jCas.getDocumentText())).getTextExtractor();


Best regards,
Roman


--
Roman Klinger
Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI)
Schloss Birlinghoven
D-53754 Sankt Augustin
Tel.: +49-2241-14-2360
Fax.: +49-2241-14-4-2360
email: [EMAIL PROTECTED]

Reply via email to