Dear Sam,
Sam Fisher wrote:
> I'm probably not using Jericho
> correctly, because the output of the parser is the same as what went in
> (not stripped down to only the text content).
I also think so ;-). I experimented with Jericho in UIMA and did not
have any problems.
> Has anyone had success using jericho with uima?
How did you use Jericho?
I did not have any problems with
new Source(new StringReader("<html>Te<b>s</b>t</html>")).getTextExtractor();
or in UIMA with
new Source(new StringReader(jCas.getDocumentText())).getTextExtractor();
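For reference, here is a minimal sketch of the first call as a complete program. The class name JerichoTextDemo and the helper extractText are only illustrative names, and the import assumes the Jericho 3.x package net.htmlparser.jericho (older 2.x releases used au.id.jericho.lib.html instead). Note that getTextExtractor() returns an extractor object, and the stripped text is obtained by calling toString() on it. If the output looks identical to the input, one possible cause is that the Source itself is being printed, since its toString() returns the original markup.

```java
import java.io.IOException;
import java.io.StringReader;

import net.htmlparser.jericho.Source;

public class JerichoTextDemo {

    // Hypothetical helper: returns the text content of an HTML string
    // with the markup removed.
    static String extractText(String html) throws IOException {
        // Source(Reader) reads the whole document and may throw IOException.
        Source source = new Source(new StringReader(html));
        // getTextExtractor() only creates the extractor;
        // toString() performs the extraction and returns the plain text.
        return source.getTextExtractor().toString();
    }

    public static void main(String[] args) throws IOException {
        System.out.println(extractText("<html>Te<b>s</b>t</html>"));
    }
}
```

In a UIMA annotator the same helper could presumably be called with jCas.getDocumentText() as its argument, as in the second call above.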
Best regards,
Roman
--
Roman Klinger
Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI)
Schloss Birlinghoven
D-53754 Sankt Augustin
Tel.: +49-2241-14-2360
Fax.: +49-2241-14-4-2360
email: [EMAIL PROTECTED]