If you browse the cvs of nutch.org you will found an implementation.

HTH
Stefan


Am 10.01.2004 um 19:43 schrieb [EMAIL PROTECTED]:


Hi group,

would it be possible to implement a Analyser who filters HTML code out of a
HTML page. As a result I would have only the text free of any tagging.


Is is maybe better to use other existing open source software for that? Did
somebody tried that here?


Cheers,
Ralf

--
+++ GMX - die erste Adresse f�r Mail, Message, More +++
Neu: Preissenkung f�r MMS und FreeMMS! http://www.gmx.net



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to