Rather than making up some invalid HTML tag, I'd recommend you separate your navigation section with a <div> or <span> with a special class="navigation" or something like that. Then use JTidy to ignore such tags that have that class. Then you get valid, clean HTML and the ability to filter it for indexing.
Erik
On Thursday, January 30, 2003, at 04:56 AM, Michael Wechner wrote:
Hi
I am looking for an HTMLParser which skips text tagged by
<no-index> or something similar. This way I could exclude for
instance a "global navigation section" within the HTML
<no-index>
International<br>
Business<br>
Science<br>
...
</no-index>
It seems that the current demo/HTMLParser (http://lucene.sourceforge.net/cgi-bin/faq/ faqmanager.cgi?file=chapter.indexing&toc=faq#q11)
is not capable of doing something like that.
Any pointers are very welcome.
Thanks a lot
Michael
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
