Also, have a look at the jakarta-lucene-sandbox CVS repository in contributions/ant. It indexes HTML content using JTidy to strip tags.

        Erik


On May 20, 2004, at 1:42 AM, Mahesh wrote:

I am using the lucene 1.4 to index the information.
I have lot of HTML tags in the information that i will be indexing ,so
let me know if their is any way of removing the HTML tags from being
indexed..


MAHESH




--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to