Hey Look at the file Test.java under lucene1.4 ,it strips out html tagsand gives u content...
with regards Karthik -----Original Message----- From: root [mailto:root]On Behalf Of Mahesh Sent: Thursday, May 20, 2004 11:13 AM To: [EMAIL PROTECTED] Subject: How do i prevent the HTML tags being added to Lucene Index.. I am using the lucene 1.4 to index the information. I have lot of HTML tags in the information that i will be indexing ,so let me know if their is any way of removing the HTML tags from being indexed.. MAHESH --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
