On Jul 26, 2009, at 7:24 AM, starz10de wrote:
Hi,
I am indexing a set of html websites using lucene (IndexHtml). The
indexer
work fine and I can also find the indexed term but the problem this
class
(IndexHtml) index all text inside the html site even the
advertisements. I
am interested
advertisements
or side links text.
Any help how to solve this problem? Did I use the class wrongly?
--
View this message in context:
http://www.nabble.com/Index-html-sites-using-IndexHtml-tp24666110p24666110.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com