Hi, If I have to exclude some parts of a web page from getting indexed, how can I do it? As I understand, DOMContentUtils class of HTML parser plugin currently ignores only SCRIPT, STYLE and comment text. Can I configure it to exclude some other tags too?
Thanks, Kannan ------------------------------------------------------- This SF.Net email is sponsored by Yahoo. Introducing Yahoo! Search Developer Network - Create apps using Yahoo! Search APIs Find out how you can build Yahoo! directly into your own Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005 _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
