Pls refer to getText() method in
org.apache.nutch.parse.html.DOMContentUtils class (of course
parse-html plugin). You can add your filter easily;)

Wow! That was really easy. Thanks.
--Jeff

Reply via email to