Pls refer to getText() method in org.apache.nutch.parse.html.DOMContentUtils class (of course parse-html plugin). You can add your filter easily;)
Wow! That was really easy. Thanks. --Jeff
Pls refer to getText() method in org.apache.nutch.parse.html.DOMContentUtils class (of course parse-html plugin). You can add your filter easily;)
Wow! That was really easy. Thanks. --Jeff