I am using Nutch 1.1.
I want to add a plugin that parses each web page and stores it in my
database, so I added a class that implements HtmlParseFilter.
However, I found that HtmlParseFilter still gets called even when the page
redirects to another page.
I thought ParseSegment.parse would be a better place for this, but why does
Nutch 1.1 call the parse function both in the Fetcher.output method and in
ParseSegment.parse?

Thanks in advance.

Tiger
2011-1-18
