Is there a way for me to prevent nutch from fetching outlinks from pages
that I decide to be irrelevant (where I make the decision that a page is
irrelevant during the parsing of that page with my parse filter)?  I realize
that I can stop nutch from indexing such pages, but I believe the index is
separate from the structure that determines what new pages should be
fetched.

Best,
John

Reply via email to