In the process of crawling and indexing, some pages are just used as
"temporary links " to the pages I want to index, so how can I control those
kinds of pages not being indexed? Or which part of nutch should I extend?

Reply via email to