Hello Eyeris, Thank you very much for your suggestion. Sorry for my late reply.
Using the urls filter plugins is a good option. I am doing this for my current crawling task. However, using urls filters is not exactly what I want. I feel there should be some better ways to restrict nutch only crawl the links on designated web pages. Currently, maybe nutch does not provide such a feature. Best, Junqiang On Sun, Jan 31, 2016 at 9:26 PM, Eyeris Rodriguez Rueda <[email protected]> wrote: > Hello Jun. > Maybe you can use nutch´s urls filter plugins. This plugins are used to > filter o restrict the visit of links. > Please i need more details about your situation. > > 1-How are selected the link to visit on your pages(A, B, C) , it has some > pattern,subdomain or some keyword in url´s links?

