Hi everybody,

We've got a problem using Nutch: the website we have to crawl has a navigation 
bar at the top of every page. Nutch picks up the navigation text on each page, 
so for queries that match terms in the navigation, every page is returned as a 
result.

Is there a way to tell Nutch to crawl only part of a page, for example just 
the main content?
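
To illustrate the behaviour we are after, here is a rough sketch in plain 
Python. It is not Nutch code and does not use any Nutch API; the assumption 
that the navigation sits in a <nav> element, and all the names in it, are made 
up for the example.

```python
from html.parser import HTMLParser


class MainContentExtractor(HTMLParser):
    """Collects page text while skipping everything inside the given tags."""

    def __init__(self, skip_tags=("nav",)):
        super().__init__()
        # Tags whose text should be ignored (hypothetical: our navigation).
        self.skip_tags = set(skip_tags)
        self.skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.skip_tags:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.skip_tags and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are outside every skipped element.
        if self.skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_main_text(html):
    """Return the page text without the text of the skipped elements."""
    parser = MainContentExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


page = (
    "<html><body>"
    "<nav><a href='/p'>Products</a> <a href='/c'>Contact</a></nav>"
    "<div id='content'>Welcome to our shop</div>"
    "</body></html>"
)
print(extract_main_text(page))
```

With text extracted like this, a query for "Products" would no longer match 
every page just because the word appears in the navigation.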

Thanks in advance and regards,
Christian 
