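Hello - Nutch treats those pages as exactly what they are: separate pages.
Each URL is fetched and stored as its own document, and Nutch does not
recognize one page as the continuation of another.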
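
If you need the parts of one article grouped together, that has to happen outside the crawl, for example in post-processing of the crawled URLs. Below is a minimal Python sketch of that idea, not anything Nutch provides. It assumes the site marks continuation pages with a `page` query parameter; that parameter name, the helper names `article_key` and `group_pages`, and the example URLs are all hypothetical and will differ per site.

```python
from collections import defaultdict
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: the site paginates articles with "?page=N". Adjust per site.
PAGE_PARAM = "page"


def article_key(url):
    """Return (URL without the page parameter, page number) for one URL."""
    parts = urlsplit(url)
    page = 1  # a URL without the parameter is treated as page 1
    kept = []
    for name, value in parse_qsl(parts.query, keep_blank_values=True):
        if name == PAGE_PARAM and value.isdigit():
            page = int(value)
        else:
            kept.append((name, value))
    # Rebuild the URL without the page parameter and without any fragment.
    base = urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), "")
    )
    return base, page


def group_pages(urls):
    """Group crawled URLs by article, with each group ordered by page number."""
    groups = defaultdict(list)
    for url in urls:
        base, page = article_key(url)
        groups[base].append((page, url))
    return {base: [u for _, u in sorted(pages)] for base, pages in groups.items()}


if __name__ == "__main__":
    crawled = [
        "http://example.com/news/story?page=2",
        "http://example.com/news/story",
        "http://example.com/news/other?id=5",
    ]
    for base, pages in group_pages(crawled).items():
        print(base, pages)
```

Sites that paginate through the path (such as `/story/2`) or offer a "view full article" link would need a different rule in `article_key`.
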
Markus

 
 
-----Original message-----
> From:Ankit Goel <[email protected]>
> Sent: Thursday 24th March 2016 6:29
> To: [email protected]
> Subject: multi page news article
> 
> Hi,
> How does Nutch handle, or how would it handle, an article that goes on for
> 1-2 pages? Usually we get a link at the bottom to the next page, and in some
> (not all) cases we get the option to view the full article together. And if
> it follows the connecting links in true crawler fashion, does it recognize
> that it is a continuation? Or is it logged as a separate link?
> 
> -- 
> Regards,
> Ankit Goel
> http://about.me/ankitgoel
> 
