Hello - Nutch treats those separate pages just as, what they in fact are, as separate pages. Markus
-----Original message----- > From:Ankit Goel <[email protected]> > Sent: Thursday 24th March 2016 6:29 > To: [email protected] > Subject: multi page news article > > Hi, > How does nutch or would nutch handle an article that goes on for 1-2 pages? > Usually we get a link at the bottom to the next page and in some-not all > cases- we get the option to view the full article together. And if it > follows the connecting links in true crawler fashion, does it recognize > that it is a continuation? or it is logged as a separate link? > > -- > Regards, > Ankit Goel > http://about.me/ankitgoel >

