I would recommend to use the domain-urlfilter, it is the most straightforward method of controlling the list of hosts in the crawldb. M
-----Original message----- > From:Shadi Saleh <[email protected]> > Sent: Sunday 4th January 2015 16:23 > To: user <[email protected]> > Subject: Depth option > > Hello, > > I want to check this point please. > > I am using crawl to crawl www.example.com with depth =1 option, So if that > website contains url to other website e.g. www.example2.com nutch will not > crawl it , is it enogh to use depth option or should I use url filer? > > > Best > > > -- > > > > > *Shadi SalehPh.D StudentInstitute of Formal and Applied LinguisticsFaculty > of Mathematics and Physics* > *-Charles University in Prague* > > *16017 Prague 6 - Czech Republic Mob +420773515578* >

