Thanks Adil, crawldb is not empty, now it contains old and current folder, should I clean it before I start new crawl? what is the proper way?
Best On Sun, Jan 4, 2015 at 4:28 PM, Adil Ishaque Abbasi <[email protected]> wrote: > Yes, you are correct. no need to use the url filter. But this will work > only if your crawldb remains empty. > > Regards > Adil I. Abbasi > > On Sun, Jan 4, 2015 at 8:22 PM, Shadi Saleh <[email protected]> wrote: > > > Hello, > > > > I want to check this point please. > > > > I am using crawl to crawl www.example.com with depth =1 option, So if > that > > website contains url to other website e.g. www.example2.com nutch will > not > > crawl it , is it enogh to use depth option or should I use url filer? > > > > > > Best > > > > > > -- > > > > > > > > > > *Shadi SalehPh.D StudentInstitute of Formal and Applied > LinguisticsFaculty > > of Mathematics and Physics* > > *-Charles University in Prague* > > > > *16017 Prague 6 - Czech Republic Mob +420773515578* > > > -- *Shadi SalehPh.D StudentInstitute of Formal and Applied LinguisticsFaculty of Mathematics and Physics* *-Charles University in Prague* *16017 Prague 6 - Czech Republic Mob +420773515578*

