I believe you need to clean it. Regards Adil I. Abbasi
On Sun, Jan 4, 2015 at 8:35 PM, Shadi Saleh <[email protected] <javascript:_e(%7B%7D,'cvml','[email protected]');>> wrote: > Thanks Adil, > > crawldb is not empty, now it contains old and current folder, should I > clean it before I start new crawl? what is the proper way? > > Best > > On Sun, Jan 4, 2015 at 4:28 PM, Adil Ishaque Abbasi <[email protected] > <javascript:_e(%7B%7D,'cvml','[email protected]');>> > wrote: > > > Yes, you are correct. no need to use the url filter. But this will work > > only if your crawldb remains empty. > > > > Regards > > Adil I. Abbasi > > > > On Sun, Jan 4, 2015 at 8:22 PM, Shadi Saleh <[email protected] > <javascript:_e(%7B%7D,'cvml','[email protected]');>> wrote: > > > > > Hello, > > > > > > I want to check this point please. > > > > > > I am using crawl to crawl www.example.com with depth =1 option, So if > > that > > > website contains url to other website e.g. www.example2.com nutch will > > not > > > crawl it , is it enogh to use depth option or should I use url filer? > > > > > > > > > Best > > > > > > > > > -- > > > > > > > > > > > > > > > *Shadi SalehPh.D StudentInstitute of Formal and Applied > > LinguisticsFaculty > > > of Mathematics and Physics* > > > *-Charles University in Prague* > > > > > > *16017 Prague 6 - Czech Republic Mob +420773515578* > > > > > > > > > -- > > > > > *Shadi SalehPh.D StudentInstitute of Formal and Applied LinguisticsFaculty > of Mathematics and Physics* > *-Charles University in Prague* > > *16017 Prague 6 - Czech Republic Mob +420773515578* > -- Regards Adil I. Abbasi

