I believe you need to clean the crawldb before you start the new crawl.
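
In case it helps, here is a minimal sketch of one way to do that cleanup from a script. It assumes a local (non-HDFS) crawl directory that contains a crawldb subdirectory. The function name clean_crawldb and the directory name "crawl" are hypothetical and not part of Nutch. By default it moves the old crawldb aside instead of deleting it, so nothing is lost if the assumed layout does not match your setup.

```python
import shutil
import time
from pathlib import Path
from typing import Optional


def clean_crawldb(crawl_dir: str, keep_backup: bool = True) -> Optional[Path]:
    """Move aside (or delete) the crawldb directory under crawl_dir.

    Returns the backup path if a backup was made, otherwise None.
    Assumption: crawl_dir is a local directory laid out as crawl_dir/crawldb.
    """
    crawldb = Path(crawl_dir) / "crawldb"
    if not crawldb.is_dir():
        return None  # nothing to clean

    if keep_backup:
        # Rename instead of deleting, e.g. crawldb -> crawldb.bak-1420400000
        backup = crawldb.with_name(f"crawldb.bak-{int(time.time())}")
        crawldb.rename(backup)
        return backup

    # Permanently remove crawldb, including its "current" and "old" folders
    shutil.rmtree(crawldb)
    return None
```

Calling clean_crawldb("crawl") before the new crawl should leave no crawldb behind, so the next run starts from the seed URLs only. Other directories under the crawl directory are left untouched by this sketch.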

Regards
Adil I. Abbasi

On Sun, Jan 4, 2015 at 8:35 PM, Shadi Saleh <[email protected]> wrote:

> Thanks Adil,
>
> The crawldb is not empty: it now contains an "old" and a "current" folder.
> Should I clean it before I start a new crawl? What is the proper way?
>
> Best
>
> On Sun, Jan 4, 2015 at 4:28 PM, Adil Ishaque Abbasi <[email protected]>
> wrote:
>
> > Yes, you are correct. There is no need to use the URL filter. But this
> > will work only if your crawldb remains empty.
> >
> > Regards
> > Adil I. Abbasi
> >
> > On Sun, Jan 4, 2015 at 8:22 PM, Shadi Saleh <[email protected]> wrote:
> >
> > > Hello,
> > >
> > > I want to check this point please.
> > >
> > > I am using crawl to crawl www.example.com with the depth=1 option. So if
> > > that website contains a URL to another website, e.g. www.example2.com,
> > > Nutch will not crawl it. Is it enough to use the depth option, or should
> > > I use a URL filter?
> > >
> > >
> > > Best
> > >
> > >
> > > --
> > > *Shadi Saleh*
> > > *Ph.D Student*
> > > *Institute of Formal and Applied Linguistics*
> > > *Faculty of Mathematics and Physics - Charles University in Prague*
> > >
> > > *16017 Prague 6 - Czech Republic Mob +420773515578*
> > >
> >
>

