Hi Arthur,

You can use the crawlId parameter. With it, Gora creates a separate
HBase table for each crawl. For example, if your crawlId is "custom",
Nutch will create a custom_webpage table in HBase.
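
To make the naming concrete, here is a rough sketch in Python. It only
illustrates the rule above (crawlId plus "_webpage") and builds an inject
command line. The helper names are made up for this example, the plain
"webpage" default is taken from the table name in your message, and the
-crawlId flag on "bin/nutch inject" is how I remember the 2.x usage, so
please check the usage output of your own install.

```python
import shlex


def webpage_table(crawl_id=""):
    """Return the HBase table name for a crawl.

    Assumption: '<crawlId>_webpage' when a crawlId is given (as in the
    'custom' example), otherwise the plain 'webpage' table.
    """
    return f"{crawl_id}_webpage" if crawl_id else "webpage"


def inject_command(seed_dir, crawl_id):
    """Build the argument list for injecting seeds under a given crawlId.

    Assumption: 'bin/nutch inject <seed_dir> -crawlId <id>' matches your
    Nutch 2.x install; this function only builds the list, it runs nothing.
    """
    return ["bin/nutch", "inject", seed_dir, "-crawlId", crawl_id]


if __name__ == "__main__":
    print(webpage_table("custom"))
    print(shlex.join(inject_command("urls", "custom")))
```

If you go this way, the same crawlId has to be passed to every later step
of that crawl so they all read and write the same table.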

HTH

2015-05-02 10:56 GMT+03:00 Arthur Chan <[email protected]>:
> Hi,
>
> My Nutch is 2.3.1 and Gora 0.6 + HBase, I need to crawl another set of
> URLs, what is the correct step to completely clear existing fetch DB in
> nutch?
>
> I have tried to drop the 'webpage' table in Hbase, changed the seed.txt
> file, however I find Nutch here still remembers the previous fetch data set
> and continues the crawling from there.
>
> Please advise.
> Regards



-- 
Talat UYARER
Website: http://talat.uyarer.com
Twitter: http://twitter.com/talatuyarer
Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304
