Hi everybody,

No one has tried to help me. Any suggestion please ? 

Is there another place where I can ask my question if I'm not in the right
list ?Best regards,
 
Adnane

---------------------------------------------------------

From: Adnane Benjelloun [mailto:[email protected]] 
Sent: February 16, 2016 10:04 PM
To: [email protected]
Subject: fetch deletes all metadata except _csh_ and _rs_

Hello,

This problem happens at the second time I crawl a page

bin/nutch inject urls/
bin/nutch generate -topN 1000
bin/nutch fetch -all
bin/nutch parse -force -all
bin/nutch updatedb –all

second time :

bin/nutch generate -topN 1000 --> batchid changes for all existing pages
bin/nutch fetch -all --> *** metadatas are delete for all pages already
crawled **
bin/nutch parse -force -all
bin/nutch updatedb –all

I'm using mongodb

Any Help please ? I’m not sure if it’s a nutch bug or  it’s my
misunderstanding on nutch.

Best regards,

Adnane





Reply via email to