Hi Edward,

Host table is using for Host Based configuration like maxThreads, crawlDelay, mincrawlDelay etc. But this tables is option.

In normal usage Host table dont create. Can you explain how do you start your crawler ?

Talat


07-11-2013 08:22 tarihinde, [email protected] yazdı:

Hi,everyone,
I'm new to Nutch and using Nutch2.2.1 with Hbase as the datastore.When I finished a whole round of crawing,I found 
"host","webtable" in the Hbase. As to the "host" table,I am not quite sure about it's 
function, like in which step(inject,generate,fetch,parse,updatedb,updatehostdb) is this "host" table get involved 
in?  And what does the data stored in the 'host" table really mean?  Can anyone share some information? Thank a lot!


Edward


Reply via email to