Murat Ali Bayir wrote:
Assume that we have no restriction for max.inlinks, and we have two crawl namely crawl_depth1 than continue same crawl with crawl_depth2. There are two cases for obtainning final linkdb.
First one is run

./nutch invertlinks linkdb_depth1 segment_depth1
./nutch invertlinks linkdb_depth2 segment_depth2
./nutch mergelinkdb final_linkdb_1 linkdb_depth1 linkdb_depth2

and second one is run.

/nutch invertlinks final_linkdb2 segment_depth1 segment_depth2

is there any differenece between final_linkdb1 and final_linkdb2 ? I mean Is merge operation is loosless in this case?

It should be - if it's not then it's a bug that needs to be fixed.

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Reply via email to