Murat Ali Bayir wrote:
Assume that we have no restriction for max.inlinks, and we have two
crawl namely crawl_depth1 than continue same crawl with crawl_depth2.
There are two cases for obtainning final linkdb.
First one is run
./nutch invertlinks linkdb_depth1 segment_depth1
./nutch invertlinks linkdb_depth2 segment_depth2
./nutch mergelinkdb final_linkdb_1 linkdb_depth1 linkdb_depth2
and second one is run.
/nutch invertlinks final_linkdb2 segment_depth1 segment_depth2
is there any differenece between final_linkdb1 and final_linkdb2 ? I
mean Is merge operation is loosless in this case?
It should be - if it's not then it's a bug that needs to be fixed.
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com