I've been reviewing the four different merge commands (as of nutch v0.9):
$ nutch | grep merg
mergedb merge crawldb-s, with optional filtering
mergesegs merge several segments, with optional filtering and slicing
mergelinkdb merge linkdb-s, with optional filtering
Hi
On 7/16/07, Kai_testing Middleton [EMAIL PROTECTED] wrote:
I've been reviewing the four different merge commands (as of nutch v0.9):
$ nutch | grep merg
mergedb merge crawldb-s, with optional filtering
mergesegs merge several segments, with optional filtering and
/nutch updatedb $d/crawldb $s
bin/nutch invertlinks $d/linkdb $d/segments
bin/nutch index $d/indexes $d/crawldb $d/linkdb $s
bin/nutch dedup $d/indexes
bin/nutch merge $d/index $d/indexes
--- end ---
So once fetch operation is terminated then the rest of the tasks is
executed anyway (updatedb
I merged my index and its off my nutch dir... so i have
index, segments and db if that helps?
On Thu, 8 Sep 2005 09:15:33 -0400
Jay Pound [EMAIL PROTECTED] wrote:
when I merge the index where do I put it? does it still
need to be in the segments folder? I've merged it, and
tried to start