hi,
i'm performing a RECRAWL using the recrawl.sh script, and i had this error when
inverting the links:
FATAL crawl.LinkDb - LinkDb: java.io.IOException: lock file
crawl/linkdb/.locked already exists
echo "----- Invert Links (Step 4 of $steps) -----"
$NUTCH_HOME/bin/nutch invertlinks $crawl/linkdb $crawl/segments/*
i understood that the linkdb already exists (because of the last crawl). my
question is: should i delete or backup the old linkdb (at every recrawl) before
iverting links ?
_________________________________________________________________
Eligible CDN College & University students can upgrade to Windows 7 before Jan
3 for only $39.99. Upgrade now!
http://go.microsoft.com/?linkid=9691819