My nutch crawl is hanging here: (Any ideas why?)

 

2010-11-04 13:25:55,616 INFO  crawl.Injector - Injector: starting at
2010-11-04 13:25:55

2010-11-04 13:25:55,617 INFO  crawl.Injector - Injector: crawlDb:
/lib/nutch/crawl/crawldb

2010-11-04 13:25:55,617 INFO  crawl.Injector - Injector: urlDir:
/lib/nutch/seed

2010-11-04 13:25:55,618 INFO  crawl.Injector - Injector: Converting injected
urls to crawl db entries.

2010-11-04 13:25:56,800 ERROR crawl.Generator - Generator:
java.io.IOException: lock file /lib/nutch/crawl/crawldb/.locked already
exists.

                at
org.apache.nutch.util.LockUtil.createLockFile(LockUtil.java:44)

                at
org.apache.nutch.crawl.Generator.generate(Generator.java:474)

                at org.apache.nutch.crawl.Generator.run(Generator.java:692)

                at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)

                at org.apache.nutch.crawl.Generator.main(Generator.java:648)

 

2010-11-04 13:25:58,540 INFO  segment.SegmentMerger - Merging 1 segments to
/lib/nutch/crawl/MERGEDsegments/20101104132558

2010-11-04 13:25:58,543 WARN  segment.SegmentMerger - Input dir
/lib/nutch/crawl/segments/* doesn't exist, skipping.

2010-11-04 13:25:58,543 INFO  segment.SegmentMerger - SegmentMerger: using
segment data from: content crawl_generate crawl_fetch crawl_parse parse_data
parse_text

2010-11-04 13:25:58,575 WARN  mapred.JobClient - Use GenericOptionsParser
for parsing the arguments. Applications should implement Tool for the same.

2010-11-04 13:25:59,625 INFO  crawl.LinkDb - LinkDb: starting at 2010-11-04
13:25:59

2010-11-04 13:25:59,626 INFO  crawl.LinkDb - LinkDb: linkdb:
/lib/nutch/crawl/linkdb

2010-11-04 13:25:59,626 INFO  crawl.LinkDb - LinkDb: URL normalize: true

2010-11-04 13:25:59,626 INFO  crawl.LinkDb - LinkDb: URL filter: true

2010-11-04 13:25:59,635 INFO  crawl.LinkDb - LinkDb: adding segment:
/lib/nutch/crawl/segments/*

2010-11-04 13:26:00,584 INFO  solr.SolrIndexer - SolrIndexer: starting at
2010-11-04 13:26:00

2010-11-04 13:26:00,652 INFO  indexer.IndexerMapReduce - IndexerMapReduce:
crawldb: /lib/nutch/crawl/crawldb

2010-11-04 13:26:00,652 INFO  indexer.IndexerMapReduce - IndexerMapReduce:
linkdb: /lib/nutch/crawl/linkdb

2010-11-04 13:26:00,652 INFO  indexer.IndexerMapReduce - IndexerMapReduces:
adding segment: /lib/nutch/crawl/segments/*

Reply via email to