Hi,
After I input the command:
bin/nutch crawl urls -dir crawled -depth 3
it shows:
crawl started in: crawled
rootUrlDir = urls
threads = 10
depth = 3
Injector: starting
Injector: crawlDb: crawled/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Injector: Merging injected urls into crawl db.
Injector: done
Generator: Selecting best-scoring urls due for fetch.
Generator: starting
Generator: segment: crawled/segments/20071115180641
Generator: filtering: false
Generator: topN: 2147483647
Generator: Partitioning selected urls by host, for politeness.
Generator: done.
Fetcher: starting
Fetcher: segment: crawled/segments/20071115180641
Fetcher: done
CrawlDb update: starting
CrawlDb update: db: crawled/crawldb
CrawlDb update: segments: [crawled/segments/20071115180641]
CrawlDb update: additions allowed: true
CrawlDb update: URL normalizing: true
CrawlDb update: URL filtering: true
CrawlDb update: Merging segment data into db.
CrawlDb update: done
Generator: Selecting best-scoring urls due for fetch.
Generator: starting
Generator: segment: crawled/segments/20071115180846
Generator: filtering: false
Generator: topN: 2147483647
Generator: 0 records selected for fetching, exiting ...
Stopping at depth=1 - no more URLs to fetch.
LinkDb: starting
LinkDb: linkdb: crawled/linkdb
LinkDb: URL normalize: true
LinkDb: URL filter: true
LinkDb: adding segment:
hdfs://node01:9000/user/nutch/crawled/segments/20071115180641
LinkDb: done
Indexer: starting
Indexer: linkdb: crawled/linkdb
Indexer: adding segment:
hdfs://node01:9000/user/nutch/crawled/segments/20071115180641
Indexer: done
Dedup: starting
Dedup: adding indexes in: crawled/indexes
Dedup: done
merging indexes to: crawled/index
Adding hdfs://node01:9000/user/nutch/crawled/indexes/part-00000
Adding hdfs://node01:9000/user/nutch/crawled/indexes/part-00001
done merging
crawl finished: crawled
Then I input the command:
$bin/hadoop dfs -copyToLocal crawled /nutch/home/crawl
$bin/nutch org.apache.nutch.searcher.NutchBean hit
it shows the error:
Exception in thread "main" java.lang.IllegalArgumentException: URI is not
absolute
at java.io.File.<init>(File.java:361)
at
org.apache.nutch.searcher.IndexSearcher.getDirectory(IndexSearcher.java:87)
at org.apache.nutch.searcher.IndexSearcher.<init>(IndexSearcher.java:73)
at org.apache.nutch.searcher.NutchBean.init(NutchBean.java:117)
at org.apache.nutch.searcher.NutchBean.<init>(NutchBean.java:104)
at org.apache.nutch.searcher.NutchBean.<init>(NutchBean.java:82)
at org.apache.nutch.searcher.NutchBean.main(NutchBean.java:386)
How Can I slove this problem?