Yes, this is currently a bug in trunk: the searcher throws an error when
the content for a given URL is null. A fix is in progress.
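
For illustration only, here is a minimal sketch of the kind of null check that avoids this sort of failure. It is not the actual Nutch code or the pending patch: the class NullSafeSummary, its in-memory map, and the getSummary(url, maxLength) signature are hypothetical stand-ins, assuming the summary is built from stored text that may be missing for some URLs.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a segment store; not part of Nutch.
class NullSafeSummary {
    // Maps a URL to its stored text. The value may be null when nothing was stored.
    private final Map<String, String> storedText = new HashMap<>();

    void put(String url, String text) {
        storedText.put(url, text);
    }

    // Returns at most maxLength characters of the stored text,
    // or an empty string when the URL has no content.
    String getSummary(String url, int maxLength) {
        String text = storedText.get(url);
        if (text == null) {
            // Without this guard, text.length() below would throw
            // a NullPointerException.
            return "";
        }
        return text.length() <= maxLength ? text : text.substring(0, maxLength);
    }

    public static void main(String[] args) {
        NullSafeSummary store = new NullSafeSummary();
        store.put("http://example.com/a", "Nutch is an open source web crawler.");
        store.put("http://example.com/b", null); // URL with null content

        System.out.println("[" + store.getSummary("http://example.com/a", 5) + "]");
        System.out.println("[" + store.getSummary("http://example.com/b", 5) + "]");
    }
}
```

With a guard like this, a hit whose content is null yields an empty summary instead of aborting the whole search.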
Dennis
Alexis Votta wrote:
I have updated my copy of Nutch from Subversion to revision 597822.
With minimal settings (nutch-site.xml and crawl-urlfilter.txt), a
crawl with the following command runs fine:
bin/nutch crawl urls -dir crawl -depth 2 -topN 10 -threads 3
But when I try to verify the crawl with this command:
bin/nutch org.apache.nutch.searcher.NutchBean alexis
I get this error:
Total hits: 11
Exception in thread "main" java.lang.RuntimeException: java.lang.NullPointerException
    at org.apache.nutch.searcher.FetchedSegments.getSummary(FetchedSegments.java:204)
    at org.apache.nutch.searcher.NutchBean.getSummary(NutchBean.java:342)
    at org.apache.nutch.searcher.NutchBean.main(NutchBean.java:393)
Caused by: java.lang.NullPointerException
    at org.apache.nutch.searcher.FetchedSegments.getSummary(FetchedSegments.java:159)
    at org.apache.nutch.searcher.FetchedSegments$SummaryThread.run(FetchedSegments.java:177)
Is this some bug in trunk?