I was trying to execute intranet crawling with nutch-0.7 code.
I did all the settings in config files (similar to nutch-0.6)
Inspite of these configurations it's giving me the following error:
Exception in thread "main" java.lang.ExceptionInInitializerError
at
org.apache.nutch.db.WebDBInjector.addPage(WebDBInjector.java:437)
at
org.apache.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:378)
at
org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:134)
Caused by: java.lang.RuntimeException: org.apache.nutch.net.URLFilter
not found.
at org.apache.nutch.net.URLFilters.<clinit>(URLFilters.java:44)
Has any one faced such issue??? Plz let me know if you have solution for
this..or do I need to modify any more configuration file other than
crawl-urlfilter.txt & nutch-site.xml ???