I am trying to run an intranet crawl with the nutch-0.7 code.

I have made all the settings in the config files (the same way as for nutch-0.6).

In spite of this configuration, I get the following error:

 

Exception in thread "main" java.lang.ExceptionInInitializerError
        at org.apache.nutch.db.WebDBInjector.addPage(WebDBInjector.java:437)
        at org.apache.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:378)
        at org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
        at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:134)
Caused by: java.lang.RuntimeException: org.apache.nutch.net.URLFilter not found.
        at org.apache.nutch.net.URLFilters.<clinit>(URLFilters.java:44)

 

 

Has anyone faced this issue? Please let me know if you have a solution,
or whether I need to modify any configuration file other than
crawl-urlfilter.txt and nutch-site.xml.

 
