When I do a crawl I get this message.

run java in 
060110 195314 parsing
file:/downloads/nutt/nutch-0.7.1/conf/nutch-default.xml
060110 195318 parsing
file:/downloads/nutt/nutch-0.7.1/conf/crawl-tool.xml
060110 195318 parsing
file:/downloads/nutt/nutch-0.7.1/conf/nutch-site.xml
060110 195318 No FS indicated, using default:local
060110 195318 crawl started in: crawl.test
060110 195318 rootUrlFile = urls
060110 195318 threads = 10
060110 195318 depth = 3
060110 195320 Created webdb at
LocalFS,/downloads/nutt/nutch-0.7.1/crawl.test/db
060110 195320 Starting URL processing
060110 195321 Plugins: directory not found: plugins
Exception in thread "main" java.lang.ExceptionInInitializerError
   at java.lang.Class.initializeClass() (/usr/lib/libgcj.so.6.0.0)
   at org.apache.nutch.db.WebDBInjector.addPage(java.lang.String)
(Unknown Source)
   at org.apache.nutch.db.WebDBInjector.injectURLFile(java.io.File)
(Unknown Source)
   at org.apache.nutch.db.WebDBInjector.main(java.lang.String[])
(Unknown Source)
   at org.apache.nutch.tools.CrawlTool.main(java.lang.String[]) (Unknown
Source)
   at gnu.java.lang.MainThread.call_main() (/usr/lib/libgcj.so.6.0.0)
   at gnu.java.lang.MainThread.run() (/usr/lib/libgcj.so.6.0.0)
Caused by: java.lang.RuntimeException: org.apache.nutch.net.URLFilter
not found.
   at org.apache.nutch.net.URLFilters.<clinit>() (Unknown Source)
   at java.lang.Class.initializeClass() (/usr/lib/libgcj.so.6.0.0)
   ...6 more

Looks like It can't find the plugins directory
The nutch file has this in it.

if [ -d "$NUTCH_HOME/build/plugins" ]; then
  CLASSPATH=${CLASSPATH}:$NUTCH_HOME/build

But If I look a the structure of where nutch is unpacked, there is not
build directory?

andy


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_idv37&alloc_id865&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to