Hi, 

I am trying to run a Nutch crawl on Linux, but it hangs forever and I
don't see any error message.

I have pasted the log messages below. 

2015-08-31 16:07:48,765 INFO  crawl.Injector - Injector: starting at 2015-08-31 16:07:48
2015-08-31 16:07:48,766 INFO  crawl.Injector - Injector: crawlDb: /sites/TEMP/crawldb
2015-08-31 16:07:48,766 INFO  crawl.Injector - Injector: urlDir: ../seed_url
2015-08-31 16:07:48,766 INFO  crawl.Injector - Injector: Converting injected urls to crawl db entries.
2015-08-31 16:07:49,314 WARN  util.NativeCodeLoader - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
2015-08-31 16:07:49,432 WARN  snappy.LoadSnappy - Snappy native library not loaded
2015-08-31 16:07:51,006 INFO  regex.RegexURLNormalizer - can't find rules for scope 'inject', using default
2015-08-31 16:07:51,154 INFO  crawl.Injector - Injector: overwrite: false
2015-08-31 16:07:51,154 INFO  crawl.Injector - Injector: update: false

I also compiled and ran the Nutch source code on my Windows machine (via
Cygwin), and there it runs without any issues. Can someone help me fix this?

OS details: Linux 2.6.32-279.el6.x86_64 

Thanks, 
Barani



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Nutch-crawler-hangs-forever-Linux-OS-tp4226491.html
Sent from the Nutch - User mailing list archive at Nabble.com.
