Hi guys,

 

I am using Nutch 0.8.1, for the past 2 days I have been getting the
following exception: Java.Lang.IllegalStateException. The exception started
after I implementing the Nutch-61 patch; Adaptive Re-crawl Interval. In
short, this happens:

 

I am trying to crawl XML files (locally and remotely on a web server), once
crawl, the fetcher sends the file to their processing parsers. This is where
the exception is thrown as the parsers launches but do not perform any
activity on the file. If anybody has dealt with this type error, please let
me know how to get rid of it. Below is an extract from my log file.

 

2007-01-18 14:16:16,371 INFO  parse.xml - XMLParser config path : ..

2007-01-18 14:16:16,371 INFO  parse.xml - XMLParser config path : ..

2007-01-18 14:16:16,371 WARN  fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_12.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set

2007-01-18 14:16:16,371 WARN  fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_11.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set

2007-01-18 14:16:16,387 INFO  parse.xml - XMLParser config path : ..

2007-01-18 14:16:16,403 WARN  fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_13.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set

2007-01-18 14:16:16,403 INFO  parse.xml - XMLParser config path : ..

2007-01-18 14:16:16,403 WARN  fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_14.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set

2007-01-18 14:16:16,418 INFO  parse.xml - XMLParser config path : ..

2007-01-18 14:16:16,418 WARN  fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_10.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set

2007-01-18 14:16:17,887 INFO  fetcher.Fetcher - Fetcher: done

 

If a root element is not set within an XML file, a nullpointer exception is
thrown not an illegalstateexception. #can anyone put some lights on this
error. 

 

Thanks.

 

Armel

 

-------------------------------------------------

Armel T. Nene

iDNA Solutions

Tel: +44 (207) 257 6124

Mobile: +44 (788) 695 0483 

 <http://blog.idna-solutions.com/> http://blog.idna-solutions.com

 

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to