Hi guys,
I am using Nutch 0.8.1, for the past 2 days I have been getting the
following exception: Java.Lang.IllegalStateException. The exception started
after I implementing the Nutch-61 patch; Adaptive Re-crawl Interval. In
short, this happens:
I am trying to crawl XML files (locally and remotely on a web server), once
crawl, the fetcher sends the file to their processing parsers. This is where
the exception is thrown as the parsers launches but do not perform any
activity on the file. If anybody has dealt with this type error, please let
me know how to get rid of it. Below is an extract from my log file.
2007-01-18 14:16:16,371 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,371 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,371 WARN fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_12.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:16,371 WARN fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_11.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:16,387 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,403 WARN fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_13.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:16,403 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,403 WARN fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_14.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:16,418 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,418 WARN fetcher.Fetcher - Error parsing:
file:/C:/880254/8802_583254_20051006_10.xml: failed(2,200):
java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:17,887 INFO fetcher.Fetcher - Fetcher: done
If a root element is not set within an XML file, a nullpointer exception is
thrown not an illegalstateexception. #can anyone put some lights on this
error.
Thanks.
Armel
-------------------------------------------------
Armel T. Nene
iDNA Solutions
Tel: +44 (207) 257 6124
Mobile: +44 (788) 695 0483
<http://blog.idna-solutions.com/> http://blog.idna-solutions.com
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers