Hi guys,
I am using Nutch 0.8.1, and for the past two days I have been getting the following exception: java.lang.IllegalStateException. It started after I applied the NUTCH-61 patch (Adaptive Re-crawl Interval).

In short, this is what happens: I am trying to crawl XML files (both locally and remotely on a web server). Once a file is crawled, the fetcher sends it to the appropriate parser. This is where the exception is thrown: the parser launches but does not do anything with the file. If anybody has dealt with this type of error, please let me know how to get rid of it.

Below is an extract from my log file:

2007-01-18 14:16:16,371 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,371 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,371 WARN fetcher.Fetcher - Error parsing: file:/C:/880254/8802_583254_20051006_12.xml: failed(2,200): java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:16,371 WARN fetcher.Fetcher - Error parsing: file:/C:/880254/8802_583254_20051006_11.xml: failed(2,200): java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:16,387 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,403 WARN fetcher.Fetcher - Error parsing: file:/C:/880254/8802_583254_20051006_13.xml: failed(2,200): java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:16,403 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,403 WARN fetcher.Fetcher - Error parsing: file:/C:/880254/8802_583254_20051006_14.xml: failed(2,200): java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:16,418 INFO parse.xml - XMLParser config path : ..
2007-01-18 14:16:16,418 WARN fetcher.Fetcher - Error parsing: file:/C:/880254/8802_583254_20051006_10.xml: failed(2,200): java.lang.IllegalStateException: Root element not set
2007-01-18 14:16:17,887 INFO fetcher.Fetcher - Fetcher: done

If a root element is not set within an XML file, a NullPointerException is thrown, not an IllegalStateException.
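One way to narrow this down outside of Nutch might be to check whether each file parses to a document that actually has a root element, using only the JDK's built-in XML parser. The following is a minimal sketch under that assumption. The class name `RootElementCheck` and the method `rootName` are made up for illustration and are not part of Nutch or of the XML parser plugin, which may read and parse the content differently.

```java
import java.io.ByteArrayInputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical standalone helper, not part of Nutch.
class RootElementCheck {

    // Returns the root element name, or null when the content is empty,
    // is not well-formed XML, or yields a document without a root element.
    static String rootName(byte[] xml) {
        if (xml == null || xml.length == 0) {
            return null; // nothing to parse
        }
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(new ByteArrayInputStream(xml));
            Element root = doc.getDocumentElement();
            return root == null ? null : root.getNodeName();
        } catch (Exception e) {
            // Parse or configuration failure: treat as "no usable root".
            return null;
        }
    }

    // Usage: java RootElementCheck file1.xml file2.xml ...
    public static void main(String[] args) throws Exception {
        for (String path : args) {
            byte[] content = Files.readAllBytes(Paths.get(path));
            String root = rootName(content);
            System.out.println(path + " -> "
                + (root == null
                    ? "no root element (empty or not well-formed)"
                    : "root <" + root + ">"));
        }
    }
}
```

If the files report a root element here but still fail inside the fetcher, that would suggest the parser is receiving different (possibly empty) content from what is on disk, rather than the files themselves being at fault.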
Can anyone shed some light on this error?

Thanks,
Armel

-------------------------------------------------
Armel T. Nene
iDNA Solutions
Tel: +44 (207) 257 6124
Mobile: +44 (788) 695 0483
http://blog.idna-solutions.com
