Hi Benjamin,

It may be malformed XML that we don't handle properly. If the log has a full stack trace, that would be very helpful. If not, can you get one?

Thanks!
Karl

Sent from my Windows Phone
------------------------------
From: Benjamin Brandmeier
Sent: 10/30/2013 6:51 AM
To: [email protected]
Subject: RSS Crawl -> NullPointerException

Hi everyone,

I'm facing a problem with the RSS connector. The feed I'm crawling is http://blog.fme.de/feed

An NPE occurs at processing time. After some debugging, I found the following:

The variable previousContext is null in the method

public final void endElement(String namespace, String localName, String qName)

The qName parameter is content:encoded, but there are many tags like this in the feed, so I'm not sure at which point the error occurs. The null previousContext is then assigned to currentContext.

As the stack trace shows, the error happens at org.apache.manifoldcf.core.fuzzyml.XMLFuzzyHierarchicalParseState.cleanup(XMLFuzzyHierarchicalParseState.java:86). At this point, currentContext.cleanup(); is called with currentContext = null.

manifoldcf.log is attached.

Any thoughts on this? I tried different settings regarding dechromed content.

Benjamin
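
For readers following the thread, the failure mode described above can be illustrated with a minimal sketch. This is not the ManifoldCF source: the class ParseStateSketch, its nested Context class, and the cleanupGuarded method are hypothetical names invented for illustration, and the real parser may manage its contexts differently. The sketch only shows how assigning a null previousContext to currentContext in endElement leads to a NullPointerException in a later, unguarded cleanup() call.

```java
// Hypothetical illustration only; these are not the ManifoldCF classes.
class ParseStateSketch {

    // Stand-in for a parsing context that links back to its parent.
    static final class Context {
        final String qName;
        final Context previous; // null for the outermost context
        boolean cleaned = false;

        Context(String qName, Context previous) {
            this.qName = qName;
            this.previous = previous;
        }

        void cleanup() {
            cleaned = true;
        }
    }

    private Context currentContext;

    void startElement(String qName) {
        // Push a new context whose parent is the current one.
        currentContext = new Context(qName, currentContext);
    }

    void endElement(String qName) {
        // Pop: previousContext is null when the outermost context is closed.
        Context previousContext = currentContext.previous;
        currentContext = previousContext;
    }

    // Unguarded cleanup: throws NullPointerException if currentContext is null.
    void cleanupUnguarded() {
        currentContext.cleanup();
    }

    // One possible guard (an assumption, not the project's actual fix).
    // Returns true if a context was cleaned up, false if there was none.
    boolean cleanupGuarded() {
        if (currentContext == null) {
            return false;
        }
        currentContext.cleanup();
        return true;
    }

    public static void main(String[] args) {
        ParseStateSketch state = new ParseStateSketch();
        state.startElement("content:encoded");
        state.endElement("content:encoded"); // currentContext is now null

        try {
            state.cleanupUnguarded();
            System.out.println("no exception");
        } catch (NullPointerException e) {
            System.out.println("NullPointerException in cleanup()");
        }

        System.out.println("guarded cleanup ran: " + state.cleanupGuarded());
    }
}
```

Whether a null check is the right remedy, or whether the parser should never reach this state for well-formed input, depends on the actual code and the full stack trace requested above.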
