Hi, I'm curious whether you have found a solution yet, as I'm having the exact same problem. When I start the crawl, the entered URL is parsed perfectly, but for all the links on that site I get:

org.apache.nutch.parse.ParseException: Unable to successfully parse content

I'm using Nutch 1.5. Thanks!

