I checked the url you privided with parsechecker and they are parsed correctly. You can check yourself by doing bin/nutch parsechecker yoururl. In you implementation can you check if segment dir has correct permission.
Alex. -----Original Message----- From: CarinaBambina <[email protected]> To: user <[email protected]> Sent: Tue, Oct 9, 2012 10:03 am Subject: Re: Error parsing html i now also tried using all source files itself instead of the nutch.jar, but nothing changed. Is there anyone who has an idea what the reason for this error might be? Or at least where and what i should look for? Any hint?! Thanks in advance! -- View this message in context: http://lucene.472066.n3.nabble.com/Error-parsing-html-tp3994699p4012755.html Sent from the Nutch - User mailing list archive at Nabble.com.

