Simone,

Would you mind opening a JIRA issue for this, attaching your patch, and granting it to the ASF? I know the patch is fairly small, but an issue makes it easier to track progress, link to svn commits, etc.

Thanks

Julien

On 23 August 2011 07:53, Simone Frenzel <[email protected]> wrote:

> ---------- Forwarded message ----------
> From: Simone Frenzel <[email protected]>
> Date: 2011/8/22
> Subject: Patch for httpResponse
> To: [email protected]
>
> Hi,
>
> I tested Nutch on different web pages. For short zipped pages it
> throws an IOException:
>
> java.io.IOException: unzipBestEffort returned null
> 2011-08-19 17:06:55,190 ERROR httpclient.Http - at org.apache.nutch.protocol.http.api.HttpBase.processGzipEncoded(HttpBase.java:310)
> 2011-08-19 17:06:55,191 ERROR httpclient.Http - at org.apache.nutch.protocol.httpclient.HttpResponse.<init>(HttpResponse.java:163)
> 2011-08-19 17:06:55,191 ERROR httpclient.Http - at org.apache.nutch.protocol.httpclient.Http.getResponse(Http.java:154)
> 2011-08-19 17:06:55,191 ERROR httpclient.Http - at org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:138)
> 2011-08-19 17:06:55,191 ERROR httpclient.Http - at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:628)
>
> A small change to HttpResponse solves the problem. Now there is no
> longer a problem with zipped pages, BasicAuth and zipped pages ...
>
> The patch is attached.
>
> Greetings and thanks

--
Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

