Re: why did nutch0.8.1 fetch empty content from certain sites?

Jason Culverhouse Thu, 08 Feb 2007 09:57:44 -0800

It could b related to http://issues.apache.org/jira/browse/NUTCH-374when the property http.content.limit is set to -1and the data from the server is gzip'ed the content is not decodedproperly.

Jason

On Feb 8, 2007, at 6:45 AM, wangxu wrote:

wangxu wrote:
when I fetched  some certain sites,
I got empty content,contentType,but the fetch status was"fetch_success" and the metadata was sometimes not empty.
how does website configure itself to achieve this?
any methods to avoid this situation?
I used agent-name:
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2; MyIE2; .NET CLR1.1.4322)
sorry,empty content,parsedtext/parseddata

Re: why did nutch0.8.1 fetch empty content from certain sites?

Reply via email to