[ 
https://issues.apache.org/jira/browse/NUTCH-2562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gerard Bouchar updated NUTCH-2562:
----------------------------------
    Description: 
While reading chunked content, if the content size becomes larger than 
http.getMaxContent(), protocol-http does not simply stop and truncate the 
content. Instead, it tries to read a new chunk before having read the previous 
one completely, resulting in a 'bad chunk length' error.

 

See: 
https://github.com/apache/nutch/blob/master/src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/HttpResponse.java#L440-L442

  was: While reading chunked content, if the content size becomes larger than 
http.getMaxContent(), protocol-http does not simply stop and truncate the 
content. Instead, it tries to read a new chunk before having read the previous 
one completely, resulting in a 'bad chunk length' error.


> protocol-http fails to read large chunked HTTP responses
> --------------------------------------------------------
>
>                 Key: NUTCH-2562
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2562
>             Project: Nutch
>          Issue Type: Sub-task
>            Reporter: Gerard Bouchar
>            Priority: Major
>
> While reading chunked content, if the content size becomes larger than 
> http.getMaxContent(), protocol-http does not simply stop and truncate the 
> content. Instead, it tries to read a new chunk before having read the previous 
> one completely, resulting in a 'bad chunk length' error.
>
> See: 
> https://github.com/apache/nutch/blob/master/src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/HttpResponse.java#L440-L442
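
For illustration only, the behaviour the description asks for (stop and truncate once the limit is reached, and never parse a new chunk-size line while bytes of the current chunk are still unread) might look roughly like the sketch below. This is not the Nutch HttpResponse code: the class ChunkedReadSketch, its methods readLine and readChunked, and the convention that a negative maxContent means "no limit" are all assumptions made up for this example.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

/** Hypothetical sketch of truncating a chunked body; not taken from Nutch. */
class ChunkedReadSketch {

  /** Reads one line up to LF and returns it without the CR/LF terminator. */
  static String readLine(InputStream in) throws IOException {
    StringBuilder sb = new StringBuilder();
    int c;
    while ((c = in.read()) != -1 && c != '\n') {
      if (c != '\r') {
        sb.append((char) c);
      }
    }
    return sb.toString();
  }

  /**
   * Decodes a chunked body and keeps at most maxContent bytes.
   * Assumption of this sketch: a negative maxContent means "no limit".
   */
  static byte[] readChunked(InputStream in, int maxContent) throws IOException {
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    byte[] buf = new byte[4096];
    while (true) {
      // A chunk-size line is only parsed at a chunk boundary.
      String line = readLine(in);
      int semi = line.indexOf(';'); // ignore optional chunk extensions
      String hex = (semi < 0 ? line : line.substring(0, semi)).trim();
      int chunkLen;
      try {
        chunkLen = Integer.parseInt(hex, 16);
      } catch (NumberFormatException e) {
        throw new IOException("bad chunk length: " + line);
      }
      if (chunkLen < 0) {
        throw new IOException("bad chunk length: " + line);
      }
      if (chunkLen == 0) {
        break; // last chunk: end of body
      }
      int remaining = chunkLen;
      while (remaining > 0) {
        int n = in.read(buf, 0, Math.min(buf.length, remaining));
        if (n == -1) {
          throw new IOException("unexpected end of stream inside a chunk");
        }
        remaining -= n;
        // Keep only as many bytes as still fit under the limit.
        int room = maxContent < 0 ? n : Math.min(n, maxContent - out.size());
        out.write(buf, 0, room);
        if (maxContent >= 0 && out.size() >= maxContent) {
          // Limit reached: return the truncated content right away instead
          // of going back to parse a chunk-size line mid-chunk.
          return out.toByteArray();
        }
      }
      readLine(in); // consume the CRLF that follows the chunk data
    }
    return out.toByteArray();
  }
}
```

With the body "4\r\nWiki\r\n5\r\npedia\r\n0\r\n\r\n" and a limit of 6, this sketch returns the first six bytes ("Wikipe") rather than raising an error. Returning as soon as the limit is hit is a design choice of the sketch: the rest of the stream is left unread, which is acceptable only if the connection is not reused afterwards.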



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
