[ 
https://issues.apache.org/jira/browse/NUTCH-2562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471152#comment-16471152
 ] 

ASF GitHub Bot commented on NUTCH-2562:
---------------------------------------

sebastian-nagel opened a new pull request #329: NUTCH-2562 protocol-http fails 
to read large chunked HTTP responses
URL: https://github.com/apache/nutch/pull/329
 
 
   - if http.content.limit is reached, skip the remaining chunked content,
     including any headers in the trailer
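
The behaviour described above can be illustrated with a minimal,
self-contained sketch. This is not the code from the pull request: the class
ChunkedReaderSketch and the methods readLine and readChunked are hypothetical
names chosen for illustration, and the maxContent parameter only stands in
for http.content.limit (a negative value is assumed to mean "no limit"). The
point it shows is that once the limit is reached the reader returns the
truncated content right away, rather than trying to parse a chunk-size line
from the middle of a partially read chunk.

```java
import java.io.ByteArrayOutputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical illustration, not the actual Nutch HttpResponse code.
class ChunkedReaderSketch {

  /** Reads one line (terminated by LF or CRLF) and returns it without the line ending. */
  static String readLine(InputStream in) throws IOException {
    StringBuilder sb = new StringBuilder();
    int c;
    while ((c = in.read()) != -1) {
      if (c == '\n') return sb.toString();
      if (c != '\r') sb.append((char) c);
    }
    if (sb.length() == 0) throw new EOFException("unexpected end of stream");
    return sb.toString();
  }

  /**
   * Decodes a chunked body. maxContent stands in for http.content.limit;
   * a negative value is assumed to mean "no limit".
   */
  static byte[] readChunked(InputStream in, int maxContent) throws IOException {
    ByteArrayOutputStream content = new ByteArrayOutputStream();
    byte[] buf = new byte[4096];
    while (true) {
      // Chunk-size line: hex length, optionally followed by ";extension".
      String sizeLine = readLine(in);
      int semi = sizeLine.indexOf(';');
      if (semi >= 0) sizeLine = sizeLine.substring(0, semi);
      int remaining;
      try {
        remaining = Integer.parseInt(sizeLine.trim(), 16);
      } catch (NumberFormatException e) {
        throw new IOException("bad chunk length: " + sizeLine);
      }
      if (remaining == 0) break; // last chunk

      while (remaining > 0) {
        if (maxContent >= 0 && content.size() >= maxContent) {
          // Limit reached: stop here and skip the rest of the body,
          // including later chunks and any trailer headers.
          return content.toByteArray();
        }
        int want = Math.min(buf.length, remaining);
        if (maxContent >= 0) want = Math.min(want, maxContent - content.size());
        int n = in.read(buf, 0, want);
        if (n == -1) throw new EOFException("stream ended inside a chunk");
        content.write(buf, 0, n);
        remaining -= n;
      }
      readLine(in); // consume the CRLF that follows the chunk data
    }
    // Not truncated: consume trailer headers up to the terminating empty line.
    while (!readLine(in).isEmpty()) { /* ignore trailer header */ }
    return content.toByteArray();
  }
}
```

With a body made of the chunks "hello" and " world" and a limit of 8 bytes,
readChunked returns the first 8 bytes and never attempts to parse another
chunk length; with a negative limit it returns the complete body.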

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> protocol-http fails to read large chunked HTTP responses
> --------------------------------------------------------
>
>                 Key: NUTCH-2562
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2562
>             Project: Nutch
>          Issue Type: Sub-task
>    Affects Versions: 1.14
>            Reporter: Gerard Bouchar
>            Priority: Major
>             Fix For: 1.15
>
>
> While reading chunked content, if the content size becomes larger than 
> http.getMaxContent(), instead of just stopping and truncating the content, 
> protocol-http tries to read a new chunk before having read the previous one 
> completely, resulting in a 'bad chunk length' error.
>  
> See: 
> https://github.com/apache/nutch/blob/master/src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/HttpResponse.java#L440-L442



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
