Tim Allison created NUTCH-3020:
----------------------------------

             Summary: ParseSegment should check for protocol's flags for 
truncation
                 Key: NUTCH-3020
                 URL: https://issues.apache.org/jira/browse/NUTCH-3020
             Project: Nutch
          Issue Type: Task
            Reporter: Tim Allison


As discussed on the user list, several protocols can identify when a fetch has 
been truncated. ParseSegment only checks for the number of bytes fetched vs the 
http length header (if it exists). We should modify ParseSegment to check for 
notification of truncation from the protocols.

I noticed this specifically with okhttp, but other protocols may flag 
truncation as well.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to