[ 
https://issues.apache.org/jira/browse/NUTCH-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13215583#comment-13215583
 ] 

Ferdy Galema commented on NUTCH-965:
------------------------------------

Ok that's it, I have reverted the changes completely. I am not sure what your 
cause is exactly, but I give you benefit of the doubt. Trunk and nutchgora are 
back to their previous states. Sorry for the inconvenience.

Lewis, could you reopen this issue. Have my mind on some other matters now, but 
I will look later back at this one.
                
> Skip parsing for truncated documents
> ------------------------------------
>
>                 Key: NUTCH-965
>                 URL: https://issues.apache.org/jira/browse/NUTCH-965
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Alexis
>            Assignee: Lewis John McGibbney
>             Fix For: nutchgora, 1.5
>
>         Attachments: NUTCH-965-v2.patch, NUTCH-965-v3-nutchgora.txt, 
> NUTCH-965-v3-trunk.txt, parserJob.patch
>
>
> The issue you're likely to run into when parsing truncated FLV files is 
> described here:
> http://www.mail-archive.com/[email protected]/msg01880.html
> The parser library gets stuck in infinite loop as it encounters corrupted 
> data due to for example truncating big binary files at fetch time.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to