[ 
https://issues.apache.org/jira/browse/NUTCH-2715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel resolved NUTCH-2715.
------------------------------------
    Resolution: Fixed

Fixed with NUTCH-2716 ([PR #454|https://github.com/apache/nutch/pull/454]). 
Thanks, [~yossi]!

> WARCExporter fails on large records
> -----------------------------------
>
>                 Key: NUTCH-2715
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2715
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.15
>            Reporter: Yossi Tamari
>            Priority: Major
>             Fix For: 1.16
>
>
> com.martinkl.warc.WARCRecord throws an IllegalStateException when a single 
> line is over 10,000 bytes. Since this exception is not caught in 
> WARCExporter, it fails the whole export.
> I doubt that validity of the limitation in WARCRecord, but regardless, I 
> think WARCExporter should catch the exception and skip to the next record.
> (See also [https://github.com/ept/warc-hadoop/issues/5])



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to