[ 
https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma updated NUTCH-900:
--------------------------------

    Patch Info: [Patch Available]

> Confusion in nutch-default between http.content.limit and file.content.limit
> ----------------------------------------------------------------------------
>
>                 Key: NUTCH-900
>                 URL: https://issues.apache.org/jira/browse/NUTCH-900
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.2
>            Reporter: Markus Jelsma
>            Priority: Trivial
>             Fix For: 1.2
>
>         Attachments: NUTCH-900.MarkusJelsma.100908.patch.txt
>
>
> The http.content.limit and file.content.limit settings can be confusing and 
> have fooled at least several users. The description element for these 
> settings should be changed to reflect the difference between them so users 
> won't be fooled that easy.
> See also: 
> http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html
>  for a discussion.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to