[
https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-900:
--------------------------------
Fix Version/s: 2.0
Affects Version/s: 2.0
To be fixed in the trunk as well
> Confusion in nutch-default between http.content.limit and file.content.limit
> ----------------------------------------------------------------------------
>
> Key: NUTCH-900
> URL: https://issues.apache.org/jira/browse/NUTCH-900
> Project: Nutch
> Issue Type: Improvement
> Affects Versions: 1.2, 2.0
> Reporter: Markus Jelsma
> Priority: Trivial
> Fix For: 1.2, 2.0
>
> Attachments: NUTCH-900.MarkusJelsma.100908.patch.txt
>
>
> The http.content.limit and file.content.limit settings can be confusing and
> have fooled at least several users. The description element for these
> settings should be changed to reflect the difference between them so users
> won't be fooled that easy.
> See also:
> http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html
> for a discussion.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.