Confusion in nutch-default between http.content.limit and file.content.limit
----------------------------------------------------------------------------
Key: NUTCH-900
URL: https://issues.apache.org/jira/browse/NUTCH-900
Project: Nutch
Issue Type: Improvement
Affects Versions: 1.2
Reporter: Markus Jelsma
Priority: Trivial
Fix For: 1.2
The http.content.limit and file.content.limit settings are easy to confuse
and have already misled several users. The description element of each
setting should be changed to spell out the difference between the two, so
that users are not misled so easily.
For a discussion, see also:
http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html
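To illustrate the distinction, here is a rough sketch of how the two
properties might be overridden side by side in nutch-site.xml. It assumes
that http.content.limit applies only to content fetched over the http
protocol, that file.content.limit applies only to content read through the
file protocol, and that a negative value disables truncation. The values and
the description wording are illustrative only; they are not the text
committed for this issue.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Assumed: applies only to documents fetched over the http protocol. -->
  <property>
    <name>http.content.limit</name>
    <!-- Illustrative value: truncate HTTP downloads after 1 MiB. -->
    <value>1048576</value>
    <description>Length limit, in bytes, for content downloaded using the
    http protocol. Not to be confused with file.content.limit.
    </description>
  </property>

  <!-- Assumed: applies only to documents read through the file protocol. -->
  <property>
    <name>file.content.limit</name>
    <!-- Illustrative value: a negative number is assumed to mean no truncation. -->
    <value>-1</value>
    <description>Length limit, in bytes, for content read using the file
    protocol. Not to be confused with http.content.limit.
    </description>
  </property>
</configuration>
```

With settings like these, raising only http.content.limit would leave
locally crawled files truncated at whatever file.content.limit allows, and
vice versa, which is the kind of mix-up the improved descriptions are meant
to prevent.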