Confusion in nutch-default between http.content.limit and file.content.limit
----------------------------------------------------------------------------

                 Key: NUTCH-900
                 URL: https://issues.apache.org/jira/browse/NUTCH-900
             Project: Nutch
          Issue Type: Improvement
    Affects Versions: 1.2
            Reporter: Markus Jelsma
            Priority: Trivial
             Fix For: 1.2


The http.content.limit and file.content.limit settings can be confusing and 
have fooled at least several users. The description element for these settings 
should be changed to reflect the difference between them so users won't be 
fooled that easy.
See also: 
http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html
 for a discussion.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to