Github user jskora commented on the issue:

    https://github.com/apache/nifi/pull/252
  
    So, the problems you had with the JPG file are two separate issues.
    
    - The default `BodyContentHandler` passed to the parser can only handle 
100,000 bytes, so any file larger than that produces that message and is only 
partially parsed.  **_I fixed this by adding an optional `Content Buffer Size` 
property to allow that to be increased or set to unlimited and related tests._**
    - The Tika JPEG parser appears to have a reference error.  It references 
the class `com.drew.lang.BufferReader` but that does not appear to be in the 
Tika package or the version of the Drew Noakes Metadata Extractor package that 
Tika references.  **_I'm looking into whether updating to a newer Tika will fix 
this or if we have dependency conflicts causing the problem._**


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

Reply via email to