Github user jskora commented on the issue:
https://github.com/apache/nifi/pull/252
So, the problems you had with the JPG file are two separate issues.
- The default `BodyContentHandler` passed to the parser can only handle
100,000 bytes, so any file larger than that produces that message and is only
partially parsed. **_I fixed this by adding an optional `Content Buffer Size`
property to allow that to be increased or set to unlimited and related tests._**
- The Tika JPEG parser appears to have a reference error. It references
the class `com.drew.lang.BufferReader` but that does not appear to be in the
Tika package or the version of the Drew Noakes Metadata Extractor package that
Tika references. **_I'm looking into whether updating to a newer Tika will fix
this or if we have dependency conflicts causing the problem._**
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---