Github user JPercivall commented on the issue:
https://github.com/apache/nifi/pull/252
I set this up to analyze every file in my downloads folder to see what would
happen and which errors came up. I found a couple of interesting ones. The
first suggests that the 100000-character write limit should be a configurable
property. The second looks like a problem extracting metadata from JPGs. You
can recreate the JPG problem by setting up a unit test that analyzes simple.jpg
(in test resources):
org.apache.tika.sax.WriteOutContentHandler$WriteLimitReachedException: Your
document contained more than 100000 characters, and so your requested limit has
been reached. To receive the full text of the document, increase your limit.
(Text up to the limit is however available).
2016-06-15 15:33:22,677 ERROR [Timer-Driven Process Thread-7] o.a.n.p.media.ExtractMediaMetadata ExtractMediaMetadata[id=c4e52258-dac5-43b1-b951-2d7f9a7ebf6c] ExtractMediaMetadata[id=c4e52258-dac5-43b1-b951-2d7f9a7ebf6c] failed to process due to java.lang.NoClassDefFoundError: com/drew/lang/BufferReader; rolling back session: java.lang.NoClassDefFoundError: com/drew/lang/BufferReader
2016-06-15 15:33:22,679 ERROR [Timer-Driven Process Thread-7] o.a.n.p.media.ExtractMediaMetadata
java.lang.NoClassDefFoundError: com/drew/lang/BufferReader
    at org.apache.tika.parser.jpeg.JpegParser.parse(JpegParser.java:56) ~[na:na]
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256) ~[na:na]
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256) ~[na:na]
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120) ~[na:na]
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:136) ~[na:na]
    at org.apache.nifi.processors.media.ExtractMediaMetadata.tika_parse(ExtractMediaMetadata.java:239) ~[na:na]
    at org.apache.nifi.processors.media.ExtractMediaMetadata.access$000(ExtractMediaMetadata.java:71) ~[na:na]
    at org.apache.nifi.processors.media.ExtractMediaMetadata$1.process(ExtractMediaMetadata.java:215) ~[na:na]
    at org.apache.nifi.controller.repository.StandardProcessSession.read(StandardProcessSession.java:1806) ~[nifi-framework-core-0.6.0-SNAPSHOT.jar:0.6.0-SNAPSHOT]
    at org.apache.nifi.controller.repository.StandardProcessSession.read(StandardProcessSession.java:1777) ~[nifi-framework-core-0.6.0-SNAPSHOT.jar:0.6.0-SNAPSHOT]
    at org.apache.nifi.processors.media.ExtractMediaMetadata.onTrigger(ExtractMediaMetadata.java:211) ~[na:na]
    at org.apache.nifi.processor.AbstractProcessor.onTrigger(AbstractProcessor.java:27) ~[nifi-api-0.6.0-SNAPSHOT.jar:0.6.0-SNAPSHOT]
    at org.apache.nifi.controller.StandardProcessorNode.onTrigger(StandardProcessorNode.java:1139) [nifi-framework-core-0.6.0-SNAPSHOT.jar:0.6.0-SNAPSHOT]
    at org.apache.nifi.controller.tasks.ContinuallyRunProcessorTask.call(ContinuallyRunProcessorTask.java:139) [nifi-framework-core-0.6.0-SNAPSHOT.jar:0.6.0-SNAPSHOT]
    at org.apache.nifi.controller.tasks.ContinuallyRunProcessorTask.call(ContinuallyRunProcessorTask.java:49) [nifi-framework-core-0.6.0-SNAPSHOT.jar:0.6.0-SNAPSHOT]
    at org.apache.nifi.controller.scheduling.TimerDrivenSchedulingAgent$1.run(TimerDrivenSchedulingAgent.java:124) [nifi-framework-core-0.6.0-SNAPSHOT.jar:0.6.0-SNAPSHOT]
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_74]
    at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_74]
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_74]
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_74]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_74]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_74]
    at java.lang.Thread.run(Thread.java:745) [na:1.8.0_74]
Caused by: java.lang.ClassNotFoundException: com.drew.lang.BufferReader
    at java.net.URLClassLoader.findClass(URLClassLoader.java:381) ~[na:1.8.0_74]
    at java.lang.ClassLoader.loadClass(ClassLoader.java:424) ~[na:1.8.0_74]
    at java.lang.ClassLoader.loadClass(ClassLoader.java:357) ~[na:1.8.0_74]
    ... 23 common frames omitted
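
The ClassNotFoundException above means the class loader in use could not find
com.drew.lang.BufferReader. One possible cause (an assumption, not confirmed
here) is that the library providing that class is missing from the bundle or is
present in a version that does not contain it. A minimal sketch for checking
whether a class is visible to a given class loader, using only the JDK; the
ClasspathCheck class and isLoadable helper are hypothetical names, not part of
NiFi or Tika:

```java
public class ClasspathCheck {

    /**
     * Returns true if the named class can be located and loaded by the given
     * class loader. Hypothetical helper for diagnosis only.
     */
    public static boolean isLoadable(String className, ClassLoader loader) {
        try {
            // initialize=false: load the class without running static initializers
            Class.forName(className, false, loader);
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // Not found, or found but unusable (e.g. a missing dependency)
            return false;
        }
    }

    public static void main(String[] args) {
        ClassLoader loader = Thread.currentThread().getContextClassLoader();
        // The first name is the class from the stack trace; the second is a
        // JDK class included as a sanity check that should always load.
        String[] names = {"com.drew.lang.BufferReader", "java.lang.String"};
        for (String name : names) {
            System.out.println(name + " loadable: " + isLoadable(name, loader));
        }
    }
}
```

Run from inside the processor or a unit test with the same dependencies, this
reports whether the class from the trace is reachable. Run standalone, it only
reflects the classpath it was started with.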