[ https://issues.apache.org/jira/browse/TIKA-1191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072104#comment-14072104 ]
Nicolas Belisle commented on TIKA-1191:
---------------------------------------
I was able to reproduce a similar issue with another file using Tika 1.5.
See the attached test.eml and the test (Test.java).
The exception:
Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mail.RFC822Parser@6743bc0f
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
at java.lang.reflect.Method.invoke(Unknown Source)
at org.apache.tika.fork.ForkServer.call(ForkServer.java:144)
at org.apache.tika.fork.ForkServer.processRequests(ForkServer.java:124)
at org.apache.tika.fork.ForkServer.main(ForkServer.java:69)
Caused by: java.lang.NullPointerException
at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:158)
at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:516)
at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:60)
at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:169)
at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:268)
at org.apache.tika.parser.AutoDetectParser.<init>(AutoDetectParser.java:51)
at org.apache.tika.parser.mail.RFC822Parser.adaptedExtractMultipart(RFC822Parser.java:167)
at org.apache.tika.parser.mail.RFC822Parser.adaptedExtractMultipart(RFC822Parser.java:156)
at org.apache.tika.parser.mail.RFC822Parser.parse(RFC822Parser.java:101)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
... 9 more
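
For context: going by the issue title quoted below, the NullPointerException presumably arises because a class loaded through ClassLoaderProxy has no Package object, so an expression of the form SomeClass.class.getPackage().getName() fails on the null return value. The following is only a minimal sketch of a null-safe lookup, not the attached patch and not Tika code; the names PackageNames and packageNameOf are hypothetical and exist purely for illustration.

```java
public class PackageNames {

    /**
     * Returns the package name of an ordinary (non-array, non-primitive) class.
     * Falls back to parsing the class name when the defining class loader
     * never registered a Package, in which case getPackage() returns null.
     * Hypothetical helper, not part of Tika.
     */
    public static String packageNameOf(Class<?> type) {
        Package pkg = type.getPackage();
        if (pkg != null) {
            return pkg.getName();
        }
        // No Package object available: derive the name from the binary class name.
        String name = type.getName();
        int lastDot = name.lastIndexOf('.');
        return lastDot < 0 ? "" : name.substring(0, lastDot);
    }

    public static void main(String[] args) {
        // Prints the package of a JDK class loaded by the bootstrap loader.
        System.out.println(packageNameOf(java.util.List.class));
    }
}
```

A fix on the class loader side (having the loader define the package before defining the class) would address the cause rather than the symptom; the sketch above only shows why an unguarded getPackage() call is fragile.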
> ForkParser / ClassLoaderProxy does not define package
> -----------------------------------------------------
>
> Key: TIKA-1191
> URL: https://issues.apache.org/jira/browse/TIKA-1191
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.4
> Reporter: Nicolas Belisle
> Attachments: ClassLoaderProxy.java.patch, Test.java, test.eml
>
>
> ForkParser throws an exception in some cases:
> org.apache.tika.exception.TikaException: Invalid embedded resource
> at org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedOfficeDoc(AbstractPOIFSExtractor.java:189)
> at org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:135)
> at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:186)
> at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:161)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at org.apache.tika.fork.ForkServer.call(ForkServer.java:144)
> at org.apache.tika.fork.ForkServer.processRequests(ForkServer.java:124)
> at org.apache.tika.fork.ForkServer.main(ForkServer.java:69)
> Caused by: java.lang.NullPointerException
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:136)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:499)
> at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:60)
> at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:169)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:268)
> at org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.getTikaConfig(AbstractPOIFSExtractor.java:72)
> at org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.getDetector(AbstractPOIFSExtractor.java:79)
> at org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedOfficeDoc(AbstractPOIFSExtractor.java:176)
> ... 10 more
> A patch will follow.
--
This message was sent by Atlassian JIRA
(v6.2#6252)