[
https://issues.apache.org/jira/browse/CONNECTORS-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15271454#comment-15271454
]
Karl Wright edited comment on CONNECTORS-1311 at 5/4/16 9:34 PM:
-----------------------------------------------------------------
The jcifs version issue should be trivial to fix.
For JAI, can you point me at where in the Tika dependency list this appears? I
did not see it.
Here is the list I used to update the dependencies we distribute:
http://mvnrepository.com/artifact/org.apache.tika/tika-parsers/1.12
> Dependencies issues
> -------------------
>
> Key: CONNECTORS-1311
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1311
> Project: ManifoldCF
> Issue Type: Bug
> Components: Build
> Affects Versions: ManifoldCF 2.5
> Environment: any
> Reporter: Konstantin Avdeev
> Assignee: Karl Wright
> Fix For: ManifoldCF 2.5
>
>
> There are several issues with the dependencies:
> 1) POI should be 3.13, since Tika 1.12 uses that version. With POI 3.14, Tika
> cannot parse presentation files (ppt):
> {code}
> FATAL 2016-05-03 10:39:16,821 (Worker thread '0') - Error tossed: org.apache.poi.xslf.usermodel.XSLFTextShape.getTextType()Lorg/apache/poi/xslf/usermodel/Placeholder;
> java.lang.NoSuchMethodError: org.apache.poi.xslf.usermodel.XSLFTextShape.getTextType()Lorg/apache/poi/xslf/usermodel/Placeholder;
>     at org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator.extractContent(XSLFPowerPointExtractorDecorator.java:154)
>     at org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator.buildXHTML(XSLFPowerPointExtractorDecorator.java:88)
>     at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:110)
>     at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:112)
>     at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>     at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:48)
> {code}
> 2) jcifs 1.3.17 is currently used; 1.3.18 is available.
> 3) The Java Advanced Imaging (JAI) and jbig2 format libraries are not
> included, but are required for parsing embedded images.
> Thank you!
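
A NoSuchMethodError like the one in item 1 usually means the class found at runtime comes from a different library version than the caller was compiled against. As a rough diagnostic sketch (not part of ManifoldCF or Tika; the class name DependencyCheck and its helpers are made up for illustration), the following prints where XSLFTextShape was loaded from and what getTextType() returns on the current classpath. According to the descriptor in the error message, Tika 1.12 expects the return type org.apache.poi.xslf.usermodel.Placeholder, so any other result would point at a version mismatch.

```java
import java.lang.reflect.Method;
import java.security.CodeSource;

// Hypothetical helper, for illustration only.
final class DependencyCheck {

    /** Returns the location (typically a jar URL) a class was loaded from. */
    static String locationOf(String className) {
        try {
            Class<?> cls = Class.forName(className);
            CodeSource src = cls.getProtectionDomain().getCodeSource();
            // Classes from the JDK itself usually have no code source.
            if (src == null || src.getLocation() == null) {
                return "(bootstrap / unknown)";
            }
            return src.getLocation().toString();
        } catch (ClassNotFoundException | LinkageError e) {
            return "(not on classpath)";
        }
    }

    /**
     * Returns the return type name of a public no-argument method,
     * or null if the class or the method cannot be found.
     */
    static String returnTypeOf(String className, String methodName) {
        try {
            Method m = Class.forName(className).getMethod(methodName);
            return m.getReturnType().getName();
        } catch (ClassNotFoundException | NoSuchMethodException | LinkageError e) {
            return null;
        }
    }

    public static void main(String[] args) {
        // Class and method names are taken from the stack trace above.
        String cls = "org.apache.poi.xslf.usermodel.XSLFTextShape";
        System.out.println("loaded from: " + locationOf(cls));
        System.out.println("getTextType() returns: " + returnTypeOf(cls, "getTextType"));
    }
}
```

Run it with the same classpath the agents process uses; the output depends entirely on which POI jars are present there.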