I looked at the history of this.  We had to release a patch (2.8.1) that
put various poi jars at root level in order to work around a Tika problem.
That patch may not have been entirely correct in that it looks like it may
have blocked access by one of the deeper jars to a higher level.

Release 2.9 should fix this if I am correct.

Karl


On Tue, Jan 9, 2018 at 6:39 AM, Karl Wright <[email protected]> wrote:

> What version of MCF is this?  That's important to know since Tika has had
> problems with this kind of thing in the past and this looks like something
> similar.
>
> The problem you are reporting is due to either a missing jar, or a bug in
> an internal tika classloader.  But I need to know whether this is a current
> bug or not, since we just went to a new Tika version.
>
> Karl
>
>
> On Tue, Jan 9, 2018 at 4:32 AM, msaunier <[email protected]> wrote:
>
>> Hello Karl,
>>
>> I hope you are well today.
>>
>>
>>
>> I have 2 problems with ManifoldCF.
>>
>>
>>
>> -----------
>>
>> In **Outputs connectors** with Solr connector. I have add a « Maximum
>> document length and I have « Excluded 5 mime types » but it not work. I
>> join capture.
>>
>>
>>
>> ----------
>>
>> And in second, I have a **Tika exception** in ManifoldCF. 3 documents
>> are blocked :
>>
>>
>>
>> FATAL 2018-01-09T10:19:54,992 (Worker thread '5') - Error tossed:
>> org.apache.poi.hwmf.record.HwmfFont.getCharSet()Lorg/apache/
>> poi/hwmf/record/HwmfFont$WmfCharset;
>>
>> java.lang.NoSuchMethodError: org.apache.poi.hwmf.record.Hwm
>> fFont.getCharSet()Lorg/apache/poi/hwmf/record/HwmfFont$WmfCharset;
>>
>>         at 
>> org.apache.tika.parser.microsoft.WMFParser.parse(WMFParser.java:74)
>> ~[?:?]
>>
>>         at 
>> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>> ~[?:?]
>>
>>         at 
>> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>> ~[?:?]
>>
>>         at 
>> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
>> ~[?:?]
>>
>>         at 
>> org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
>> ~[?:?]
>>
>>         at org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.
>> parseEmbedded(ParsingEmbeddedDocumentExtractor.java:102) ~[?:?]
>>
>>         at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtracto
>> r.handleEmbeddedFile(AbstractOOXMLExtractor.java:375) ~[?:?]
>>
>>         at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtracto
>> r.handleEmbeddedPart(AbstractOOXMLExtractor.java:260) ~[?:?]
>>
>>         at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtracto
>> r.handleEmbeddedParts(AbstractOOXMLExtractor.java:205) ~[?:?]
>>
>>         at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtracto
>> r.getXHTML(AbstractOOXMLExtractor.java:142) ~[?:?]
>>
>>         at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory
>> .parse(OOXMLExtractorFactory.java:142) ~[?:?]
>>
>>         at 
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:106)
>> ~[?:?]
>>
>>         at 
>> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>> ~[?:?]
>>
>>         at 
>> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>> ~[?:?]
>>
>>         at 
>> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
>> ~[?:?]
>>
>>         at 
>> org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)
>> ~[?:?]
>>
>>         at org.apache.manifoldcf.agents.transformation.tika.TikaExtract
>> or.addOrReplaceDocumentWithException(TikaExtractor.java:235) ~[?:?]
>>
>>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIn
>> gester$PipelineAddEntryPoint.addOrReplaceDocumentWithExcepti
>> on(IncrementalIngester.java:3226) ~[mcf-agents.jar:?]
>>
>>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIn
>> gester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)
>> ~[mcf-agents.jar:?]
>>
>>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIn
>> gester$PipelineObjectWithVersions.addOrReplaceDocumentWithEx
>> ception(IncrementalIngester.java:2708) ~[mcf-agents.jar:?]
>>
>>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIn
>> gester.documentIngest(IncrementalIngester.java:756) ~[mcf-agents.jar:?]
>>
>>         at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessAct
>> ivity.ingestDocumentWithException(WorkerThread.java:1583)
>> ~[mcf-pull-agent.jar:?]
>>
>>         at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessAct
>> ivity.ingestDocumentWithException(WorkerThread.java:1548)
>> ~[mcf-pull-agent.jar:?]
>>
>>         at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDr
>> iveConnector.processDocuments(SharedDriveConnector.java:939) ~[?:?]
>>
>>         at 
>> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
>> [mcf-pull-agent.jar:?]
>>
>>
>>
>> I need to create an incident ticket?
>>
>>
>>
>> ----------
>>
>>
>>
>> Thanks for your help.
>>
>>
>>
>> Cordialement,
>>
>>
>>
>> [image: msaunier]
>>
>>
>>
>>
>>
>>
>>
>
>

Reply via email to