Hi
I was trying to upgrade poi version to 3.11 with tika 1.5 which seems 
noncompatible ( got some AdbstractMetodError for while Excel processing ). So 
tried to move to tika 1.6 where I see my test cases failing ( for a .pptx file 
with some image) with below exception  ( full stacktrace is attached along with 
my integration test scenario)

java.lang.ClassNotFoundException: com.drew.metadata.MetadataException not found
by org.apache.tika.parsers [19]
        at org.apache.felix.framework.BundleWiringImpl.findClassOrResourceByDele
gation(BundleWiringImpl.java:1556)
        at org.apache.felix.framework.BundleWiringImpl.access$400(BundleWiringIm
pl.java:77)
        at org.apache.felix.framework.BundleWiringImpl$BundleClassLoader.loadCla
ss(BundleWiringImpl.java:1993)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
        at org.apache.tika.parser.jpeg.JpegParser.parse(JpegParser.java:56)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
)


Explictely tried putting upstream lib dependency in my pom ,but that di dnot 
help either
-------------------------------------------------------------
<dependency>
            <groupId>com.drewnoakes</groupId>
            <artifactId>metadata-extractor</artifactId>
            <version>2.6.2</version>
            <scope>test</scope>
</dependency>
-------------------------------------------------------------


-Ashok
P.S. Tried  with tika 1.7 also , but no luck.
ÿþ



-------------------------------------------------------

 Test case

-------------------------------------------------------



@RunWith(JUnit4TestRunner.class)

public class OfficeParserIT {





    @Configuration

    public Option[] configuration() 
throws IOException, URISyntaxException 
{

        return CoreOptions.options(

                junitBundles(),

                
mavenBundle("org.apache.felix", 
"org.apache.felix.scr", "1.6.0"),

                
testBundle("tika-core.jar"),

                
testBundle("tika-parsers.jar"),

                
testBundle("commons-codec.jar"),

                
testBundle("commons-compress.jar"),

                
testBundle("com.adobe.granite.poi.jar"));

    }



    private UrlProvisionOption 
testBundle(String name)

            throws 
MalformedURLException {

        File target = new 
File("target");

        File bundles = new File(target, 
"test-bundles");

        return bundle(new File(bundles, 
name).toURI().toURL().toString());

    }

 @Test

    public void testPowerPointParser() 
throws Exception {

        //assertParse("PPT.ppt");

        assertParse("PPT2007.pptx");

    }



 private void assertParse(String name) 
throws Exception {

        URL url = 
OfficeParserIT.class.getResource(name);



        String content = new 
Tika().parseToString(url);

        assertTrue(name, 
content.contains("\u30AB\u30BF\u30AB\u30CA"));

        assertTrue(name, 
content.contains("\u8907\u6570"));



        Set<String> words = new 
HashSet<String>();

        words.addAll(Arrays.asList(

                
content.toLowerCase(Locale.US).split("\\s+")));

        assertTrue(name, 
words.contains("single"));

        assertTrue(name, 
words.contains("byte"));

    }





-------------------------------------------------------

 Error Stack trace

-------------------------------------------------------

Running 
com.adobe.granite.poi.OfficeParserIT

Tests run: 4, Failures: 0, Errors: 1, 
Skipped: 0, Time elapsed: 5.519 sec <<< 
FA

ILURE! - in 
com.adobe.granite.poi.OfficeParserIT

testPowerPointParser(com.adobe.granite.poi.OfficeParserIT)
  Time elapsed: 1.903

sec  <<< ERROR!

java.lang.ClassNotFoundException: 
com.drew.metadata.MetadataException not 
found

by org.apache.tika.parsers [19]

        at 
org.apache.felix.framework.BundleWiringImpl.findClassOrResourceByDele

gation(BundleWiringImpl.java:1556)

        at 
org.apache.felix.framework.BundleWiringImpl.access$400(BundleWiringIm

pl.java:77)

        at 
org.apache.felix.framework.BundleWiringImpl$BundleClassLoader.loadCla

ss(BundleWiringImpl.java:1993)

        at 
java.lang.ClassLoader.loadClass(ClassLoader.java:358)

        at 
org.apache.tika.parser.jpeg.JpegParser.parse(JpegParser.java:56)

        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244

)

        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244

)

        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244

)

        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1

21)

        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:7

2)

        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbed

ded(ParsingEmbeddedDocumentExtractor.java:102)

        at 
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.handleT

humbnail(AbstractOOXMLExtractor.java:158)

        at 
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTM

L(AbstractOOXMLExtractor.java:118)

        at 
org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OO

XMLExtractorFactory.java:112)

        at 
org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.

java:82)

        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244

)

        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244

)

        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244

)

        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1

21)

        at 
org.apache.tika.Tika.parseToString(Tika.java:465)

        at 
org.apache.tika.Tika.parseToString(Tika.java:577)

        at 
com.adobe.granite.poi.OfficeParserIT.assertParse(OfficeParserIT.java:

95)

        at 
com.adobe.granite.poi.OfficeParserIT.testPowerPointParser(OfficeParse

rIT.java:84)





Results :



Tests in error:

  
OfficeParserIT.testPowerPointParser:84->assertParse:95
 W% ClassNotFound com.dre

...



Tests run: 4, Failures: 0, Errors: 1, 
Skipped: 0



[INFO] 
------------------------------------------------------------------------

[INFO] BUILD FAILURE

[INFO] 
------------------------------------------------------------------------

[INFO] Total time: 10.987s

[INFO] Finished at: Fri Jun 19 23:36:49 
IST 2015

[INFO] Final Memory: 20M/218M

[INFO] 
------------------------------------------------------------------------

Attachment: PPT2007.pptx
Description: PPT2007.pptx

Reply via email to