Hello,
I am having some weird problems installing Tika, specifically tika-parsers, on
a Mac, OSX. For some reasons, all sorts of tests fail once I get to the
installation stage of tika-parsers; They are all related to pdfbox and log4j. I
have little knowledge of using Maven, so maybe my problem is simply based in my
limited knowledge about Maven, but I cannot figure out what to do from here.
Therefore, I posted this to the Maven user list, too, as I am not sure whom to
actually address. Please excuse any inconvenience from this "cross-posting".
I've attached the surefire reports for a better overview of all failed tests as
well as those with errors. I tried using both the official Tika 0.9 release and
the SVN version, but both do not pass the tests, with fairly similar failures;
here are the results for the SVN pre-1.0 release, after running "mvn clean
install" in the main directory (tika-site).
tika-parsers test failures:
Results :
Failed tests:
testWordHTML(org.apache.tika.parser.microsoft.WordParserTest)
testSimple(org.apache.tika.parser.mail.RFC822ParserTest)
testMultipart(org.apache.tika.parser.mail.RFC822ParserTest)
testQuotedPrintable(org.apache.tika.parser.mail.RFC822ParserTest)
testBase64(org.apache.tika.parser.mail.RFC822ParserTest)
testI18NHeaders(org.apache.tika.parser.mail.RFC822ParserTest)
testWordHTML(org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest)
Tests in error:
testPDF(org.apache.tika.parser.AutoDetectParserTest)
testEmbeddedOfficeFiles(org.apache.tika.parser.microsoft.POIContainerExtractionTest)
testPdfParsing(org.apache.tika.parser.pdf.PDFParserTest)
testCustomMetadata(org.apache.tika.parser.pdf.PDFParserTest)
testProtectedPDF(org.apache.tika.parser.pdf.PDFParserTest)
testGzipParsing(org.apache.tika.parser.pkg.GzipParserTest)
testTarParsing(org.apache.tika.parser.pkg.TarParserTest)
testUnusualFromAddress(org.apache.tika.parser.mail.RFC822ParserTest)
testZipParsing(org.apache.tika.parser.pkg.ZipParserTest)
testBzip2Parsing(org.apache.tika.parser.pkg.Bzip2ParserTest)
testPDFExtraction(org.apache.tika.TestParsers)
testZipFileExtraction(org.apache.tika.TestParsers)
About the failed tests:
The mail.RFC822ParserTest seems to stem from problems with log4j.Category
(NoClassDefFoundError), which strikes me as peculiar, as I would assume Maven
takes care of those dependencies and the log4j JAR is in my Maven repository.
The microsoft.WordParserTest fails because it complains that it is missing an
image; This strikes me as peculiar, but maybe I can just ignore that failure.
And the microsoft.ooxml.OOXMLParserTest fails too, without any useful
indication about what went wrong, but then I am not planning to use it, so I
probably can ignore that one, too.
About the test errors:
These all fail at the exact same point:
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
The pom.xml of Tika specifies pdfbox 1.5.0; Checking out line 880 yields:
PDFParser parser = new PDFParser( new BufferedInputStream( input ),
scratchFile, force);
PDDocument.java also imports PDFParser (line 48):
import org.apache.pdfbox.pdfparser.PDFParser;
And the class file of PDFParser is found in the JAR file for pdfbox 1.5.0
(according to the dep. of tika-parser's POM), which also is in my m2 repository
(~/.m2/repository). So why doesn't this work, why is there a NoClassDef error,
similar to the RFC822ParserTest problems? I tried the most obvious thing,
simply adding the log4j and pdfbox JARs to the CLASSPATH, but that at least did
not help. Maybe this is some thing with Maven I don't understand, but I'd
really appreciate some help or hints here.
output of "mvn --version":
Apache Maven 2.2.1 (r801777; 2009-08-06 21:16:01+0200)
Java version: 1.6.0_24
Java home: /System/Library/Java/JavaVirtualMachines/1.6.0.jdk/Contents/Home
Default locale: en_US, platform encoding: MacRoman
OS name: "mac os x" version: "10.6.6" arch: "x86_64" Family: "mac"
(Note: I also tried using Maven 3, to no avail)
Thank you!
--Florian
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pkg.Bzip2ParserTest
-------------------------------------------------------------------------------
Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.339 sec <<<
FAILURE!
testBzip2Parsing(org.apache.tika.parser.pkg.Bzip2ParserTest) Time elapsed:
0.224 sec <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
at
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
at
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:107)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
at
org.apache.tika.parser.pkg.PackageExtractor.decompress(PackageExtractor.java:135)
at
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:90)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.pkg.Bzip2ParserTest.testBzip2Parsing(Bzip2ParserTest.java:40)
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pkg.TarParserTest
-------------------------------------------------------------------------------
Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.099 sec <<<
FAILURE!
testTarParsing(org.apache.tika.parser.pkg.TarParserTest) Time elapsed: 0.06
sec <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
at
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
at
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:107)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.pkg.TarParserTest.testTarParsing(TarParserTest.java:40)
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.mail.RFC822ParserTest
-------------------------------------------------------------------------------
Tests run: 6, Failures: 5, Errors: 1, Skipped: 0, Time elapsed: 0.184 sec <<<
FAILURE!
testSimple(org.apache.tika.parser.mail.RFC822ParserTest) Time elapsed: 0.131
sec <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown:
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
at junit.framework.Assert.fail(Assert.java:47)
at
org.apache.tika.parser.mail.RFC822ParserTest.testSimple(RFC822ParserTest.java:61)
testMultipart(org.apache.tika.parser.mail.RFC822ParserTest) Time elapsed:
0.027 sec <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown:
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
at junit.framework.Assert.fail(Assert.java:47)
at
org.apache.tika.parser.mail.RFC822ParserTest.testMultipart(RFC822ParserTest.java:82)
testQuotedPrintable(org.apache.tika.parser.mail.RFC822ParserTest) Time
elapsed: 0.001 sec <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown:
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
at junit.framework.Assert.fail(Assert.java:47)
at
org.apache.tika.parser.mail.RFC822ParserTest.testQuotedPrintable(RFC822ParserTest.java:117)
testBase64(org.apache.tika.parser.mail.RFC822ParserTest) Time elapsed: 0.003
sec <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown:
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
at junit.framework.Assert.fail(Assert.java:47)
at
org.apache.tika.parser.mail.RFC822ParserTest.testBase64(RFC822ParserTest.java:132)
testI18NHeaders(org.apache.tika.parser.mail.RFC822ParserTest) Time elapsed:
0.001 sec <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown:
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
at junit.framework.Assert.fail(Assert.java:47)
at
org.apache.tika.parser.mail.RFC822ParserTest.testI18NHeaders(RFC822ParserTest.java:149)
testUnusualFromAddress(org.apache.tika.parser.mail.RFC822ParserTest) Time
elapsed: 0.001 sec <<< ERROR!
org.apache.commons.logging.LogConfigurationException:
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
at
org.apache.commons.logging.impl.LogFactoryImpl.newInstance(LogFactoryImpl.java:543)
at
org.apache.commons.logging.impl.LogFactoryImpl.getInstance(LogFactoryImpl.java:235)
at
org.apache.commons.logging.impl.LogFactoryImpl.getInstance(LogFactoryImpl.java:209)
at org.apache.commons.logging.LogFactory.getLog(LogFactory.java:351)
at
org.apache.james.mime4j.parser.AbstractEntity.<init>(AbstractEntity.java:85)
at org.apache.james.mime4j.parser.MimeEntity.<init>(MimeEntity.java:67)
at
org.apache.james.mime4j.parser.MimeTokenStream.doParse(MimeTokenStream.java:173)
at
org.apache.james.mime4j.parser.MimeTokenStream.parse(MimeTokenStream.java:131)
at
org.apache.james.mime4j.parser.MimeStreamParser.parse(MimeStreamParser.java:89)
at org.apache.tika.parser.mail.RFC822Parser.parse(RFC822Parser.java:60)
at
org.apache.tika.parser.mail.RFC822ParserTest.testUnusualFromAddress(RFC822ParserTest.java:163)
Caused by: org.apache.commons.logging.LogConfigurationException: No suitable
Log constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category)
at
org.apache.commons.logging.impl.LogFactoryImpl.getLogConstructor(LogFactoryImpl.java:413)
at
org.apache.commons.logging.impl.LogFactoryImpl.newInstance(LogFactoryImpl.java:529)
... 35 more
Caused by: java.lang.NoClassDefFoundError: org/apache/log4j/Category
at java.lang.Class.getDeclaredConstructors0(Native Method)
at java.lang.Class.privateGetDeclaredConstructors(Class.java:2389)
at java.lang.Class.getConstructor0(Class.java:2699)
at java.lang.Class.getConstructor(Class.java:1657)
at
org.apache.commons.logging.impl.LogFactoryImpl.getLogConstructor(LogFactoryImpl.java:410)
... 36 more
Caused by: java.lang.ClassNotFoundException: org.apache.log4j.Category
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at sun.misc.Launcher$ExtClassLoader.findClass(Launcher.java:229)
at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
... 41 more
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.microsoft.POIContainerExtractionTest
-------------------------------------------------------------------------------
Tests run: 3, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 1.436 sec <<<
FAILURE!
testEmbeddedOfficeFiles(org.apache.tika.parser.microsoft.POIContainerExtractionTest)
Time elapsed: 1.055 sec <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.extractor.ParserContainerExtractor.extract(ParserContainerExtractor.java:81)
at
org.apache.tika.extractor.ParserContainerExtractor$RecursiveParser.parse(ParserContainerExtractor.java:127)
at
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
at
org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedResource(AbstractPOIFSExtractor.java:74)
at
org.apache.tika.parser.microsoft.OutlookExtractor.parse(OutlookExtractor.java:129)
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:205)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.extractor.ParserContainerExtractor.extract(ParserContainerExtractor.java:81)
at
org.apache.tika.parser.microsoft.AbstractPOIContainerExtractionTest.process(AbstractPOIContainerExtractionTest.java:58)
at
org.apache.tika.parser.microsoft.POIContainerExtractionTest.testEmbeddedOfficeFiles(POIContainerExtractionTest.java:246)
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.microsoft.WordParserTest
-------------------------------------------------------------------------------
Tests run: 4, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.109 sec <<<
FAILURE!
testWordHTML(org.apache.tika.parser.microsoft.WordParserTest) Time elapsed:
0.054 sec <<< FAILURE!
junit.framework.AssertionFailedError: Image not found in:
at junit.framework.Assert.fail(Assert.java:47)
at junit.framework.Assert.assertTrue(Assert.java:20)
at
org.apache.tika.parser.microsoft.WordParserTest.testWordHTML(WordParserTest.java:128)
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pkg.ZipParserTest
-------------------------------------------------------------------------------
Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.15 sec <<<
FAILURE!
testZipParsing(org.apache.tika.parser.pkg.ZipParserTest) Time elapsed: 0.083
sec <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
at
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
at
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:96)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.pkg.ZipParserTest.testZipParsing(ZipParserTest.java:40)
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pkg.GzipParserTest
-------------------------------------------------------------------------------
Tests run: 3, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.176 sec <<<
FAILURE!
testGzipParsing(org.apache.tika.parser.pkg.GzipParserTest) Time elapsed: 0.075
sec <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
at
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
at
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:107)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
at
org.apache.tika.parser.pkg.PackageExtractor.decompress(PackageExtractor.java:135)
at
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:93)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.pkg.GzipParserTest.testGzipParsing(GzipParserTest.java:40)
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.AutoDetectParserTest
-------------------------------------------------------------------------------
Tests run: 17, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 4.111 sec <<<
FAILURE!
testPDF(org.apache.tika.parser.AutoDetectParserTest) Time elapsed: 0.04 sec
<<< ERROR!
java.lang.ExceptionInInitializerError
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:148)
at
org.apache.tika.parser.AutoDetectParserTest.assertAutoDetect(AutoDetectParserTest.java:78)
at
org.apache.tika.parser.AutoDetectParserTest.assertAutoDetect(AutoDetectParserTest.java:111)
at
org.apache.tika.parser.AutoDetectParserTest.assertAutoDetect(AutoDetectParserTest.java:127)
at
org.apache.tika.parser.AutoDetectParserTest.testPDF(AutoDetectParserTest.java:173)
Caused by: org.apache.commons.logging.LogConfigurationException:
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by
org.apache.commons.logging.LogConfigurationException: No suitable Log
constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
at
org.apache.commons.logging.impl.LogFactoryImpl.newInstance(LogFactoryImpl.java:543)
at
org.apache.commons.logging.impl.LogFactoryImpl.getInstance(LogFactoryImpl.java:235)
at
org.apache.commons.logging.impl.LogFactoryImpl.getInstance(LogFactoryImpl.java:209)
at org.apache.commons.logging.LogFactory.getLog(LogFactory.java:351)
at org.apache.pdfbox.pdfparser.BaseParser.<clinit>(BaseParser.java:58)
... 37 more
Caused by: org.apache.commons.logging.LogConfigurationException: No suitable
Log constructor [Ljava.lang.Class;@622a11f3 for
org.apache.commons.logging.impl.Log4JLogger (Caused by
java.lang.NoClassDefFoundError: org/apache/log4j/Category)
at
org.apache.commons.logging.impl.LogFactoryImpl.getLogConstructor(LogFactoryImpl.java:413)
at
org.apache.commons.logging.impl.LogFactoryImpl.newInstance(LogFactoryImpl.java:529)
... 41 more
Caused by: java.lang.NoClassDefFoundError: org/apache/log4j/Category
at java.lang.Class.getDeclaredConstructors0(Native Method)
at java.lang.Class.privateGetDeclaredConstructors(Class.java:2389)
at java.lang.Class.getConstructor0(Class.java:2699)
at java.lang.Class.getConstructor(Class.java:1657)
at
org.apache.commons.logging.impl.LogFactoryImpl.getLogConstructor(LogFactoryImpl.java:410)
... 42 more
Caused by: java.lang.ClassNotFoundException: org.apache.log4j.Category
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at sun.misc.Launcher$ExtClassLoader.findClass(Launcher.java:229)
at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
... 47 more
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
-------------------------------------------------------------------------------
Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 1.114 sec <<<
FAILURE!
testWordHTML(org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest) Time
elapsed: 0.125 sec <<< FAILURE!
junit.framework.AssertionFailedError
at junit.framework.Assert.fail(Assert.java:47)
at junit.framework.Assert.assertTrue(Assert.java:20)
at junit.framework.Assert.assertTrue(Assert.java:27)
at
org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testWordHTML(OOXMLParserTest.java:337)
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pdf.PDFParserTest
-------------------------------------------------------------------------------
Tests run: 3, Failures: 0, Errors: 3, Skipped: 0, Time elapsed: 0.073 sec <<<
FAILURE!
testPdfParsing(org.apache.tika.parser.pdf.PDFParserTest) Time elapsed: 0.019
sec <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.pdf.PDFParserTest.testPdfParsing(PDFParserTest.java:44)
testCustomMetadata(org.apache.tika.parser.pdf.PDFParserTest) Time elapsed:
0.035 sec <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.pdf.PDFParserTest.testCustomMetadata(PDFParserTest.java:78)
testProtectedPDF(org.apache.tika.parser.pdf.PDFParserTest) Time elapsed: 0.016
sec <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.pdf.PDFParserTest.testProtectedPDF(PDFParserTest.java:111)
-------------------------------------------------------------------------------
Test set: org.apache.tika.TestParsers
-------------------------------------------------------------------------------
Tests run: 17, Failures: 0, Errors: 2, Skipped: 0, Time elapsed: 1.201 sec <<<
FAILURE!
testPDFExtraction(org.apache.tika.TestParsers) Time elapsed: 0.029 sec <<<
ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at org.apache.tika.Tika.parseToString(Tika.java:357)
at org.apache.tika.Tika.parseToString(Tika.java:423)
at org.apache.tika.Tika.parseToString(Tika.java:403)
at
org.apache.tika.utils.ParseUtils.getStringContent(ParseUtils.java:180)
at org.apache.tika.TestParsers.testPDFExtraction(TestParsers.java:46)
testZipFileExtraction(org.apache.tika.TestParsers) Time elapsed: 0.059 sec
<<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.pdfbox.pdfparser.PDFParser
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
at
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
at
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:96)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at org.apache.tika.Tika.parseToString(Tika.java:357)
at org.apache.tika.Tika.parseToString(Tika.java:423)
at org.apache.tika.Tika.parseToString(Tika.java:403)
at
org.apache.tika.utils.ParseUtils.getStringContent(ParseUtils.java:180)
at
org.apache.tika.TestParsers.testZipFileExtraction(TestParsers.java:202)