Hello,

I am having some weird problems installing Tika, specifically tika-parsers, on 
a Mac, OSX. For some reasons, all sorts of tests fail once I get to the 
installation stage of tika-parsers; They are all related to pdfbox and log4j. I 
have little knowledge of using Maven, so maybe my problem is simply based in my 
limited knowledge about Maven, but I cannot figure out what to do from here. 
Therefore, I posted this to the Maven user list, too, as I am not sure whom to 
actually address. Please excuse any inconvenience from this  "cross-posting".

I've attached the surefire reports for a better overview of all failed tests as 
well as those with errors. I tried using both the official Tika 0.9 release and 
the SVN version, but both do not pass the tests, with fairly similar failures; 
here are the results for the SVN pre-1.0 release, after running "mvn clean 
install" in the main directory (tika-site).

tika-parsers test failures:

Results :

Failed tests: 
 testWordHTML(org.apache.tika.parser.microsoft.WordParserTest)
 testSimple(org.apache.tika.parser.mail.RFC822ParserTest)
 testMultipart(org.apache.tika.parser.mail.RFC822ParserTest)
 testQuotedPrintable(org.apache.tika.parser.mail.RFC822ParserTest)
 testBase64(org.apache.tika.parser.mail.RFC822ParserTest)
 testI18NHeaders(org.apache.tika.parser.mail.RFC822ParserTest)
 testWordHTML(org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest)

Tests in error: 
 testPDF(org.apache.tika.parser.AutoDetectParserTest)
 
testEmbeddedOfficeFiles(org.apache.tika.parser.microsoft.POIContainerExtractionTest)
 testPdfParsing(org.apache.tika.parser.pdf.PDFParserTest)
 testCustomMetadata(org.apache.tika.parser.pdf.PDFParserTest)
 testProtectedPDF(org.apache.tika.parser.pdf.PDFParserTest)
 testGzipParsing(org.apache.tika.parser.pkg.GzipParserTest)
 testTarParsing(org.apache.tika.parser.pkg.TarParserTest)
 testUnusualFromAddress(org.apache.tika.parser.mail.RFC822ParserTest)
 testZipParsing(org.apache.tika.parser.pkg.ZipParserTest)
 testBzip2Parsing(org.apache.tika.parser.pkg.Bzip2ParserTest)
 testPDFExtraction(org.apache.tika.TestParsers)
 testZipFileExtraction(org.apache.tika.TestParsers)

About the failed tests:

The mail.RFC822ParserTest seems to stem from problems with log4j.Category 
(NoClassDefFoundError), which strikes me as peculiar, as I would assume Maven 
takes care of those dependencies and the log4j JAR is in my Maven repository.

The microsoft.WordParserTest fails because it complains that it is missing an 
image; This strikes me as peculiar, but maybe I can just ignore that failure. 
And the microsoft.ooxml.OOXMLParserTest fails too, without any useful 
indication about what went wrong, but then I am not planning to use it, so I 
probably can ignore that one, too.

About the test errors:

These all fail at the exact same point:

java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)

The pom.xml of Tika specifies pdfbox 1.5.0; Checking out line 880 yields:

PDFParser parser = new PDFParser( new BufferedInputStream( input ), 
scratchFile, force); 

PDDocument.java also imports PDFParser (line 48):

import org.apache.pdfbox.pdfparser.PDFParser;

And the class file of PDFParser is found in the JAR file for pdfbox 1.5.0 
(according to the dep. of tika-parser's POM), which also is in my m2 repository 
(~/.m2/repository). So why doesn't this work, why is there a NoClassDef error, 
similar to the RFC822ParserTest problems? I tried the most obvious thing, 
simply adding the log4j and pdfbox JARs to the CLASSPATH, but that at least did 
not help. Maybe this is some thing with Maven I don't understand, but I'd 
really appreciate some help or hints here.

output of "mvn --version":

Apache Maven 2.2.1 (r801777; 2009-08-06 21:16:01+0200)
Java version: 1.6.0_24
Java home: /System/Library/Java/JavaVirtualMachines/1.6.0.jdk/Contents/Home
Default locale: en_US, platform encoding: MacRoman
OS name: "mac os x" version: "10.6.6" arch: "x86_64" Family: "mac"

(Note: I also tried using Maven 3, to no avail)

Thank you!

--Florian

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pkg.Bzip2ParserTest
-------------------------------------------------------------------------------
Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.339 sec <<< 
FAILURE!
testBzip2Parsing(org.apache.tika.parser.pkg.Bzip2ParserTest)  Time elapsed: 
0.224 sec  <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
        at 
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
        at 
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:107)
        at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
        at 
org.apache.tika.parser.pkg.PackageExtractor.decompress(PackageExtractor.java:135)
        at 
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:90)
        at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.pkg.Bzip2ParserTest.testBzip2Parsing(Bzip2ParserTest.java:40)

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pkg.TarParserTest
-------------------------------------------------------------------------------
Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.099 sec <<< 
FAILURE!
testTarParsing(org.apache.tika.parser.pkg.TarParserTest)  Time elapsed: 0.06 
sec  <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
        at 
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
        at 
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:107)
        at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.pkg.TarParserTest.testTarParsing(TarParserTest.java:40)

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.mail.RFC822ParserTest
-------------------------------------------------------------------------------
Tests run: 6, Failures: 5, Errors: 1, Skipped: 0, Time elapsed: 0.184 sec <<< 
FAILURE!
testSimple(org.apache.tika.parser.mail.RFC822ParserTest)  Time elapsed: 0.131 
sec  <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown: 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
        at junit.framework.Assert.fail(Assert.java:47)
        at 
org.apache.tika.parser.mail.RFC822ParserTest.testSimple(RFC822ParserTest.java:61)

testMultipart(org.apache.tika.parser.mail.RFC822ParserTest)  Time elapsed: 
0.027 sec  <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown: 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
        at junit.framework.Assert.fail(Assert.java:47)
        at 
org.apache.tika.parser.mail.RFC822ParserTest.testMultipart(RFC822ParserTest.java:82)

testQuotedPrintable(org.apache.tika.parser.mail.RFC822ParserTest)  Time 
elapsed: 0.001 sec  <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown: 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
        at junit.framework.Assert.fail(Assert.java:47)
        at 
org.apache.tika.parser.mail.RFC822ParserTest.testQuotedPrintable(RFC822ParserTest.java:117)

testBase64(org.apache.tika.parser.mail.RFC822ParserTest)  Time elapsed: 0.003 
sec  <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown: 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
        at junit.framework.Assert.fail(Assert.java:47)
        at 
org.apache.tika.parser.mail.RFC822ParserTest.testBase64(RFC822ParserTest.java:132)

testI18NHeaders(org.apache.tika.parser.mail.RFC822ParserTest)  Time elapsed: 
0.001 sec  <<< FAILURE!
junit.framework.AssertionFailedError: Exception thrown: 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
        at junit.framework.Assert.fail(Assert.java:47)
        at 
org.apache.tika.parser.mail.RFC822ParserTest.testI18NHeaders(RFC822ParserTest.java:149)

testUnusualFromAddress(org.apache.tika.parser.mail.RFC822ParserTest)  Time 
elapsed: 0.001 sec  <<< ERROR!
org.apache.commons.logging.LogConfigurationException: 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
        at 
org.apache.commons.logging.impl.LogFactoryImpl.newInstance(LogFactoryImpl.java:543)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.getInstance(LogFactoryImpl.java:235)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.getInstance(LogFactoryImpl.java:209)
        at org.apache.commons.logging.LogFactory.getLog(LogFactory.java:351)
        at 
org.apache.james.mime4j.parser.AbstractEntity.<init>(AbstractEntity.java:85)
        at org.apache.james.mime4j.parser.MimeEntity.<init>(MimeEntity.java:67)
        at 
org.apache.james.mime4j.parser.MimeTokenStream.doParse(MimeTokenStream.java:173)
        at 
org.apache.james.mime4j.parser.MimeTokenStream.parse(MimeTokenStream.java:131)
        at 
org.apache.james.mime4j.parser.MimeStreamParser.parse(MimeStreamParser.java:89)
        at org.apache.tika.parser.mail.RFC822Parser.parse(RFC822Parser.java:60)
        at 
org.apache.tika.parser.mail.RFC822ParserTest.testUnusualFromAddress(RFC822ParserTest.java:163)
Caused by: org.apache.commons.logging.LogConfigurationException: No suitable 
Log constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.getLogConstructor(LogFactoryImpl.java:413)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.newInstance(LogFactoryImpl.java:529)
        ... 35 more
Caused by: java.lang.NoClassDefFoundError: org/apache/log4j/Category
        at java.lang.Class.getDeclaredConstructors0(Native Method)
        at java.lang.Class.privateGetDeclaredConstructors(Class.java:2389)
        at java.lang.Class.getConstructor0(Class.java:2699)
        at java.lang.Class.getConstructor(Class.java:1657)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.getLogConstructor(LogFactoryImpl.java:410)
        ... 36 more
Caused by: java.lang.ClassNotFoundException: org.apache.log4j.Category
        at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
        at sun.misc.Launcher$ExtClassLoader.findClass(Launcher.java:229)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
        ... 41 more

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.microsoft.POIContainerExtractionTest
-------------------------------------------------------------------------------
Tests run: 3, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 1.436 sec <<< 
FAILURE!
testEmbeddedOfficeFiles(org.apache.tika.parser.microsoft.POIContainerExtractionTest)
  Time elapsed: 1.055 sec  <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.extractor.ParserContainerExtractor.extract(ParserContainerExtractor.java:81)
        at 
org.apache.tika.extractor.ParserContainerExtractor$RecursiveParser.parse(ParserContainerExtractor.java:127)
        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
        at 
org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedResource(AbstractPOIFSExtractor.java:74)
        at 
org.apache.tika.parser.microsoft.OutlookExtractor.parse(OutlookExtractor.java:129)
        at 
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:205)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.extractor.ParserContainerExtractor.extract(ParserContainerExtractor.java:81)
        at 
org.apache.tika.parser.microsoft.AbstractPOIContainerExtractionTest.process(AbstractPOIContainerExtractionTest.java:58)
        at 
org.apache.tika.parser.microsoft.POIContainerExtractionTest.testEmbeddedOfficeFiles(POIContainerExtractionTest.java:246)

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.microsoft.WordParserTest
-------------------------------------------------------------------------------
Tests run: 4, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.109 sec <<< 
FAILURE!
testWordHTML(org.apache.tika.parser.microsoft.WordParserTest)  Time elapsed: 
0.054 sec  <<< FAILURE!
junit.framework.AssertionFailedError: Image not found in:
        at junit.framework.Assert.fail(Assert.java:47)
        at junit.framework.Assert.assertTrue(Assert.java:20)
        at 
org.apache.tika.parser.microsoft.WordParserTest.testWordHTML(WordParserTest.java:128)

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pkg.ZipParserTest
-------------------------------------------------------------------------------
Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.15 sec <<< 
FAILURE!
testZipParsing(org.apache.tika.parser.pkg.ZipParserTest)  Time elapsed: 0.083 
sec  <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
        at 
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
        at 
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:96)
        at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.pkg.ZipParserTest.testZipParsing(ZipParserTest.java:40)

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pkg.GzipParserTest
-------------------------------------------------------------------------------
Tests run: 3, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.176 sec <<< 
FAILURE!
testGzipParsing(org.apache.tika.parser.pkg.GzipParserTest)  Time elapsed: 0.075 
sec  <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
        at 
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
        at 
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:107)
        at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
        at 
org.apache.tika.parser.pkg.PackageExtractor.decompress(PackageExtractor.java:135)
        at 
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:93)
        at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.pkg.GzipParserTest.testGzipParsing(GzipParserTest.java:40)

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.AutoDetectParserTest
-------------------------------------------------------------------------------
Tests run: 17, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 4.111 sec <<< 
FAILURE!
testPDF(org.apache.tika.parser.AutoDetectParserTest)  Time elapsed: 0.04 sec  
<<< ERROR!
java.lang.ExceptionInInitializerError
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:148)
        at 
org.apache.tika.parser.AutoDetectParserTest.assertAutoDetect(AutoDetectParserTest.java:78)
        at 
org.apache.tika.parser.AutoDetectParserTest.assertAutoDetect(AutoDetectParserTest.java:111)
        at 
org.apache.tika.parser.AutoDetectParserTest.assertAutoDetect(AutoDetectParserTest.java:127)
        at 
org.apache.tika.parser.AutoDetectParserTest.testPDF(AutoDetectParserTest.java:173)
Caused by: org.apache.commons.logging.LogConfigurationException: 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category) (Caused by 
org.apache.commons.logging.LogConfigurationException: No suitable Log 
constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category))
        at 
org.apache.commons.logging.impl.LogFactoryImpl.newInstance(LogFactoryImpl.java:543)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.getInstance(LogFactoryImpl.java:235)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.getInstance(LogFactoryImpl.java:209)
        at org.apache.commons.logging.LogFactory.getLog(LogFactory.java:351)
        at org.apache.pdfbox.pdfparser.BaseParser.<clinit>(BaseParser.java:58)
        ... 37 more
Caused by: org.apache.commons.logging.LogConfigurationException: No suitable 
Log constructor [Ljava.lang.Class;@622a11f3 for 
org.apache.commons.logging.impl.Log4JLogger (Caused by 
java.lang.NoClassDefFoundError: org/apache/log4j/Category)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.getLogConstructor(LogFactoryImpl.java:413)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.newInstance(LogFactoryImpl.java:529)
        ... 41 more
Caused by: java.lang.NoClassDefFoundError: org/apache/log4j/Category
        at java.lang.Class.getDeclaredConstructors0(Native Method)
        at java.lang.Class.privateGetDeclaredConstructors(Class.java:2389)
        at java.lang.Class.getConstructor0(Class.java:2699)
        at java.lang.Class.getConstructor(Class.java:1657)
        at 
org.apache.commons.logging.impl.LogFactoryImpl.getLogConstructor(LogFactoryImpl.java:410)
        ... 42 more
Caused by: java.lang.ClassNotFoundException: org.apache.log4j.Category
        at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
        at sun.misc.Launcher$ExtClassLoader.findClass(Launcher.java:229)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
        ... 47 more

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
-------------------------------------------------------------------------------
Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 1.114 sec <<< 
FAILURE!
testWordHTML(org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest)  Time 
elapsed: 0.125 sec  <<< FAILURE!
junit.framework.AssertionFailedError
        at junit.framework.Assert.fail(Assert.java:47)
        at junit.framework.Assert.assertTrue(Assert.java:20)
        at junit.framework.Assert.assertTrue(Assert.java:27)
        at 
org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testWordHTML(OOXMLParserTest.java:337)

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.pdf.PDFParserTest
-------------------------------------------------------------------------------
Tests run: 3, Failures: 0, Errors: 3, Skipped: 0, Time elapsed: 0.073 sec <<< 
FAILURE!
testPdfParsing(org.apache.tika.parser.pdf.PDFParserTest)  Time elapsed: 0.019 
sec  <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.pdf.PDFParserTest.testPdfParsing(PDFParserTest.java:44)

testCustomMetadata(org.apache.tika.parser.pdf.PDFParserTest)  Time elapsed: 
0.035 sec  <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.pdf.PDFParserTest.testCustomMetadata(PDFParserTest.java:78)

testProtectedPDF(org.apache.tika.parser.pdf.PDFParserTest)  Time elapsed: 0.016 
sec  <<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.pdf.PDFParserTest.testProtectedPDF(PDFParserTest.java:111)

-------------------------------------------------------------------------------
Test set: org.apache.tika.TestParsers
-------------------------------------------------------------------------------
Tests run: 17, Failures: 0, Errors: 2, Skipped: 0, Time elapsed: 1.201 sec <<< 
FAILURE!
testPDFExtraction(org.apache.tika.TestParsers)  Time elapsed: 0.029 sec  <<< 
ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at org.apache.tika.Tika.parseToString(Tika.java:357)
        at org.apache.tika.Tika.parseToString(Tika.java:423)
        at org.apache.tika.Tika.parseToString(Tika.java:403)
        at 
org.apache.tika.utils.ParseUtils.getStringContent(ParseUtils.java:180)
        at org.apache.tika.TestParsers.testPDFExtraction(TestParsers.java:46)

testZipFileExtraction(org.apache.tika.TestParsers)  Time elapsed: 0.059 sec  
<<< ERROR!
java.lang.NoClassDefFoundError: Could not initialize class 
org.apache.pdfbox.pdfparser.PDFParser
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:880)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:846)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at 
org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
        at 
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:95)
        at 
org.apache.tika.parser.pkg.PackageExtractor.unpack(PackageExtractor.java:166)
        at 
org.apache.tika.parser.pkg.PackageExtractor.parse(PackageExtractor.java:96)
        at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:61)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:216)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at org.apache.tika.Tika.parseToString(Tika.java:357)
        at org.apache.tika.Tika.parseToString(Tika.java:423)
        at org.apache.tika.Tika.parseToString(Tika.java:403)
        at 
org.apache.tika.utils.ParseUtils.getStringContent(ParseUtils.java:180)
        at 
org.apache.tika.TestParsers.testZipFileExtraction(TestParsers.java:202)

Reply via email to