[
https://issues.apache.org/jira/browse/TIKA-1163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13765508#comment-13765508
]
Derrick Johnson commented on TIKA-1163:
---------------------------------------
I just tried out your test case, and it works. So then, what is different for
my project? Why does new Tika() work for 1.0, but not for 1.4? I'd greatly
appreciate any assistance. Here is the output of `mvn dependency:tree -verbose`:
{code}
[INFO] | +- org.apache.tika:tika-core:jar:1.4:compile
[INFO] | +- org.apache.tika:tika-parsers:jar:1.4:compile
[INFO] | | +- (org.apache.tika:tika-core:jar:1.4:compile - omitted for duplicate)
[INFO] | | +- org.gagravarr:vorbis-java-tika:jar:0.1:compile
[INFO] | | | +- org.gagravarr:vorbis-java-core:jar:tests:0.1:runtime
[INFO] | | | \- (org.apache.tika:tika-core:jar:1.4:compile - version managed from 1.0; omitted for duplicate)
[INFO] | | +- edu.ucar:netcdf:jar:4.2-min:compile
[INFO] | | | \- (org.slf4j:slf4j-api:jar:1.6.4:compile - version managed from 1.5.6; omitted for duplicate)
[INFO] | | +- (org.apache.james:apache-mime4j-core:jar:0.7:compile - version managed from 0.7.2; omitted for duplicate)
[INFO] | | +- org.apache.james:apache-mime4j-dom:jar:0.7:compile (version managed from 0.7.2)
[INFO] | | | \- (org.apache.james:apache-mime4j-core:jar:0.7:compile - version managed from 0.7.2; omitted for duplicate)
[INFO] | | +- (org.apache.commons:commons-compress:jar:1.4.1:compile - version managed from 1.5; omitted for duplicate)
[INFO] | | +- (commons-codec:commons-codec:jar:1.5:compile - omitted for conflict with 1.4)
[INFO] | | +- org.apache.pdfbox:pdfbox:jar:1.8.1:compile
[INFO] | | | +- org.apache.pdfbox:fontbox:jar:1.8.1:compile
[INFO] | | | | \- (commons-logging:commons-logging:jar:1.1.1:compile - omitted for duplicate)
[INFO] | | | +- org.apache.pdfbox:jempbox:jar:1.8.1:compile
[INFO] | | | \- (commons-logging:commons-logging:jar:1.1.1:compile - omitted for duplicate)
[INFO] | | +- org.bouncycastle:bcmail-jdk15:jar:1.45:compile
[INFO] | | | \- (org.bouncycastle:bcprov-jdk15:jar:1.45:compile - omitted for duplicate)
[INFO] | | +- org.bouncycastle:bcprov-jdk15:jar:1.45:compile
[INFO] | | +- org.apache.poi:poi:jar:3.9:compile
[INFO] | | | \- (commons-codec:commons-codec:jar:1.5:compile - omitted for conflict with 1.4)
[INFO] | | +- org.apache.poi:poi-scratchpad:jar:3.9:compile
[INFO] | | | \- (org.apache.poi:poi:jar:3.9:compile - omitted for duplicate)
[INFO] | | +- org.apache.poi:poi-ooxml:jar:3.9:compile
[INFO] | | | +- (org.apache.poi:poi:jar:3.9:compile - omitted for duplicate)
[INFO] | | | +- org.apache.poi:poi-ooxml-schemas:jar:3.9:compile
[INFO] | | | | \- org.apache.xmlbeans:xmlbeans:jar:2.3.0:compile
[INFO] | | | \- (dom4j:dom4j:jar:1.6.1:compile - omitted for duplicate)
[INFO] | | +- org.apache.geronimo.specs:geronimo-stax-api_1.0_spec:jar:1.0.1:compile
[INFO] | | +- org.ccil.cowan.tagsoup:tagsoup:jar:1.2.1:compile
[INFO] | | +- org.ow2.asm:asm-debug-all:jar:4.1:compile
[INFO] | | +- com.googlecode.mp4parser:isoparser:jar:1.0-RC-1:compile
[INFO] | | | \- (org.aspectj:aspectjrt:jar:1.6.11:compile - omitted for conflict with 1.6.12)
[INFO] | | +- com.drewnoakes:metadata-extractor:jar:2.6.2:compile
[INFO] | | | +- com.adobe.xmp:xmpcore:jar:5.1.2:compile
[INFO] | | | \- xerces:xercesImpl:jar:2.8.1:compile
[INFO] | | | \- xml-apis:xml-apis:jar:1.3.03:compile
[INFO] | | +- de.l3s.boilerpipe:boilerpipe:jar:1.1.0:compile
[INFO] | | +- rome:rome:jar:0.9:compile
[INFO] | | | \- jdom:jdom:jar:1.0:compile
[INFO] | | +- org.gagravarr:vorbis-java-core:jar:0.1:compile
[INFO] | | \- com.googlecode.juniversalchardet:juniversalchardet:jar:1.0.3:compile
{code}
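The tree above only shows which artifacts Maven resolves, not what ends up on
the runtime classpath. Since the stack trace in the report fails while reading
the MIME type definitions, one possible next check is to list every classpath
location that provides those XML files, both in the working test case and in
the failing project, and compare. The following is a minimal sketch using only
the JDK. The class name ClasspathResourceCheck is hypothetical, and the two
resource paths are an assumption about where tika-core keeps its MIME
definitions; adjust them if your jar differs.

```java
import java.io.IOException;
import java.net.URL;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not part of Tika: lists where a resource is found
// on the classpath so that stray or duplicate copies become visible.
public class ClasspathResourceCheck {

    /** Returns every classpath location that provides the named resource. */
    static List<URL> locate(String resourceName) throws IOException {
        ClassLoader loader = Thread.currentThread().getContextClassLoader();
        if (loader == null) {
            loader = ClassLoader.getSystemClassLoader();
        }
        return Collections.list(loader.getResources(resourceName));
    }

    public static void main(String[] args) throws IOException {
        // Assumed resource paths; verify them against the tika-core jar in use.
        String[] names = {
            "org/apache/tika/mime/tika-mimetypes.xml",
            "org/apache/tika/mime/custom-mimetypes.xml"
        };
        for (String name : names) {
            List<URL> hits = locate(name);
            System.out.println(name + ": found " + hits.size() + " time(s)");
            for (URL hit : hits) {
                System.out.println("  " + hit);
            }
        }
    }
}
```

Run it with the same classpath as the failing application. More than one hit
for a file, or a hit inside a jar other than the expected tika-core version,
would be a difference worth reporting on this issue.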
> NPE thrown by TikaConfig.getDefaultConfig()
> --------------------------------------------
>
> Key: TIKA-1163
> URL: https://issues.apache.org/jira/browse/TIKA-1163
> Project: Tika
> Issue Type: Bug
> Affects Versions: 1.4
> Environment: OS-X, JDK 1.7
> Reporter: Derrick Johnson
> Attachments: TCT.java
>
>
> The exception below is thrown every time I call
> TikaConfig.getDefaultConfig(). Similarly, invoking `Tika t = new Tika()` throws
> the same exception, since the constructor calls
> TikaConfig.getDefaultConfig().
> The problem does not occur when I use tika-core and tika-parsers 1.0. But
> when I bump the version numbers to 1.2 (in order to get around a bug in
> PDFBOX), the problem shows up. I'm using Maven. I've carefully checked that
> I'm not pulling in the wrong version of Tika, using things like `mvn
> dependency:tree` and making sure there were no sneaky problematic transitive
> dependencies.
> java.lang.NullPointerException
> at org.apache.tika.mime.MimeTypesReader$ClauseRecord.stop(MimeTypesReader.java:245)
> at org.apache.tika.mime.MimeTypesReader.endElement(MimeTypesReader.java:203)
> at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.endElement(AbstractSAXParser.java:606)
> at com.sun.org.apache.xerces.internal.parsers.AbstractXMLDocumentParser.emptyElement(AbstractXMLDocumentParser.java:183)
> at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanStartElement(XMLDocumentFragmentScannerImpl.java:1303)
> at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(XMLDocumentFragmentScannerImpl.java:2717)
> at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(XMLDocumentScannerImpl.java:607)
> at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(XMLDocumentFragmentScannerImpl.java:489)
> at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:835)
> at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:764)
> at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParser.java:123)
> at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1210)
> at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:568)
> at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl.parse(SAXParserImpl.java:302)
> at javax.xml.parsers.SAXParser.parse(SAXParser.java:195)
> at org.apache.tika.mime.MimeTypesReader.read(MimeTypesReader.java:115)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:64)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:93)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:149)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:479)
> at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:60)
> at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:169)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:268)
> at my.method.which.invokes `new Tika()`