What is the minimum jar required to use the best Tika detection can offer? Currently I am using this code to do detection (if this is the best way to do detection): Detector detector = new ContainerAwareDetector(MimeTypes.getDefaultMimeTypes()); Tika tika = new Tika(detector); String mimeType = tika.detect(TikaInputStream.get(in));
The following is mvn dependency:tree result of tika-parsers:0.9 for your convenience: [INFO] --- maven-dependency-plugin:2.1:tree (default-cli) @ tika-parsers --- [INFO] org.apache.tika:tika-parsers:bundle:0.9 [INFO] +- org.apache.tika:tika-core:jar:0.9:compile [INFO] +- edu.ucar:netcdf:jar:4.2-min:compile [INFO] | \- org.slf4j:slf4j-api:jar:1.5.6:compile [INFO] +- commons-httpclient:commons-httpclient:jar:3.1:compile [INFO] | +- commons-logging:commons-logging:jar:1.1.1:compile (version managed from 1.0.4) [INFO] | \- commons-codec:commons-codec:jar:1.2:compile [INFO] +- org.apache.james:apache-mime4j:jar:0.6:compile [INFO] +- org.apache.commons:commons-compress:jar:1.1:compile [INFO] +- org.apache.pdfbox:pdfbox:jar:1.4.0:compile [INFO] | +- org.apache.pdfbox:fontbox:jar:1.4.0:compile [INFO] | \- org.apache.pdfbox:jempbox:jar:1.4.0:compile [INFO] +- org.bouncycastle:bcmail-jdk15:jar:1.45:compile [INFO] +- org.bouncycastle:bcprov-jdk15:jar:1.45:compile [INFO] +- org.apache.poi:poi:jar:3.7:compile [INFO] +- org.apache.poi:poi-scratchpad:jar:3.7:compile [INFO] +- org.apache.poi:poi-ooxml:jar:3.7:compile [INFO] | +- org.apache.poi:poi-ooxml-schemas:jar:3.7:compile [INFO] | | \- org.apache.xmlbeans:xmlbeans:jar:2.3.0:compile [INFO] | \- dom4j:dom4j:jar:1.6.1:compile [INFO] +- org.apache.geronimo.specs:geronimo-stax-api_1.0_spec:jar:1.0.1:compile [INFO] +- org.ccil.cowan.tagsoup:tagsoup:jar:1.2:compile [INFO] +- asm:asm:jar:3.1:compile [INFO] +- com.drewnoakes:metadata-extractor:jar:2.4.0-beta-1:compile [INFO] +- de.l3s.boilerpipe:boilerpipe:jar:1.1.0:compile [INFO] +- rome:rome:jar:0.9:compile [INFO] | \- jdom:jdom:jar:1.0:compile [INFO] +- junit:junit:jar:3.8.1:test [INFO] +- org.mockito:mockito-core:jar:1.7:test [INFO] | +- org.hamcrest:hamcrest-core:jar:1.1:test [INFO] | \- org.objenesis:objenesis:jar:1.0:test [INFO] \- org.slf4j:slf4j-log4j12:jar:1.5.6:test [INFO] \- log4j:log4j:jar:1.2.14:test
