Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "Troubleshooting Tika" page has been changed by NickBurch: https://wiki.apache.org/tika/Troubleshooting%20Tika?action=diff&rev1=3&rev2=4 = Troubleshooting Apache Tika = - <<TableOfContents(3)>> Apache Tika is great when it works, but by default can be silently forgiving of configuration mistakes. Here we'll try to cover some of the main problems, and how to go about diagnosing them Note that while the underlying cause is often the same no matter how you call Tika, the way of telling what's wrong can vary between them. + + <<TableOfContents(3)>> == Wrong Content Extracted == * Make sure you're passing Tika the source file you meant to pass, and it hasn't been corrupted in the transfer process @@ -18, +19 @@ * Make sure you're actually using the version of Tika you meant to use! See '''Identifying your Tika Version''' == Wrong Parser Used == + * Make sure Tika is able to correctly detect your file's type, see '''Content Incorrectly Detected''' + * Make sure the parser you wanted to use is available to Tika. See '''Identifying what Parsers your Tika install supports''', '''Parsers Missing''' and '''Identifying is any Parsers failed to be loaded''' == Content Incorrectly Detected == + Tika detects content types based on mime magic, format (normally container) specific detectors, content type hints and filename hints. + + Things to check: + * Does Tika know about your type? See '''Identifying what Mime Types your Tika install supports''' + * If Tika doesn't out of the box, you need to add a custom mimetypes file. See [[http://tika.apache.org/1.9/parser_guide.html#Add_your_MIME-Type|the quick guide]] for how + * If you have written a custom mimetypes file, it needs to be present on your classpath at runtime with the ''exact'' name of `org/apache/tika/mime/custom-mimetypes.xml` . Double check you added it to your classpath, it has exactly that name (no typos, no prefix directories, no suffixes etc), and use '''Identifying what Mime Types your Tika install supports''' to see if you've loaded it or not == Parsers Missing ==
