Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change 
notification.

The "Troubleshooting Tika" page has been changed by NickBurch:
https://wiki.apache.org/tika/Troubleshooting%20Tika?action=diff&rev1=3&rev2=4

  = Troubleshooting Apache Tika =
- <<TableOfContents(3)>>
  Apache Tika is great when it works, but by default can be silently forgiving 
of configuration mistakes. Here we'll try to cover some of the main problems, 
and how to go about diagnosing them
  
  Note that while the underlying cause is often the same no matter how you call 
Tika, the way of telling what's wrong can vary between them.
+ 
+ <<TableOfContents(3)>>
  
  == Wrong Content Extracted ==
   * Make sure you're passing Tika the source file you meant to pass, and it 
hasn't been corrupted in the transfer process
@@ -18, +19 @@

   * Make sure you're actually using the version of Tika you meant to use! See 
'''Identifying your Tika Version'''
  
  == Wrong Parser Used ==
+  * Make sure Tika is able to correctly detect your file's type, see 
'''Content Incorrectly Detected'''
+  * Make sure the parser you wanted to use is available to Tika. See 
'''Identifying what Parsers your Tika install supports''', '''Parsers 
Missing''' and '''Identifying is any Parsers failed to be loaded'''
  
  == Content Incorrectly Detected ==
+ Tika detects content types based on mime magic, format (normally container) 
specific detectors, content type hints and filename hints.
+ 
+ Things to check:
+  * Does Tika know about your type? See '''Identifying what Mime Types your 
Tika install supports'''
+  * If Tika doesn't out of the box, you need to add a custom mimetypes file. 
See [[http://tika.apache.org/1.9/parser_guide.html#Add_your_MIME-Type|the quick 
guide]] for how
+  * If you have written a custom mimetypes file, it needs to be present on 
your classpath at runtime with the ''exact'' name of 
`org/apache/tika/mime/custom-mimetypes.xml` . Double check you added it to your 
classpath, it has exactly that name (no typos, no prefix directories, no 
suffixes etc), and use '''Identifying what Mime Types your Tika install 
supports''' to see if you've loaded it or not
  
  == Parsers Missing ==
  

Reply via email to