Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change 
notification.

The "TikaOCR" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/TikaOCR?action=diff&rev1=9&rev2=10

  
  = OCR and PDFs =
  
+ With Tika server, the PDFConfig is generated for each document, so any PDFParser settings in the tika-config.xml file are overwritten.
+ Instead, specify PDFParser configurations per request via HTTP headers, like so:
+ 
+ `curl -T testOCR.pdf http://localhost:9998/rmeta/text --header "X-Tika-PDFextractInlineImages: true"`
+ 
  See [[https://wiki.apache.org/tika/PDFParser%20%28Apache%20PDFBox%29|PDFParser notes]].
  
  = Disable Tika OCR =
