Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "TikaOCR" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/TikaOCR?action=diff&rev1=9&rev2=10

  = OCR and PDFs =
+ With Tika server, the PDFConfig is generated for each document, so any configurations that you set in the tika-config.xml file are overwritten.
+ You need to specify PDFParser configurations in the request, like so:
+ 
+ `curl -T testOCR.pdf http://localhost:9998/rmeta/text --header "X-Tika-PDFextractInlineImages: true"`
+ 
  See [[https://wiki.apache.org/tika/PDFParser%20%28Apache%20PDFBox%29|PDFParser notes]].

  = Disable Tika OCR =
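
For readers calling Tika server from a script rather than from the shell, the curl command in the added lines above could be sketched in Python roughly as follows. This is only an illustration using the standard library: the function names `build_ocr_request` and `extract_with_inline_images` are hypothetical, the URL is the one from the curl example, and the explicit `Content-Type: application/pdf` header is an assumption added here (curl -T sends none) so that urllib does not fall back to its form-encoded default.

```python
import json
import urllib.request

# URL taken from the curl example; adjust host and port for your deployment.
TIKA_RMETA_TEXT_URL = "http://localhost:9998/rmeta/text"


def build_ocr_request(pdf_bytes, url=TIKA_RMETA_TEXT_URL):
    """Build (but do not send) a PUT request mirroring the curl call."""
    request = urllib.request.Request(url, data=pdf_bytes, method="PUT")
    # PDFParser setting passed per request, because Tika server generates
    # a new PDFConfig for each document.
    request.add_header("X-Tika-PDFextractInlineImages", "true")
    # Assumption: the payload is a PDF. Without this, urllib would send
    # a form-encoded Content-Type by default.
    request.add_header("Content-Type", "application/pdf")
    return request


def extract_with_inline_images(pdf_path, url=TIKA_RMETA_TEXT_URL):
    """Send the PDF to a running Tika server and return the parsed JSON."""
    with open(pdf_path, "rb") as handle:
        request = build_ocr_request(handle.read(), url)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `extract_with_inline_images("testOCR.pdf")` requires a Tika server listening on the URL above; `build_ocr_request` can be inspected without one.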
