Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "PDFParser (Apache PDFBox)" page has been changed by TimothyAllison: https://wiki.apache.org/tika/PDFParser%20%28Apache%20PDFBox%29?action=diff&rev1=6&rev2=7 == OCR == + Note: the configuration of some of these features via the config file requires a nightly build of Tika after 11/8/2016 or Tika version >= 1.15. + Start with the instructions on [[https://wiki.apache.org/tika/TikaOCR|TikaOCR]]. In short, you need to have Tesseract installed. There are two ways of running OCR on PDFs:
