[Tika Wiki] Update of "PDFParser (Apache PDFBox)" by TimothyAllison

Apache Wiki Wed, 09 Nov 2016 08:01:30 -0800

The "PDFParser (Apache PDFBox)" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/PDFParser%20%28Apache%20PDFBox%29?action=diff&rev1=6&rev2=7

  
  
  == OCR ==
+ Note: the configuration of some of these features via the config file requires a nightly build of Tika after 11/8/2016 or Tika version >= 1.15.
+ 
  Start with the instructions on [[https://wiki.apache.org/tika/TikaOCR|TikaOCR]]. In short, you need to have Tesseract installed.
  
  There are two ways of running OCR on PDFs:
