Zachary Lee Jones created TIKA-2366:
---------------------------------------
Summary: Add image cropping functionality to TesseractOCRParser
Key: TIKA-2366
URL: https://issues.apache.org/jira/browse/TIKA-2366
Project: Tika
Issue Type: Improvement
Components: ocr
Affects Versions: 1.14
Environment: ImageMagick-7.0.5, Tesseract 3.0.5
Reporter: Zachary Lee Jones
Priority: Trivial

I am using Tika's TesseractOCRParser to read scanned PDF files. It would be
nice if I could use ImageMagick's crop operation through the
TesseractOCRParser so that document headers and footers can be ignored
during OCR.
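
To make the request concrete, here is a rough sketch of the kind of
preprocessing step I have in mind. Everything in it is an assumption for
illustration: the class name, the buildCropCommand helper and its parameters
are hypothetical and are not part of Tika. It only builds an ImageMagick
command line using the standard -crop geometry (WIDTHxHEIGHT+X+Y) and
+repage, assuming the page size and the header/footer heights are known in
pixels and that the ImageMagick 7 "magick" executable is on the PATH.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not Tika code: builds an ImageMagick command that
// keeps only the body of a page image, dropping a header and footer strip.
class CropCommandSketch {

    static List<String> buildCropCommand(String input, String output,
                                         int pageWidth, int pageHeight,
                                         int headerPx, int footerPx) {
        int bodyHeight = pageHeight - headerPx - footerPx;
        if (headerPx < 0 || footerPx < 0 || bodyHeight <= 0) {
            throw new IllegalArgumentException("header/footer do not fit the page");
        }
        List<String> cmd = new ArrayList<>();
        cmd.add("magick");   // ImageMagick 7 entry point (assumed to be on the PATH)
        cmd.add(input);
        cmd.add("-crop");
        // Geometry WIDTHxHEIGHT+X+Y: full width, body height, offset below the header.
        cmd.add(pageWidth + "x" + bodyHeight + "+0+" + headerPx);
        cmd.add("+repage");  // reset the virtual canvas after cropping
        cmd.add(output);
        return cmd;
    }

    public static void main(String[] args) {
        // Example values only: an A4 page at 300 dpi with a 200 px header
        // and a 150 px footer.
        List<String> cmd = buildCropCommand("page.png", "cropped.png",
                                            2480, 3508, 200, 150);
        System.out.println(String.join(" ", cmd));
    }
}
```

The resulting list could be handed to java.lang.ProcessBuilder before the
image goes to Tesseract. How the crop region would actually be configured in
the parser is left open here.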