Re: OCR with bounding boxes

Tim Allison Fri, 29 Oct 2021 10:57:51 -0700

That's not currently supported, and in fact, I don't think we even support
running OCR on specific pages within PDFs (and I do remember we've had that
request occasionally).  Would this be a per-file configuration or would you
want to specify something for all files?


On Fri, Oct 29, 2021 at 12:55 PM Peter Kronenberg <[email protected]>
wrote:

> I’m pretty sure this is a capability of Tesseract, but does Tika have the
> ability to specify a bounding box when OCR’ing a page?  So if we want to
> give it the coordinates of a single paragraph or section of a document?
>
>
>
>
>
> Thanks
>
> Peter
>
>
>
> *Peter Kronenberg*  *| * *Senior AI Analytic ENGINEER *
>
> *C: 703.887.5623*
>
> [image: Torch AI] <http://www.torch.ai/>
>
> 4303 W. 119th St., Leawood, KS 66209
> WWW.TORCH.AI <http://www.torch.ai/>
>
>
>
>
>

Re: OCR with bounding boxes

Reply via email to