That's not currently supported, and in fact, I don't think we even support running OCR on specific pages within PDFs (and I do remember we've had that request occasionally). Would this be a per-file configuration or would you want to specify something for all files?
On Fri, Oct 29, 2021 at 12:55 PM Peter Kronenberg <[email protected]> wrote: > I’m pretty sure this is a capability of Tesseract, but does Tika have the > ability to specify a bounding box when OCR’ing a page? So if we want to > give it the coordinates of a single paragraph or section of a document? > > > > > > Thanks > > Peter > > > > *Peter Kronenberg* *| * *Senior AI Analytic ENGINEER * > > *C: 703.887.5623* > > [image: Torch AI] <http://www.torch.ai/> > > 4303 W. 119th St., Leawood, KS 66209 > WWW.TORCH.AI <http://www.torch.ai/> > > > > >
