On Fri, Aug 27, 2021 at 05:55:44PM -0700, Al Kossow via cctalk wrote: > I was also just thinking you would probably have to have a layer (black) with > all of the > stuff to OCR including the stuff in red and blue, then overlay the color on > that > after the pass through whatever you're using to do the OCR. > > The one bottleneck I would really like to fix is getting the 24 cores on my > machine doing > OCR on 24 different pages at the same time. > >
The documentation for ocrmypdf describes how to do that. https://ocrmypdf.readthedocs.io/en/latest/ and https://ocrmypdf.readthedocs.io/en/latest/batch.html Don