My suggestion would be to post-process the OCR output.
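
As a rough illustration, here is a minimal Python sketch of such a post-processing step for the line-initial "oo" marker. The list of misreadings is purely an assumption on my part (I have not seen the actual Tesseract output), and the names ASSUMED_MISREADS and fix_wedding_marker are hypothetical, so collect the real variants from your pages first.

```python
import re

# ASSUMPTION: these misreadings are invented for illustration only.
# Replace them with the variants Tesseract really produces on your pages.
ASSUMED_MISREADS = ("00", "0o", "o0", "OO", "co")

# Match a misreading only at the start of a line (after optional indent)
# and only when it stands alone, i.e. is followed by whitespace.
_MARKER = re.compile(
    r"^([ \t]*)(?:" + "|".join(re.escape(m) for m in ASSUMED_MISREADS) + r")(?=\s)",
    re.MULTILINE,
)


def fix_wedding_marker(text: str) -> str:
    """Rewrite assumed misreadings of 'oo' at the start of each line."""
    return _MARKER.sub(r"\1oo", text)


if __name__ == "__main__":
    sample = "00 12.05.1801 Anna\nborn 1800 in town"
    print(fix_wedding_marker(sample))
```

Restricting the match to the start of a line keeps numbers such as "1800" elsewhere in the text untouched. It will still rewrite a genuine standalone word from the list at the start of a line, so keep the list as narrow as your data allows.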
On Mon, 2 Apr 2018, 6:09 PM, JP T wrote:
> Hi
>
> I don't really understand the consequences of training.
>
> My problem:
> I've got tons of pages with a special format. ("one place study"
Hi
I don't really understand the consequences of training.
My problem:
I've got tons of pages with a special format. ("one place study" about the
historic inhabitants of a town)
Tesseract repeatedly fails on a few special words:
oo (oh-oh) at start of line for "wedding" is often