1. Which --oem are you using with tesseract 4, legacy engine or lstm?
--oem 0 or --oem 1
2. Is Brazilian Portuguese very different from Portuguese? Please see the
trainingtext and wordlists on
https://github.com/tesseract-ocr/langdata/tree/master/por
3. Provide a sample image with it's ground
Hello!
Guys I have tesseract 4 on Ubuntu 16.04.
Running the tesseract with -l por (portuguese from Brazil) I don't have
the good results. The image use other font than the trained data (I think).
My question is. It's necessary to train tesseract again? I created the tif
and box file with
2 matches
Mail list logo