Re: [tesseract-ocr] Tesseract 4 new Font

2017-05-17 Thread ShreeDevi Kumar
1. Which --oem are you using with tesseract 4, legacy engine or lstm? --oem 0 or --oem 1 2. Is Brazilian Portuguese very different from Portuguese? Please see the trainingtext and wordlists on https://github.com/tesseract-ocr/langdata/tree/master/por 3. Provide a sample image with it's ground

[tesseract-ocr] Tesseract 4 new Font

2017-05-17 Thread Maicon Azevedo
Hello! Guys I have tesseract 4 on Ubuntu 16.04. Running the tesseract with -l por (portuguese from Brazil) I don't have the good results. The image use other font than the trained data (I think). My question is. It's necessary to train tesseract again? I created the tif and box file with