Also, what sort of results are you getting if you recognize one character at a time instead of an entire expression?
On Wed, Dec 18, 2019, 11:45 PM Timothy Snyder <[email protected]> wrote: > Could you provide sample images from the training and testing set? I > haven't tried training Tesseract with single characters at a time but you > might want to try training on whole expressions like x+y=0. > > On Wed, Dec 18, 2019, 11:39 PM Haris Sheikh <[email protected]> wrote: > >> hi i'm using Linux (ubuntu), >> i tried tesseract training by following this >> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 >> and i used data set like: >> '=' folder -> 26,000 .jpg image files in which = is written in different >> forms >> '+' folder -> 30,000 .jpg image files in which + is written in different >> forms >> so on >> >> i take all the images from each folder and paste it into ground-truth >> folder and converted those images into .tif format and also created their >> labels in .gt.txt format >> then execute the command: "make training" >> it worked fine and it took 5-6 hours to train the dataset, after that i >> used the data/foo.traineddata file and paste into >> /usr/local/share/tessdata/ directory and >> run command: "tesseract --list-langs" it showed me that there is my file >> and then >> >> *Issue is this:* >> >> when i use a sample image having "x+y=0" written, and run tesseract as my >> language it gives me output as "xxxx" *why?* >> >> *please tell me where i get wrong!* >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/9c24849b-69a5-4f6d-928f-da17420adfa3%40googlegroups.com >> <https://groups.google.com/d/msgid/tesseract-ocr/9c24849b-69a5-4f6d-928f-da17420adfa3%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CABtjQ9KSROmcXYQYgF2MUMrgdE2qsJfgu-MV%3DMnu_wt_cxsU-A%40mail.gmail.com.

