RE: [tesseract-ocr] Bounding boxes

2018-04-30 Thread Art Rhyno .
At least one way is to use the tesseract API, see "Result iterator example" and "Example of iterator over the classifier choices for a single symbol" [1] in the wiki. You can use the same BoundingBox call, shown in the first example for tesseract::RIL_WORD, with tesseract::RIL_SYMBOL, in order

Re: [tesseract-ocr] tesseract performs wrong auto-correction sometimes : how to disable it?

2018-04-30 Thread shree
Added to issue on GitHub https://github.com/tesseract-ocr/tesseract/issues/733 On Thursday, April 26, 2018 at 1:35:30 PM UTC+5:30, Youcef wrote: > > > I'm using master branch with tessdata_fast models > > Le mercredi 25 avril 2018 18:49:22 UTC+2, shree a écrit : > >> Which version of tesseract

[tesseract-ocr] Bounding boxes

2018-04-30 Thread eya garci
How can I extract bounding boxes file and coordinates of each character using tesseract 4.0 ? -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to

Re: [tesseract-ocr] Tesseract config for simple single words text and questions about learning

2018-04-30 Thread Lorenzo Bolzani
Hello ShreeDevi, thanks for your answer. I tried to use the 4.0 version but I get a different kind of errors. And, as far as I know , the whitelist is not yet supported in the 4.0 version so I decided to go with the 3.05 because I think this

Re: [tesseract-ocr] Trained font - always one letter wrong

2018-04-30 Thread ShreeDevi Kumar
Use the latest version 4.0.0beta On Sun 29 Apr, 2018, 1:51 PM , wrote: > I did. Unfortunately they don't aswer... > Have you any advice for me, to improve the > training proccess? How many training texts should i use? Or is it possible > that there is a problem with