Re: [tesseract-ocr] Tesseract 4 + OpenCL?

2020-03-10 Thread JB Data31
A recent successful build with V5. I think the documentation is more for V5 and less for V3. @*JB*Δ On Mon, 9 Mar 2020 at 23:37, Matt Chapman

[tesseract-ocr] Re: Need of bounding boxes coordinates of individual letters from image in hocr format

2020-03-10 Thread Juanjo Serrano Lloria
Hi, Did you try enabling the makebox config? Example on the command line: tesseract isis_0153.png isis_0153 makebox hocr On Tuesday, 10 March 2020, 10:47:09 (UTC+1), Preetilatha Ramalingam wrote: > Hi, > I'm able to get the bounding box coordinates of words in hocr format >
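For anyone who wants to consume the resulting .box file from a script, here is a minimal sketch of reading one line of it. It assumes the usual box layout of one glyph per line, `<char> <left> <bottom> <right> <top> <page>`, with the origin at the bottom-left of the image. The names `CharBox` and `parse_box_line` are made up for illustration, and the image height has to be supplied by the caller.

```python
from typing import NamedTuple


class CharBox(NamedTuple):
    """One glyph with a top-left-origin bounding box (hypothetical helper type)."""
    char: str
    left: int
    top: int
    right: int
    bottom: int


def parse_box_line(line: str, image_height: int) -> CharBox:
    """Parse one box-file line and flip the y axis to a top-left origin.

    Assumed line layout: "<char> <left> <bottom> <right> <top> <page>".
    """
    # Split from the right so the glyph itself is never split apart.
    char, left, bottom, right, top, _page = line.rstrip("\n").rsplit(" ", 5)
    return CharBox(
        char=char,
        left=int(left),
        top=image_height - int(top),        # box "top" is measured from the bottom edge
        right=int(right),
        bottom=image_height - int(bottom),
    )
```

Calling `parse_box_line` on each line of isis_0153.box, with the height of isis_0153.png, would give per-letter rectangles in the same coordinate convention that hOCR uses for words.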

[tesseract-ocr] Need of bounding boxes coordinates of individual letters from image in hocr format

2020-03-10 Thread Preetilatha Ramalingam
Hi, I'm able to get the bounding box coordinates of words in hocr format using the function pytesseract.image_to_pdf_or_hocr(imge, lang='eng', extension='hocr') and I get the output below. <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en"
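As a rough illustration of what the word-level hOCR contains, the sketch below pulls the `bbox` values out of `ocrx_word` spans with a regular expression. The sample markup is invented for the example and is not the poster's actual output, and `hocr_word_boxes` is a hypothetical helper. pytesseract should return the hOCR as bytes, so it would need decoding before being passed in.

```python
import re

# Matches <span class='ocrx_word' ... title='bbox x0 y0 x1 y1; ...'>text</span>
WORD_RE = re.compile(
    r"<span class=['\"]ocrx_word['\"][^>]*"
    r"title=['\"]bbox (\d+) (\d+) (\d+) (\d+)[^'\"]*['\"][^>]*>(.*?)</span>",
    re.S,
)

# Invented sample in the shape of Tesseract's hOCR word spans.
SAMPLE = (
    "<span class='ocrx_word' id='word_1_1' "
    "title='bbox 36 92 96 116; x_wconf 95'>Hello</span> "
    "<span class='ocrx_word' id='word_1_2' "
    "title='bbox 109 92 178 116; x_wconf 96'><strong>world</strong></span>"
)


def hocr_word_boxes(hocr: str):
    """Return (text, x0, y0, x1, y1) for every ocrx_word span in the hOCR string."""
    boxes = []
    for x0, y0, x1, y1, inner in WORD_RE.findall(hocr):
        text = re.sub(r"<[^>]+>", "", inner).strip()  # drop nested tags such as <strong>
        boxes.append((text, int(x0), int(y0), int(x1), int(y1)))
    return boxes
```

A regular expression is enough for a quick look. For anything more robust, an HTML/XML parser would be the safer choice.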

[tesseract-ocr] Tesseract box file not recognizing some character/word.

2020-03-10 Thread Peter
Hi, When I create box files with Tesseract and view them in the editor, there are cases (see attached picture) where some of the words don't have bounding boxes. Is this because Tesseract cannot interpret the character/word? [image: sample.PNG] Currently using Tesseract v5.0.0-alpha. Thanks,