[tesseract-ocr] Is the OSD (Orientation and Script Detection) supported in LSTM mode?

Martin Brunecky Tue, 10 May 2022 11:32:26 -0700

I cannot find this anywhere in the documentation, but the LSTM engine does not 
seem to support the legacy engine's orientation and script detection (OSD). I 
recall some hints that script detection is not supported, but nothing 
about orientation...


I found that with oem 1 (LSTM), the API call OSDetect(&osr) on a 
rotated image returns zero rotation. With the 'legacy' engine (oem 
0), the same call returns the correct image rotation (90 degrees in my 
case) with reasonable confidence.
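
To compare the two engines side by side, each OSD result can be reduced to degrees plus confidence. The sketch below is stand-alone and does not call Tesseract. OsdReading, to_degrees and engines_disagree are hypothetical names, and it assumes the orientation comes back as a quadrant id 0..3 (as I understand osr.best_result.orientation_id), with 90 degrees per quadrant.

```cpp
#include <cassert>

// Hypothetical holder for what one OSD run reports (not a Tesseract type).
struct OsdReading {
    int orientation_id;  // assumed: quadrant 0..3
    float confidence;    // assumed: orientation confidence from the same run
};

// Convert a quadrant id to degrees (0, 90, 180, 270).
int to_degrees(int orientation_id) {
    assert(orientation_id >= 0 && orientation_id <= 3);
    return orientation_id * 90;
}

// True when the two engines report different rotations for the same image.
bool engines_disagree(const OsdReading& legacy, const OsdReading& lstm) {
    return to_degrees(legacy.orientation_id) != to_degrees(lstm.orientation_id);
}
```

Filling one OsdReading from the oem 0 run and one from the oem 1 run makes the mismatch explicit: in my case the legacy reading would be quadrant 1 (90 degrees) and the LSTM reading quadrant 0.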

Similarly, when iterating over the results with the 'legacy' engine (oem 0) in 
mode 1 (Automatic page segmentation with OSD), I get both the grapheme 
character and its orientation. With the LSTM engine, I see (in the 
debugger) that the graphemes have a valid character but an empty bounding 
box (0,0,0,0), because internally in
    it->BoundingBox(tesseract::RIL_SYMBOL, &left, &top, &right, &bottom)
the code runs into 'bogus' coordinates and clips them all to zero.
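
The same check can be done without a debugger by collecting the coordinates that BoundingBox writes for each symbol and counting the all-zero ones. This is a minimal stand-alone sketch. SymbolBox, is_zero_box, count_zero_boxes and zero_box_ratio are hypothetical helper names, not part of the Tesseract API.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical container for the four values written by BoundingBox().
struct SymbolBox {
    int left, top, right, bottom;
};

// True for the all-zero box (0,0,0,0) described above.
bool is_zero_box(const SymbolBox& b) {
    return b.left == 0 && b.top == 0 && b.right == 0 && b.bottom == 0;
}

// Count how many collected symbol boxes are all-zero.
std::size_t count_zero_boxes(const std::vector<SymbolBox>& boxes) {
    std::size_t n = 0;
    for (const SymbolBox& b : boxes) {
        if (is_zero_box(b)) ++n;
    }
    return n;
}

// Fraction of all-zero boxes; the caller must pass at least one box.
double zero_box_ratio(const std::vector<SymbolBox>& boxes) {
    assert(!boxes.empty());
    return static_cast<double>(count_zero_boxes(boxes)) / boxes.size();
}
```

Pushing one SymbolBox per symbol inside the iteration loop and printing zero_box_ratio at the end would show a ratio of 1.0 for the LSTM behaviour I describe, where every box is empty.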


