Hi,

I was making detection for an image of a Japanese document, the command 
was: *tesseract image results_text -l jpn  --tessdata-dir ./tessdata -c 
preserve_interword_spaces=1 --oem 1 *, and I noticed when I add the 
argument --psm 12 the accuracy is quite  better, as far as I know that the 
PSM is concerned with page segmentation meaning if there is a table in the 
image or to correct the orientation of the text in the image, and "12" is 
using the OSD (Orientation and Script Detection)  but I don't know why the 
detection was better.

so my question is , in general, is the detection always better with the 
argument psm 12, or its just a coincidence? keep in mind the image didn't 
contain any table, but skewed a little bit

Thanks

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/ce275be1-7b24-4ea0-ac89-1e3bb722434d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to