[tesseract-ocr] Difference between --psm 1 and --psm 3 (default)?

Pranas Žiaukas Wed, 18 Sep 2019 05:54:34 -0700

Page segmentation modes: 
  0    Orientation and script detection (OSD) only. 
  1    Automatic page segmentation with OSD. 
  2    Automatic page segmentation, but no OSD, or OCR. (not implemented) 
  3    Fully automatic page segmentation, but no OSD. (Default) 
  4    Assume a single column of text of variable sizes. 
  5    Assume a single uniform block of vertically aligned text. 
  6    Assume a single uniform block of text. 
  7    Treat the image as a single text line. 
  8    Treat the image as a single word. 
  9    Treat the image as a single word in a circle. 
 10    Treat the image as a single character. 
 11    Sparse text. Find as much text as possible in no particular order. 
 12    Sparse text with OSD. 
 13    Raw line. Treat the image as a single text line, 
       bypassing hacks that are Tesseract-specific.
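
For reference, these modes are selected with the --psm flag on the command line. Below is a minimal Python sketch of invoking them, assuming the tesseract binary is installed and on PATH. The helper names and the file name page.png are hypothetical and only serve as illustration.

```python
import subprocess


def tesseract_command(image_path, psm):
    """Build a tesseract CLI invocation that prints recognized text to stdout."""
    if not 0 <= psm <= 13:
        raise ValueError(f"psm must be between 0 and 13, got {psm}")
    # Passing "stdout" as the output base makes tesseract write to standard output.
    return ["tesseract", image_path, "stdout", "--psm", str(psm)]


def run_ocr(image_path, psm=3):
    """Run tesseract and return its text output (requires tesseract installed)."""
    result = subprocess.run(
        tesseract_command(image_path, psm),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Calling run_ocr("page.png", psm=1) and run_ocr("page.png", psm=3) on the same rotated page would be one way to compare the two modes directly.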



I have a batch of widely varying documents, some of them rotated (by 
90, 180, or 270 degrees), and I'm trying to get Tesseract to work on as many 
of them as possible.

What's the exact difference between "Automatic page segmentation with OSD" 
in --psm 1 and "Fully automatic page segmentation, but no OSD" in --psm 3 
besides the OSD part?
If it does exactly the same thing and just tries to detect orientation 
beforehand, I'd assume it's pretty safe to use in most cases? I'm willing 
to sacrifice performance and maybe some quality in complex cases.
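
One way to look at the orientation step in isolation is to run --psm 0 (OSD only) on a page and read the reported rotation before doing any OCR. The sketch below parses such a report. The sample text is an assumed example of the report format, not output from a real run, and the function name and confidence threshold are hypothetical.

```python
import re

# Assumed example of an OSD report; real values depend on the image.
SAMPLE_OSD = """Page number: 0
Orientation in degrees: 180
Rotate: 180
Orientation confidence: 5.12
Script: Latin
Script confidence: 2.00
"""


def parse_osd(osd_text):
    """Return (rotation in degrees, orientation confidence) from an OSD report."""
    rotate = re.search(r"^Rotate:\s*(\d+)", osd_text, re.MULTILINE)
    conf = re.search(r"^Orientation confidence:\s*([\d.]+)", osd_text, re.MULTILINE)
    if rotate is None or conf is None:
        raise ValueError("no orientation information found in OSD report")
    return int(rotate.group(1)), float(conf.group(1))


def rotation_to_apply(osd_text, min_confidence=2.0):
    """Return the rotation to apply, or 0 when the detection looks unreliable."""
    degrees, confidence = parse_osd(osd_text)
    # The threshold is an arbitrary illustrative value, not a recommended setting.
    return degrees if confidence >= min_confidence else 0
```

Logging the parsed rotation and confidence per document would show how often orientation detection is uncertain on the batch.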

