Is there a way to disable Tesseract's automatic image pre-processing? I am running an ImageMagick script to process the picture beforehand (it works a lot better than tesseract's own preprocessing), so I want to make sure that Tesseract doesn't waste any time doing this. Apart from taking a lot of time, tesseract's preprocessing somewhy rotates the image sideways, even when it is correctly oriented to begin with. Does anybody know why it does this?
Oh, and another random thing: can somebody explain what the '-psm 2' option does? It says that it segments without doing OSD or OCR... so what output is there exactly? I am trying to maximise speed so it would be great if tesseract could segment the page and then distribute the parts to separate servers to perform OCR in parallel, but I can't figure out what '-psm 2' actually does. -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.

