Hi!

On 2015-10-27 16:10, Allistair wrote:
> Firstly I do not get Empty Page with Tesseract 3 on Mac. It reads a
> couple of lines then gives up.

Yes, that's true -- this particular example gives a few lines (actually
the *later* ones, rather than the first few before giving up).  But with
a slightly different example, I sometimes get "Empty Page" as well.

> I was able to get it reading everything by cropping it to the same
> amount as Working but then rotating it anti clockwise by just a few
> degrees - I tried this because I noticed the text was rotated -
> Tesseract is meant to handle this but you just need to try stuff out
> sometimes.

Ah ok, that's a good hint!  I'll try rotating my other samples and see
if it helps!

> It does mean though that you will need to do preprocessing before
> handing off to Tesseract to get whatever you're doing working.

I already do preprocessing to get the pictures as posted.  The originals
are colour photos of yellow print on a black monitor. ;)

It should be fine to also add some rotation to the preprocessing as needed.
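
As a rough illustration of such a rotation step, here is a minimal
pure-Python sketch.  It is not my actual preprocessing code: the function
name rotate_image, the list-of-rows grayscale representation and the
white fill value are all assumptions made for the example.  It rotates
the image counter-clockwise about its centre using nearest-neighbour
sampling.

```python
import math


def rotate_image(pixels, angle_deg, fill=255):
    """Rotate a grayscale image counter-clockwise about its centre.

    `pixels` is a list of equally long rows of integer gray values
    (a hypothetical representation chosen for this sketch).  The output
    has the same size as the input; areas that rotate in from outside
    the source are set to `fill` (white by default).
    """
    height = len(pixels)
    width = len(pixels[0]) if height else 0
    cx, cy = (width - 1) / 2.0, (height - 1) / 2.0
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)

    out = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            dx, dy = x - cx, y - cy
            # Inverse mapping: find the source pixel that ends up at
            # (x, y).  The image y axis points down, which is why the
            # signs differ from the usual textbook rotation matrix.
            sx = cos_t * dx - sin_t * dy + cx
            sy = sin_t * dx + cos_t * dy + cy
            ix, iy = int(round(sx)), int(round(sy))
            if 0 <= ix < width and 0 <= iy < height:
                out[y][x] = pixels[iy][ix]
    return out
```

For slightly tilted text one would call this with a small angle, e.g.
rotate_image(img, 2.0), and try a few values around that to see which
one Tesseract reads best.  In practice an imaging library is the better
tool (Pillow's Image.rotate, for instance, also takes a counter-clockwise
angle in degrees); the sketch only shows what the step does.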

Yours,
Daniel

-- 
http://www.domob.eu/
OpenPGP: 1142 850E 6DFF 65BA 63D6  88A8 B249 2AC4 A733 0737
Namecoin: id/domob -> https://nameid.org/?name=domob
--
Done:  Arc-Bar-Cav-Hea-Kni-Ran-Rog-Sam-Tou-Val-Wiz
To go: Mon-Pri

