Hi,  Just wanted to say that I spent the last few days studying your 'Using 
the latest tesseract' and related pages.  I think those are the best (and 
most timely) tesseract help docs I've seen and they were crucial in helping 
me build tesseract and helper utils and the example app in VC2008.  So now 
I have the tools to do what I wanted to do.  Yep, finally got Tortoise and 
repository check out working and VC2008 downloaded and working and 
everything buildable from code.  Thanks so much zdenop and TP!  I see that 
you two are powerhouses on the subject.

The reason I wanted to get tesseract working from console/dos/cmd (and I'll 
post this under a more relevant topic) was because I need the ACTUAL 
positions of character boxes from a recognizable text block.  NOT the 
horizontally normalized result of (in Java) getUTF8Text() / 
getCharacters().getBoxRects().  I noticed that the result of the

tesseract -l 01 01a_r1.bmp 01a_r1 batch.nochop makebox

training component DOES return the true positions.  I'm wanting to 
determine text line angles by taking advantage of a known 
multi-block/region format of my input pages over finite horizontal 
distances.  I want it as an option to other approaches to angular (and 
scale and translation) correction, knowing full well that tesseract is 
quite good at reading such non-ideal text already.  At least previous 
(<3.02) versions of tesseract can do that allright.  Just not report back 
what that off-angle actually WAS.  That info may well be included by now. 
 Dunno.  I've already explored min enclosing rectangles (and the angular 
info available from them) too.  The only other fellow I could find who 
needed the same thing I found here:

https://groups.google.com/forum/#!msg/tesseract-ocr/_J7Foxqluyg/rQhjCbMPOwQJ

My ultimate app is Java aboard Android.  I also thought the leap from dos 
exe to Android would be too much of a single leap.  Best to start with 
something reportedly working.  But you may already have shown me the 
solution, amazing Mr. zdenop, at:

http://95.154.230.254/ip-1/encoded/Oi8vcGFzdGViaW4uY29tL0MxR1pLOWJ2

But if anyone knows of another angle/translation/scale image correction 
approach (or code), I'd love to hear about it.  I.e. Image stabilization.

Thanks again zdenop and TP!

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to