On Wednesday, July 31, 2013 3:20:43 AM UTC-4, [email protected] wrote:

> I am doing some research on tesseract-ocr, the most famous OCR software in 
> the world. However, I am stuck on some of the code (for example, the feature 
> extraction algorithms in the function ExtractIntFeat). I spent a long time 
> searching Google for an explanation of the algorithms but found nothing. 
> The only possible explanation, I think, may be in Ray Smith's Ph.D. 
> dissertation, "The Extraction and Recognition of Text from Multimedia 
> Document Images", but I cannot find it on the Internet. Can somebody kindly 
> send me a copy of the dissertation, or give me some suggestions so 
> that I can read through the Tesseract code?
>

You can get it here: 
http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.380162

There are also a number of more recent papers on Ray's page at Google 
Research.

Tom

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
