On 5 July 2010 07:23, Lars Aronsson <[email protected]> wrote:
> On 07/04/2010 11:47 PM, fontenot.1031 wrote:
>>>
>>> You'll need to upscale the image.  Before reducing it to
>>
>> Thanks for responding. I really appreciate it. Can you tell me what
>> upscaling is or how to do it with ImageMagick? I don't know that much
>> about images, jpeg or tiff. Thanks a lot. (also I think the imgur link
>> is messed up because the version on my computer is a lot bigger /
>> clearer).
>
> I think that Tesseract, in order to be a successful project, must
> be much more clear about what it is offering.
>

From the README:

"About the Engine

This code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO
OUTPUT FORMATTING, and NO UI."

What's unclear about that?

> Now many people believe it is "an OCR program" that can function as
> an alternative to commercial end user products.

Those people clearly haven't bothered to read the README.

> Some open source
> software in other fields (especially OpenOffice and Firefox) can
> meet such expectations. So it's natural that complete beginners
> come to this list with basic questions about what a bitmap image

No, it's not, really. Nobody comes to the Firefox mailing list asking
what a webpage is.

> is. The commercial end user products would not bother their
> customers with such details.
>
> But today's Tesseract is much more like a subroutine library
> that requires or at least assumes that its users are programmers.

There are a number of GUIs out there for Tesseract, both open source
and commercial. OCRFeeder is the last one I saw a demo of; it's quite
nice. If you want to point and click at things and not think about what
you're doing, maybe you should use that.

> The experts on this list are not really interested in explaining
> what a bitmap image is. This mismatch comes from the failure to
> explain what Tesseract is.

It comes from the failure to read the explanation of what it is.
People are lazy, sure, I understand that. But I for one don't intend
to spend a whole lot of time accommodating that.

In future, please do not hijack threads. Your interjection has nothing
to do with the question at hand -- that image would pose a similar
problem for commercial OCR systems, too. I'll bet you a beer that
FineReader will pick nothing out of that image either, and FineReader
does not make any attempt to rescale images.

-- 
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.

