Hi all,

I am opening a new thread for Ray's answer to my last question about Tesseract
3.00. I asked Ray when 3.00 will be released and *how users and developers
like us can help to make it happen sooner*. He said that he needs our input
on Leptonica usage, and that it would be nice if we could create a wiki page
for all Tesseract add-ons and a Windows installer for Tesseract and its
dependency libs (like libtiff) for Windows users.

From the mailing list, I see a lot of interest from people who want to use
Tesseract as an OCR engine for their mother tongue. As I understand it,
Tesseract is currently the best open source OCR engine. But it still needs
HELP from the community to make it BETTER and EASIER to use.

We all want a BETTER and EASIER Tesseract in a SHORTER time.

So please give Ray a hand with the following issues, and with the even more
interesting projects for would-be contributors to improve Tesseract, listed at
http://code.google.com/p/tesseract-ocr/wiki/TesseractProjects.

Here are my answers to Ray's questions:

*With or without Leptonica?*

I vote for Leptonica because it gives (1) better performance, (2) simpler
code, and especially (3) support for reading many more image formats.

I see no reason not to use other open source libs. It lets Tesseract
concentrate on what it does BEST: being a FAST and ACCURATE template matching
OCR engine. Let's use as many image reading libs, image processing libs, and
page analysis libs as you can, so that you have time to improve the OCR engine
and make it AS GOOD AS or even BETTER than current commercial OCR engines
again. Tesseract did it in the past. Tesseract can do it again with a lot of
love and help from the community.

For Windows users, we can create a Windows installer that includes all of
Tesseract's dependency libs, just like the GTK installer we need when we
install GIMP on Windows.

*Areas where the developer/user community can help:*
> A collation of all the add-ons (box editors, c#/.net extensions, Java,
apps built on top etc) and added to the wiki would be really helpful

Are any authors of those add-ons on the mailing list? Please help Ray collect
useful information about your valuable add-ons. Users will benefit a lot from
your help because they often ask each other how to train Tesseract for a new
language, or how to use Tesseract with other programming languages. It also
helps drive users to your add-on.

> A windows installer (see above) would be useful
Has anyone already wrapped Tesseract as a Windows installer?

Best regards,

Tien Dung


> I am hoping to push 3.00 out in the first quarter next year.
> One issue that I am concerned about (and it is why I intend to be sure 2.04
> is solid and includes as many patches as possible) is whether or not to make
> 3.00 dependent on leptonica. For certain, some of the features will only be
> available if you have it, but it is likely that 3.00 will still build and run
> without it. At some point in the future though that may not be able to
> continue.
>
> Here are some thoughts, and I would like to get input from the
> developer/user community on this issue:
>
> For leptonica:
>
>    - Some features will depend on it. To get best performance you will
>    need it.
>    - It could allow simplification of the code, and elimination of the old
>    IMAGE class.
>    - It will allow reading of many more image formats, which a lot of
>    users have requested.
>    - It might be easier if the default windows project files assume that
>    you have leptonica. That would make it easier to build with it, and it
>    would only be a case of downloading it.
>
>
> Against making tesseract dependent on leptonica:
>
>    - It will require several additional components: leptonica, libtiff,
>    libjpg, libpng, which would bloat the executable, and many (windows) users
>    have refused to even download libtiff.
>    - Installation and build support will become much more effort. (Mostly
>    for windows) If somebody could write a windows installer for it (open
>    source of course), then that would simplify installation a lot for the
>    windows user-only community.
>
> Areas where the developer/user community can help:
>
>    - A collation of all the add-ons (box editors, c#/.net extensions,
>    Java, apps built on top etc) and added to the wiki would be really helpful
>    - A windows installer (see above) would be useful
>
>
> On Wed, Nov 12, 2008 at 7:50 PM, Tien Dung <[EMAIL PROTECTED]> wrote:
>
>> Hi Ray,
>>
>> There are a lot of sweet features in 3.00 release: thread-safe, patches,
>> better modular and API ...
>>
>> I would like to ask when will the 3.00 version release?
>>
>> And how can users and developers like us can help to make it happen sooner
>> :)
>>
>> Best regards,
>>
>> Tien Dung
>>
>>
>>
