Hi all,

I am opening a new thread for Ray's answer to my last question about Tesseract 3.00. I asked Ray when 3.00 will be released and *how users and developers like us can help make it happen sooner*. He said that he needs our input on Leptonica usage, and that it would be nice if we could create a wiki page for all Tesseract add-ons and a Windows installer for Tesseract and its dependency libraries (like libtiff) for Windows users.

From the mailing list, I have seen a lot of interest from people who want to use Tesseract as an OCR engine for their mother tongue. As I understand it, Tesseract is currently the best open source OCR engine. But it still needs HELP from the community to make it BETTER and EASIER to use. We always want a BETTER and EASIER Tesseract in a SHORTER time. So please give Ray a hand with the following issues, and with the even more interesting projects for would-be contributors listed at http://code.google.com/p/tesseract-ocr/wiki/TesseractProjects.

Here are my answers to Ray's questions:

*With or without Leptonica?*

I vote for Leptonica because it gives (1) better performance, (2) simpler code, and especially (3) support for reading many more image formats. I see no reason not to use other open source libraries. It lets Tesseract concentrate on what it does BEST: being a FAST and ACCURATE template matching OCR engine. Let us use as many image reading, image processing, and page analysis libraries as we can, so that you have time to improve the OCR engine and make it AS GOOD AS or even BETTER than current commercial OCR engines again. Tesseract did it in the past. Tesseract can do it again with a lot of love and help from the community.

For Windows users, we can create a Windows installer that includes all of Tesseract's dependency libraries, just like the GTK installer we need when installing GIMP on Windows.

*Areas where the developer/user community can help:*

> A collation of all the add-ons (box editors, c#/.net extensions, Java, apps built on top etc) and added to the wiki would be really helpful

Are any authors of those add-ons on the mailing list? Please help Ray collect useful information about your valuable add-ons. Users will benefit a lot from your help, because they often ask each other questions such as: How do I train Tesseract for a new language? How can I use Tesseract from another programming language? It also helps drive users to your add-on.

> A windows installer (see above) would be useful

Has anyone already wrapped Tesseract as a Windows installer?

Best regards,
Tien Dung

> I am hoping to push 3.00 out in the first quarter next year.
>
> One issue that I am concerned about (and it is why I intend to be sure 2.04
> is solid and includes as many patches as possible) is whether or not to make
> 3.00 dependent on leptonica. For certain, some of the features will only be
> available if you have it, but it is likely that 3.00 will still build and run
> without it. At some point in the future though that may not be able to
> continue.
>
> Here are some thoughts, and I would like to get input from the
> developer/user community on this issue:
>
> For leptonica:
>
> - Some features will depend on it. To get best performance you will
>   need it.
> - It could allow simplification of the code, and elimination of the old
>   IMAGE class.
> - It will allow reading of many more image formats, which a lot of
>   users have requested.
> - It might be easier if the default windows project files assume that
>   you have leptonica. That would make it easier to build with it, and it
>   would only be a case of downloading it.
>
> Against making tesseract dependent on leptonica:
>
> - It will require several additional components: leptonica, libtiff,
>   libjpg, libpng, which would bloat the executable, and many (windows) users
>   have refused to even download libtiff.
> - Installation and build support will become much more effort. (Mostly
>   for windows) If somebody could write a windows installer for it (open
>   source of course), then that would simplify installation a lot for the
>   windows user-only community.
>
> Areas where the developer/user community can help:
>
> - A collation of all the add-ons (box editors, c#/.net extensions,
>   Java, apps built on top etc) and added to the wiki would be really helpful
> - A windows installer (see above) would be useful
>
> On Wed, Nov 12, 2008 at 7:50 PM, Tien Dung <[EMAIL PROTECTED]> wrote:
>
>> Hi Ray,
>>
>> There are a lot of sweet features in 3.00 release: thread-safe, patches,
>> better modular and API ...
>>
>> I would like to ask when will the 3.00 version release?
>>
>> And how can users and developers like us can help to make it happen sooner
>> :)
>>
>> Best regards,
>>
>> Tien Dung

