Also http://tesseract-ocr.googlecode.com/svn/trunk/ChangeLog
2014-02-04 v3.03 * Added new training tool text2image to generate box/tif file pairs from text and truetype fonts. * Added support for PDF output with searchable text. * Removed entire IMAGE class and all code in image directory. * Tesseract executable: support for output to stdout; limited support for one page images from stdin (especially on Windows) * Added Renderer to API to allow document-level processing and output of document formats, like hOCR, PDF. * Major refactor of word-level recognition, beam search, eliminating dead code. * Refactored classifier to make it easier to add new ones. * Generalized feature extractor to allow feature extraction from greyscale. * Improved sub/superscript treatment. * Improved baseline fit. * Added set_unicharset_properties to training tools. * Many bug fixes. * More training source data included. On Saturday, August 30, 2014 3:34:06 PM UTC+5:30, Paul Vorbach wrote: > > I think a lot of this is caused by the project home page claiming that > Tesseract 3.03 is shipped with Ubuntu 2014.04. This sounds like it is a > final release. I'd change or remove that statement. > > Roadmap >> >> Version 3.03 release candidate is now available (source only so far) for >> download and contains many new features. (See the ReleaseNotes >> <https://code.google.com/p/tesseract-ocr/wiki/ReleaseNotes> for a full >> list.) Please check out the ReadMe >> <http://code.google.com/p/tesseract-ocr/wiki/ReadMe> before going to >> Downloads <http://code.google.com/p/tesseract-ocr/downloads/list> as you >> need more than one file. *Even the windows executables tarball is >> incomplete as language files are required.* Most notable new features: >> >> - PDF output. >> - New Renderer for extracting detailed recognition information at a >> document level. >> >> *Version 3.03 ships with recent Linux distributions such as Ubuntu 14.04.* >> >> Version 3.02 ships with Ubuntu 12.04 >> > Paul > > > Am Mittwoch, 27. August 2014 11:11:32 UTC+2 schrieb zdenop: >> >> Anybody who is packaging tesseract and publicaly sharing 3.03 (excluding >> -rc1) and 3.04 is lying. There are no such releases. >> Repository is intended for developers and testers not for packagers! And >> it is absolutely normal that there are changes of version withing >> repository. There are for developers and testers. >> >> If packagers are not able to respect project (there are reasong why there >> is no new release) that we should we should remove public tesseract >> repository. >> >> Zdenko >> >> >> On Wed, Aug 27, 2014 at 3:46 AM, shree <shree...@gmail.com> wrote: >> >>> Zdenko, >>> >>> Sorry it was not meant to be a 'release' of 3.04, I just wanted to get >>> the latest code compiled under msys2 and asked the developers for help and >>> suggested a package of tesseract and leptonica under msys2. I presume, it >>> is ok to label it as 3.03 with the *Revision:* *298e31465a44.* >>> >>> However, as I had asked you in an earlier post, your last commit of >>> configure.ac does show tesseract version as 3.04 >>> >>> >> >>> # ---------------------------------------- >>> # Initialization >>> # ---------------------------------------- >>> >>> AC_PREREQ(2.50) >>> AC_INIT([tesseract], [3.04], [ >>> http://code.google.com/p/tesseract-ocr/issues/list]) >> >>> >>> FYI, training tools did compile under msys2 on windows8. >>> >>> Thanks, >>> Shree >>> >>> >>> >>> On Tuesday, August 26, 2014 4:39:47 PM UTC+5:30, zdenop wrote: >>> >>>> Please stop with this releases!!! >>>> 3.04 was not released! We are skipping 3.03 release because some people >>>> decided to spread 3.03 on internet and there was need to change API. AFAIK >>>> more API changes for 3.04 should come! >>>> You are not helping this project defintely. >>>> >>>> Zdenko >>>> >>>> >>>> On Tue, Aug 26, 2014 at 10:03 AM, Shree Devi Kumar <shree...@gmail.com> >>>> wrote: >>>> >>>>> Please note that this does NOT install any language data. >>>>> >>>>> Shree Devi Kumar >>>>> ____________________________________________________________ >>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>>>> >>>>> >>>>> On Tue, Aug 26, 2014 at 1:05 PM, Shree Devi Kumar <shree...@gmail.com> >>>>> wrote: >>>>> >>>>>> Follow instructions on >>>>>> >>>>>> https://sourceforge.net/p/msys2/wiki/MSYS2%20installation/ >>>>>> >>>>>> to setup msys2 >>>>>> >>>>>> >>>>>> - 43 minutes ago [image: Alexx83]Alexx83 <http://ehc.ac/u/lexx83/> >>>>>> posted a comment >>>>>> <http://ehc.ac/p/msys2/tickets/71/?limit=25#77dc> on ticket #71 >>>>>> <http://ehc.ac/p/msys2/tickets/71/> >>>>>> >>>>>> Now tesseract-orc can be installed via pacman. For future, I >>>>>> prefer to discuss issues... >>>>>> - 46 minutes agoMSYS2 <http://ehc.ac/p/msys2/> released >>>>>> /REPOS/MINGW/x86_64/mingw-w64-x86_64-tesseract-ocr-3.04-1-any.pkg. >>>>>> tar.xz.sig >>>>>> >>>>>> <http://sourceforge.net/projects/msys2/files/REPOS/MINGW/x86_64/mingw-w64-x86_64-tesseract-ocr-3.04-1-any.pkg.tar.xz.sig/download> >>>>>> >>>>>> - 46 minutes ago MSYS2 <http://ehc.ac/p/msys2/> released >>>>>> /REPOS/MINGW/x86_64/mingw-w64-x86_64-tesseract-ocr-3.04-1-any.pkg. >>>>>> tar.xz >>>>>> >>>>>> <http://sourceforge.net/projects/msys2/files/REPOS/MINGW/x86_64/mingw-w64-x86_64-tesseract-ocr-3.04-1-any.pkg.tar.xz/download> >>>>>> >>>>>> - 48 minutes ago MSYS2 <http://ehc.ac/p/msys2/> released >>>>>> /REPOS/MINGW/i686/mingw-w64-i686-tesseract- >>>>>> ocr-3.04-1-any.pkg.tar.xz.sig >>>>>> >>>>>> <http://sourceforge.net/projects/msys2/files/REPOS/MINGW/i686/mingw-w64-i686-tesseract-ocr-3.04-1-any.pkg.tar.xz.sig/download> >>>>>> >>>>>> - 48 minutes ago >>>>>> >>>>>> >>>>>> ---------- Forwarded message ---------- >>>>>> From: Alexx83 <lex...@users.sf.net> >>>>>> Date: Tue, Aug 26, 2014 at 12:21 PM >>>>>> Subject: [msys2:tickets] #71 tesseract-ocr build failed with bad >>>>>> reloc address 0x23 >>>>>> To: "[msys2:tickets]" <7...@tickets.msys2.p.re.sf.net> >>>>>> >>>>>> Now tesseract-orc can be installed via pacman. >>>>>> For future, I prefer to discuss issues with present packages or new >>>>>> packages adding on github: >>>>>> https://github.com/Alexpux/MINGW-packages >>>>>> >>>>>> For MSYS2 packages: >>>>>> https://github.com/Alexpux/MSYS2-packages >>>>>> >>>>>> You can clone git repo with our scripts and create pull requests with >>>>>> fixes or new packages. >>>>>> >>>>>> >>>>>> >>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "tesseract-dev" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to tesseract-de...@googlegroups.com. >>>>> To post to this group, send email to tesser...@googlegroups.com. >>>>> >>>>> Visit this group at http://groups.google.com/group/tesseract-dev. >>>>> To view this discussion on the web visit https://groups.google.com/d/ >>>>> msgid/tesseract-dev/CAG2NduVZ4sEhonj8YXAZk5xh0S9pm >>>>> 8HrWA4RfLXSbJSbeSL%3DGA%40mail.gmail.com >>>>> <https://groups.google.com/d/msgid/tesseract-dev/CAG2NduVZ4sEhonj8YXAZk5xh0S9pm8HrWA4RfLXSbJSbeSL%3DGA%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> >>>>> For more options, visit https://groups.google.com/d/optout. >>>>> >>>> >>>> >> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/6a818ea1-fdbc-42e9-8ff8-1a2cb68fc955%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.