I'm running Tesseract in a process.  I start two processes in two
separate threads.  The processes write to a file on the FS, and then I
use the primary process to read the file to determine the result.

2.04 is substantially slower than 2.03 in my experience.  At least
using the C#/.NET wrapper that Remi put around the source for us.  I
have not tried running source directly from VS2008.

On Jun 11, 6:03 pm, Ray Smith <[email protected]> wrote:
> Tesseract is not thread safe. You can't run it in parallel in 2 threads. Not
> until at least 3.01, although 3.00 gets part of the way there.Ray.
>
>
>
> On Wed, Jun 10, 2009 at 1:49 PM, Adam <[email protected]> wrote:
>
> > I grabbed your latest version, Remi, and it's noticeably slower than
> > the last version.  I've written a simple console app in .NET C# to
> > perform the OCR operation to allow the CLR to clean up the memory
> > leak.  I then create processes in two threads to perform OCR using the
> > console app.  With version 2.03, I get between .48-.6s to process an
> > image.  With 2.04, the time went up to ~.6-.8.  With my dual-threaded
> > approach, I'm able to process as many as 3.5 images/sec in 2.03, and
> > it dropped to 2.6 images/sec in 2.04.
>
> > I'd love to see a memory leak free version, as when I don't have to
> > take on the overhead of spawning a process, I get a pretty consistent .
> > 25 process time.  With a dual threaded approach, that'd let me process
> > between 7 and 8 images a second... it'd almost double my throughput!
>
> > I'm very happy with the project, though... it's so much faster than
> > Microsoft Office Document Imaging, and I can distribute my app to
> > others and they don't need Office 2007.
>
> > On Jun 5, 10:20 am, Remi  Thomas <[email protected]> wrote:
> > > Hi,
>
> > > You can take the .NET wrapper based on version 2.04
> >http://www.pixel-technology.com/freeware/tessnet2/bin.zip
> > > Two modifications.
>
> > > SetRootPath has been removed and merge with Init
> > > Init(string tessdataPath, string lang, bool numericMode)
> > > if tessdataPath==null then Init works like previous version.
>
> > > tessnet2 assembly is now renamed tessnet2_32.dll and tessnet2_64.dll
> > > to avoid confusion between 32 and 64 bits version.
>
> > > Ray, for me everything works.
>
> > > Have fun,
> > > Remi
>
> > > On Jun 3, 7:51 pm, Ray <[email protected]> wrote:
>
> > > > The current (v250) svn code is a 2.04 release candidate.
> > > > If you are able to download from svn, and have reported an issue on
> > > > the list below, then please take a look and give it a try.
> > > > This version will be uploaded to the download page soon unless I hear
> > > > of any further problems.
> > > > NOTE that VC++ express 2005 is deprecated and no longer supported. Get
> > > > vc++ express 2008 instead.
>
> > > > After 2.04, there will be no going back:
> > > > This is that *last* version to build with VC++6! V3.00 has some new
> > > > template code that VC++6 just can't cope with.
> > > > V3.00 will have big changes to TessBaseAPI, moving towards (but not
> > > > complete) thread safety.
> > > > V3.00 will have page layout analysis that will not work well without
> > > > leptonica. As a consequence, 3.00 by default will require leptonica to
> > > > build on windows. It may be possible to disable it, but the resulting
> > > > code will have reduced functionality.
> > > > Completion of 2.04 will open the door to an upload of a preliminary
> > > > version of 3.00 to svn...
>
> > > > Here are the 2.04 release notes:
> > > > Tesseract release notes June 2 2009 - V2.04
> > > > Integrated patches for portability and to remove some of the
> > > > "access" macros.
> > > > Removed dependence on lua from the viewer making it a *lot*
> > > > faster. Also the viewer now compiles and works (on Linux.) Also works
> > > > on windows via a pre-built ScrollView.jar.
> > > > Fixed the following issues:
> > > > 1, 63, 67, 71, 76, 79, 81, 82, 84, 106, 108, 111, 112, 128, 129, 130,
> > > > 133, 135,
> > > > 142, 143, 145, 146, 147, 153, 154, 160, 165, 169, 170, 175, 177, 187,
> > > > 192,
> > > > 195, 199, 201, 205, 209.
> > > > This is the last version to support VC++6!
> > > > This may also be the last version to compile without leptonica!- Hide
> > quoted text -
>
> > > - Show quoted text -- Hide quoted text -
>
> - Show quoted text -
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to