Re: OSAL required

2010-07-13 Thread Andres
Sorry for the curiosity, what OS starts with B ? Cheers, Andres 2010/7/13 Genius Mchlachla > No, its definitely B > > > On Tue, Jul 13, 2010 at 8:13 PM, Jimmy O'Regan wrote: > >> On 13 July 2010 18:27, Genius Mchlachla >> wrote: >> > Does the OS start with a 'B' by any chance? >> > >> >> I'l

Re: Localisation

2010-07-13 Thread Jimmy O'Regan
On 13 July 2010 21:48, Jeffrey Ratcliffe wrote: > On 13 July 2010 18:29, Jimmy O'Regan wrote: >> Launchpad trolling aside, I am being somewhat serious here. The usual >> output that the user will encounter in normal operation is quite >> typical, but there are quite a lot of strings that *may* be

Re: Localisation

2010-07-13 Thread Jeffrey Ratcliffe
On 13 July 2010 18:29, Jimmy O'Regan wrote: > Launchpad trolling aside, I am being somewhat serious here. The usual > output that the user will encounter in normal operation is quite > typical, but there are quite a lot of strings that *may* be output to > the user that are quite complicated and w

Re: tesseract output correction gear

2010-07-13 Thread Dmitry Silaev
there was a minor bug which prevented display of magnified textline images in the viewport after save now it's fixed eh, development version as i said On Tue, Jul 13, 2010 at 3:36 PM, Jimmy O'Regan wrote: > On 13 July 2010 11:55, daemon-s wrote: > > Please check out this little tool at > http:/

Re: OSAL required

2010-07-13 Thread Genius Mchlachla
No, its definitely B On Tue, Jul 13, 2010 at 8:13 PM, Jimmy O'Regan wrote: > On 13 July 2010 18:27, Genius Mchlachla > wrote: > > Does the OS start with a 'B' by any chance? > > > > I'll go for 'R'... RumpelstiltskinOS! > > -- > jimregan, that's because deep inside you, you are evil. > Also n

Re: OSAL required

2010-07-13 Thread Jimmy O'Regan
On 13 July 2010 18:27, Genius Mchlachla wrote: > Does the OS start with a 'B' by any chance? > I'll go for 'R'... RumpelstiltskinOS! -- jimregan, that's because deep inside you, you are evil. Also not-so-deep inside you. -- You received this message because you are subscribed to the Google

Re: OSAL required

2010-07-13 Thread Genius Mchlachla
Does the OS start with a 'B' by any chance? On Tue, Jul 13, 2010 at 7:23 PM, patrickq wrote: > Just my $0.02 opinion but I highly doubt that you are going to elicit > a lot of helpful response by approaching an open source community with > such secrecy (note the word "open" in "open source" ...).

Re: Is it possible to get a confidence value for the tesseract OCR result?

2010-07-13 Thread Patrick Questembert
Here is a code snipet: PAGE_RES* page_res_pass1 = myTess->RecognitionPass1(block_list); char *textOCR = NULL; int matchedChars = 0; int *lengths = NULL; int *x0 = NULL; int *y0 = NULL; int *x1 = NULL; int *y1 = NULL; float *costs = NULL; matchedChars = myT

Re: OSAL required

2010-07-13 Thread Jimmy O'Regan
On 13 July 2010 09:34, sai wrote: > Hi, I want to port this engine in our specific operating system( I > can't reveal the name). Can you guide me where the OS specific porting > files (OSAL) are located ? Is the code organised like this or is it > tightly coupled to particular OS like Windows,Linu

Re: OSAL required

2010-07-13 Thread patrickq
Just my $0.02 opinion but I highly doubt that you are going to elicit a lot of helpful response by approaching an open source community with such secrecy (note the word "open" in "open source" ...). On Jul 13, 4:34 am, sai wrote: > Hi, I want to port this engine in our specific operating system(

OSAL required

2010-07-13 Thread sai
Hi, I want to port this engine in our specific operating system( I can't reveal the name). Can you guide me where the OS specific porting files (OSAL) are located ? Is the code organised like this or is it tightly coupled to particular OS like Windows,Linux, etc ? -- You received this message bec

Re: *** glibc detected *** tesseract: double free or corruption

2010-07-13 Thread msjs08
Hi Thanks Cheers SJ On 13/07/10 23:33, Zdenko Podobný wrote: Hi, I have Mandriva 2010.1 64 bit (upgraded from 2010) and with tesseract-2.04-5mdv2010.1. I can run without problem (in xterm or konsole): /usr/bin/tesseract resized_slide7.tif resized_slide7 that produce output (resized_

Re: Is it possible to get a confidence value for the tesseract OCR result?

2010-07-13 Thread caro
OK, thank you for your help. Can you just precise me how to use this function TesserractExtractResult()? Thank you, Caroline On Jul 9, 6:35 pm, "Jimmy O'Regan" wrote: > On 9 July 2010 10:01, caro wrote: > > > I am working with tesseract OCR and I would like to get at the end of > > the algorit

Re: Is it possible to get a confidence value for the tesseract OCR result?

2010-07-13 Thread caro
OK, thank you for your help. Can you just precise me how to use this function TesserractExtractResult()? Thank you, Caroline On Jul 9, 6:35 pm, "Jimmy O'Regan" wrote: > On 9 July 2010 10:01, caro wrote: > > > I am working with tesseract OCR and I would like to get at the end of > > the algorit

Re: Localisation

2010-07-13 Thread Jimmy O'Regan
On 13 July 2010 16:46, Jimmy O'Regan wrote: > On 13 July 2010 16:18, Jeffrey Ratcliffe wrote: >>> Launchpad? I'll admit that I haven't looked at it in quite a while, >>> but I remember it being poor quality software producing poor quality >>> translations; I've heard that the quality of the trans

Re: Localisation

2010-07-13 Thread Jimmy O'Regan
On 13 July 2010 16:18, Jeffrey Ratcliffe wrote: >> Launchpad? I'll admit that I haven't looked at it in quite a while, >> but I remember it being poor quality software producing poor quality >> translations; I've heard that the quality of the translations has >> improved quite drastically, but I'd

Re: Localisation

2010-07-13 Thread Aiet Kolkhi
Hi, I have to admit, I too am slightly more comfortable working with Pootle (either on dedicated server or on central Locmotion one) than with Rosetta. It can be just me though. Regards, Aiet On Tue, Jul 13, 2010 at 7:18 PM, Jeffrey Ratcliffe < jeffrey.ratcli...@gmail.com> wrote: > > Launchpad

Re: Localisation

2010-07-13 Thread Zdenko Podobný
Dn(a 13.07.2010 12:29, Jimmy O'Regan wrote / napísal(a): > On 12 July 2010 20:19, Jeffrey Ratcliffe wrote: > >> On 12 July 2010 17:16, Jimmy O'Regan wrote: >> >>> Is anybody interested in seeing localisation support in Tesseract? >>> (Which begs the follow-up question: is anybody willin

Re: *** glibc detected *** tesseract: double free or corruption

2010-07-13 Thread Zdenko Podobný
Hi, I have Mandriva 2010.1 64 bit (upgraded from 2010) and with tesseract-2.04-5mdv2010.1. I can run without problem (in xterm or konsole): /usr/bin/tesseract resized_slide7.tif resized_slide7 that produce output (resized_slide7.txt) . BUT: /usr/bin/tesseract resized_slide5.tif resiz

Re: Localisation

2010-07-13 Thread Jeffrey Ratcliffe
> Launchpad? I'll admit that I haven't looked at it in quite a while, > but I remember it being poor quality software producing poor quality > translations; I've heard that the quality of the translations has > improved quite drastically, but I'd still prefer software designed by > translators for

Re: unable to parse numbers?

2010-07-13 Thread Jimmy O'Regan
On 12 July 2010 18:35, rogerdpack wrote: > Hi all.  re-posting this in its own thread: > > > Overall I'm having no success getting tesseract to decode this file > that has a few digits on it, in either Linux or Windows. > > http://myfavoritepal.com/incoming/picture10.tif > > I am on XP, 2.04, 2.00

Re: tesseract output correction gear

2010-07-13 Thread Jimmy O'Regan
On 13 July 2010 11:55, daemon-s wrote: > Please check out this little tool at > http://www.c2scan.com/pokupedia.ru/index.php?r=site/ocr > > As an output it produces an XML file containing bounding box > descriptions and character mappings. For the current demo image anyone > can make corrections

Re: Auto training

2010-07-13 Thread Jimmy O'Regan
On 12 July 2010 16:33, Rippalka wrote: > Thanks for the answer. > > Even when I don't reach the "50" and I don't add it, it doesn't give > any better results. Okl I can't remember the details of the spacing issue, but that particular example happened to be a clear example. What's less clear is th

Re: request: windows release with support for compressed tiff's

2010-07-13 Thread Jimmy O'Regan
On 12 July 2010 19:49, rogerdpack wrote: > I assume there's some type of legal difficulty to not doing this?  It I guess that you're referring to LZW-compressed tiff files, and the patent issues around LZW? The patent applies to compression, so it only affects the writing of compressed TIFFs, so

Re: suggestion: change download instructions slightly

2010-07-13 Thread Jimmy O'Regan
On 12 July 2010 16:44, rogerdpack wrote: >> > A couple of suggestions, therefore: >> > One option: change verbiage from >> >> > tesseract-2.01..tar.gz contains the language data files for >> > . You need at least one of these or tesseract will not work. >> >> > to >> >> > tesseract-2.01..tar.gz co

tesseract output correction gear

2010-07-13 Thread daemon-s
Please check out this little tool at http://www.c2scan.com/pokupedia.ru/index.php?r=site/ocr As an output it produces an XML file containing bounding box descriptions and character mappings. For the current demo image anyone can make corrections and save. These corrections will be visible to succ

Re: *** glibc detected *** tesseract: double free or corruption

2010-07-13 Thread Jimmy O'Regan
On 13 July 2010 01:59, msjs08 wrote: > Hi > I installed the only version available through the Mandriva control centre. > Version 2.04 Ah. Ok, there are quite a few memory problems in that version of Tesseract. I think, for you, the best solution would be to follow the directions on the wiki to c

Re: Localisation

2010-07-13 Thread Jimmy O'Regan
On 12 July 2010 20:19, Jeffrey Ratcliffe wrote: > On 12 July 2010 17:16, Jimmy O'Regan wrote: >> Is anybody interested in seeing localisation support in Tesseract? >> (Which begs the follow-up question: is anybody willing to contribute >> translations for their language(s)?) > > I would add the s

Re: *** glibc detected *** tesseract: double free or corruption

2010-07-13 Thread msjs08
Hi I installed the only version available through the Mandriva control centre. Version 2.04 The imagereader thing is a gui front end to tesseract. It doesn't interpret the error messages. My first problem was that I wasn't feeding 8bit images to it. I'm getting the image recognised now but it

request: windows release with support for compressed tiff's

2010-07-13 Thread rogerdpack
I assume there's some type of legal difficulty to not doing this? It would be quite convenient to have easy access to a windows build that worked with compressed tiffs--or is the current downloadable supposed to be that way? -r -- You received this message because you are subscribed to the Googl