Hello Neil,

Thank you very much for your reply.

Right now I am using Tesseract's box output to get the position of each
character, and from those positions I derive the lines.

However I observed the following scenario:

line1: "He who fights and runs away
line2:                                               may live to fight another day"


The boxes come out in such a way that "away" and "may" are merged,
and only a single line is produced:

"He who fights and runs awaymay live to fight another day"

Is there any variable that would help me separate these two lines?
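
One possible workaround, independent of any Tesseract variable, is to
regroup the character boxes by their vertical position. The sketch below
is only an illustration under assumptions: it expects box-file rows of the
form "char left bottom right top [page]" with the origin at the bottom-left
of the image, the function names parse_box_lines and group_into_lines are
made up for this example, and the line_tol and space_tol thresholds are
guesses that would need tuning on real scans.

```python
from statistics import median


def parse_box_lines(text):
    """Parse box-file text into (char, left, bottom, right, top) tuples."""
    boxes = []
    for raw in text.splitlines():
        parts = raw.split()
        if len(parts) < 5:
            continue  # skip blank or malformed rows
        left, bottom, right, top = (int(p) for p in parts[1:5])
        boxes.append((parts[0], left, bottom, right, top))
    return boxes


def group_into_lines(boxes, line_tol=0.6, space_tol=0.5):
    """Group character boxes into text lines by vertical centre.

    line_tol and space_tol are assumed thresholds, expressed as fractions
    of the median character height and width respectively.
    """
    if not boxes:
        return []
    med_h = median(top - bottom for _, _, bottom, _, top in boxes)
    med_w = median(right - left for _, left, _, right, _ in boxes)

    # Bottom-left origin: a larger y is higher on the page, so sort descending.
    ordered = sorted(boxes, key=lambda b: -(b[2] + b[4]) / 2)

    lines = []
    for box in ordered:
        centre = (box[2] + box[4]) / 2
        if lines and abs(lines[-1]["centre"] - centre) <= line_tol * med_h:
            line = lines[-1]
            line["boxes"].append(box)
            # Keep a running mean of the vertical centre of the line.
            line["centre"] += (centre - line["centre"]) / len(line["boxes"])
        else:
            lines.append({"centre": centre, "boxes": [box]})

    result = []
    for line in lines:
        chars = sorted(line["boxes"], key=lambda b: b[1])  # left to right
        text = chars[0][0]
        for prev, cur in zip(chars, chars[1:]):
            # Insert a space when the horizontal gap looks like a word break.
            if cur[1] - prev[3] > space_tol * med_w:
                text += " "
            text += cur[0]
        result.append(text)
    return result
```

With this grouping, a second line that starts further right and sits
lower than the first lands in its own group as long as its vertical
centre differs by more than line_tol times the median character height.
If the two lines overlap more than that, the threshold has to be lowered
or another cue (such as the baseline) used instead.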

Thanks & regards,
Vikas


On Apr 21, 7:58 pm, Neil Benn <[email protected]> wrote:
> Hello,
>
>          I'm copying in the group to keep this on the group chat - can you
> do the same please.
>
>    AFAIK, there is no other Tesseract-based library that provides the
> information you are looking for on Windows - sorry.
>
> Cheers,
>
> Neil
>
> On 21 April 2010 16:53, vikas landge <[email protected]> wrote:
>
>
>
> > Hello Neil,
>
> > Thank you very much for your immediate reply.
>
> > I am working on the .NET platform, and I found that OCRopus
> > does not have a build for Windows. Is that correct?
>
> > Is there any other library based on tesseract which can give me this
> > kind of information?
>
> > regards,
> > Vikas
>
> > On Apr 20, 9:40 pm, "Neil Benn" <[email protected]> wrote:
> > > Hello,
>
> > >        Tesseract is not great at that though it will try to split two
> > > lines up a bit; Tesseract is more 'pure' character recognition.  If I
> > > were you I'd look at OCRopus which can do what you are looking for.
>
> > > Cheers,
>
> > > Neil
>
> > > -----Original Message-----
> > > From: [email protected] [mailto:[email protected]]
> > > On Behalf Of vikas landge
> > > Sent: 20 April 2010 15:50
> > > To: tesseract-ocr
> > > Subject: extracting line information
>
> > > I am new to tesseract-ocr.
>
> > > I am interested in getting line information from an image. For example,
> > > suppose I have the following data in my image:
>
> > > Name: John Smith
> > > Age: 25
>
> > > I would like to obtain the information as two separate strings from
> > > tesseract. e.g.
>
> > > string1: "Name: John Smith"
> > > string2: "Age: 25"
>
> > > Is it possible using tesseract?
>
> > > It would be great if someone could help me. Thanks in advance.
>
> > > --
> > > You received this message because you are subscribed to the Google Groups
> > > "tesseract-ocr" group.
> > > To post to this group, send email to [email protected].
> > > To unsubscribe from this group, send email to
> > > [email protected].
> > > For more options, visit this group at
> > > http://groups.google.com/group/tesseract-ocr?hl=en.
>
>
> --
>
> Neil Benn Msc
> Director
> Ziath Ltd
> Phone :+44 (0)7508 107942
> Website -http://www.ziath.com
>
>

