Hello,
Sorry dunno - you are looking more in depth to Tesseract than I
need to. Hopefully someone else can help.
Cheers,
Neil
On 23 April 2010 10:14, vikas landge <[email protected]> wrote:
> Hello Neil,
>
> Thank you very much for your reply.
>
> Right now I am using the box property of tesseract to know the
> character position which finally yields to the line detection.
>
> However I observed the following scenario:
>
> line1: "He who fights and runs away
> line2: may live to fight
> another day"
>
>
> Now box is created in such a fashion that "away" and "may" are
> clubbed together and only one line created like
>
> "He who fights and runs awaymay live to fight another day"
>
> So, Is there any variable which will help me separate these two lines?
>
> Thanks & regards,
> Vikas
>
>
> On Apr 21, 7:58 pm, Neil Benn <[email protected]> wrote:
> > Hello,
> >
> > I'm copying in the group to keep this on the group chat - can
> you
> > do the same please.
> >
> > AFAIK, there is no other library based on tesseract which provides the
> > information you are looking for on windows - sorry.
> >
> > Cheers,
> >
> > Neil
> >
> > On 21 April 2010 16:53, vikas landge <[email protected]> wrote:
> >
> >
> >
> > > Hello Neil,
> >
> > > Thank you very much for your immediate reply.
> >
> > > I am essentially working on .NET platform and I found that OCRopus
> > > does not have built for windows. Is it correct?
> >
> > > Is there any other library based on tesseract which can give me this
> > > kind of information?
> >
> > > regards,
> > > Vikas
> >
> > > On Apr 20, 9:40 pm, "Neil Benn" <[email protected]> wrote:
> > > > Hello,
> >
> > > > Tesseract is not great at that though it will try to split two
> > > lines
> > > > up a bit; Tesseract is more 'pure' character recognition. If I were
> you
> > > I'd
> > > > look at OCRopus which can do what you are looking for.
> >
> > > > Cheers,
> >
> > > > Neil
> >
> > > > -----Original Message-----
> > > > From: [email protected] [mailto:
> > > [email protected]]
> >
> > > > On Behalf Of vikas landge
> > > > Sent: 20 April 2010 15:50
> > > > To: tesseract-ocr
> > > > Subject: extracting line information
> >
> > > > I am new to tesseract-ocr.
> >
> > > > I am interested in getting the line information from the image. e.g.
> > > > suppose I have following data in my image
> >
> > > > Name: John Smith
> > > > Age: 25
> >
> > > > I would like to obtain the information as two separate strings from
> > > > tesseract. e.g.
> >
> > > > string1:"Name: John Smith"
> > > > string2: "Age: 25"
> >
> > > > Is it possible using tesseract?
> >
> > > > It would be great if someone could help me. Thanks in advance.
> >
> > > > --
> > > > You received this message because you are subscribed to the Google
> Groups
> > > > "tesseract-ocr" group.
> > > > To post to this group, send email to [email protected].
> > > > To unsubscribe from this group, send email to
> > > > [email protected]<tesseract-ocr%[email protected]>
> <tesseract-ocr%[email protected]<tesseract-ocr%[email protected]>
> >
> > > .
> > > > For more options, visit this group athttp://
> > > groups.google.com/group/tesseract-ocr?hl=en.
> >
> > > > --
> > > > You received this message because you are subscribed to the Google
> Groups
> > > "tesseract-ocr" group.
> > > > To post to this group, send email to [email protected].
> > > > To unsubscribe from this group, send email to
> > > [email protected]<tesseract-ocr%[email protected]>
> <tesseract-ocr%[email protected]<tesseract-ocr%[email protected]>
> >
> > > .
> > > > For more options, visit this group athttp://
> > > groups.google.com/group/tesseract-ocr?hl=en.
> >
> > --
> > --
> >
> > Neil Benn Msc
> > Director
> > Ziath Ltd
> > Phone :+44 (0)7508 107942
> > Website -http://www.ziath.com
> >
> > IMPORTANT NOTICE: This message, including any attached documents, is
> > intended only for the use of the individual or entity to which it is
> > addressed, and may contain information that is privileged, confidential
> and
> > exempt from disclosure under applicable law. If the reader of this
> message
> > is not the intended recipient, or the employee or agent responsible for
> > delivering the message to the intended recipient, you are hereby notified
> > that any dissemination, distribution or copying of this communication is
> > strictly prohibited. If you have received this communication in error,
> > please notify Ziath Ltd immediately by email at [email protected]. Thank
> you.
> >
> > --
> > You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> > To post to this group, send email to [email protected].
> > To unsubscribe from this group, send email to
> [email protected]<tesseract-ocr%[email protected]>
> .
> > For more options, visit this group athttp://
> groups.google.com/group/tesseract-ocr?hl=en.
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected]<tesseract-ocr%[email protected]>
> .
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>
--
--
Neil Benn Msc
Director
Ziath Ltd
Phone :+44 (0)7508 107942
Website - http://www.ziath.com
IMPORTANT NOTICE: This message, including any attached documents, is
intended only for the use of the individual or entity to which it is
addressed, and may contain information that is privileged, confidential and
exempt from disclosure under applicable law. If the reader of this message
is not the intended recipient, or the employee or agent responsible for
delivering the message to the intended recipient, you are hereby notified
that any dissemination, distribution or copying of this communication is
strictly prohibited. If you have received this communication in error,
please notify Ziath Ltd immediately by email at [email protected]. Thank you.
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en.