>
> Thanks for the reply Nick. I'm doing it. It is very hard ti figure out the 
> functionality of methods without understanding the whole project. Since I 
> have to find out what are those header files do and the relation, it is 
> going to take a lot of time. I'd appreciate if anyone can point me out 
> where the outputs (the extracted text from table) being passed. So that I 
> can add html table tags to the output to reproduce the table in html 
> format.

Anbu   

On Tuesday, April 8, 2014 9:08:30 PM UTC+5:30, Nick White wrote:
>
> Documentation for the internals of Tesseract is unfortunately rather 
> minimal, indeed. I'd recommend you take a look at the TableFinder 
> class in the code to figure it out. And please do share anything you 
> learn here! 
>
> Nick 
>
> On Mon, Apr 07, 2014 at 02:45:51AM -0700, ANBU J wrote: 
> > It's sad that we couldn't find a documentation for the methods for table 
> > manipulation in tesseract. Looks like I have to manually implement an 
> algorithm 
> > to handle tables. 
> > if you have done it already, please share the knowledge.   
> > 
> > On Tuesday, 25 June 2013 14:42:46 UTC+5:30, [email protected] wrote: 
> > 
> >     Hi ! 
> > 
> >     I'm going to work for a program which can recognize the table 
> structure and 
> >     text in this table. 
> >     I tried to OCR the table image using command line on Windows 7, but 
> the 
> >     output text was so bad. 
> > 
> >     (just like this: tesseract table.jpg out -l eng, or with "hocr") 
> >     I tried to using TessBaseAPI in VC too.(just a simple application) 
> > 
> >     The table lines(especially column) interfere in the whole image. 
> > 
> >     And now, I find the Class "TableFinder" in Tesseract source code, 
> but I 
> >     can't get anything else from Internet. (Tesseract-OCR-3.02) 
> >     No demos, teachings here? 
> > 
> >     I am new, sincerely hope to get some help.  :) 
> > 
> >     Thanks! 
> > 
> > -- 
> > -- 
> > You received this message because you are subscribed to the Google 
> > Groups "tesseract-ocr" group. 
> > To post to this group, send email to 
> > [email protected]<javascript:> 
> > To unsubscribe from this group, send email to 
> > [email protected] <javascript:> 
> > For more options, visit this group at 
> > http://groups.google.com/group/tesseract-ocr?hl=en 
> > 
> > --- 
> > You received this message because you are subscribed to the Google 
> Groups 
> > "tesseract-ocr" group. 
> > To unsubscribe from this group and stop receiving emails from it, send 
> an email 
> > to [email protected] <javascript:>. 
> > For more options, visit https://groups.google.com/d/optout. 
>

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to