Re: Table Analysis and Cell Labeling

Neo Song Mon, 25 Jun 2012 06:27:18 -0700

Dear All,

    I am updating this thread in the hope of getting some help. Is it 
possible to obtain the individual table cells from the table finder results?
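
    To make the request concrete, here is a rough sketch of the kind of 
result I am after. It is not Tesseract code: the HLine, VLine and Cell types 
and the CellsFromRulings function are hypothetical names of my own. It 
assumes the rulings are axis-aligned, deskewed, and already snapped so that 
lines meant to meet share exact coordinates, that the outer border is a 
closed rectangle, and that every cell is a rectangle. It lays a grid over all 
ruling coordinates and merges neighbouring grid squares that have no ruling 
between them.

```cpp
#include <algorithm>
#include <numeric>
#include <vector>

// Hypothetical types for illustration; none of these exist in Tesseract.
struct HLine { int y, x0, x1; };  // horizontal ruling at height y, x0 <= x1
struct VLine { int x, y0, y1; };  // vertical ruling at column x, y0 <= y1
struct Cell  { int left, top, right, bottom; };

static std::vector<int> SortedUnique(std::vector<int> v) {
  std::sort(v.begin(), v.end());
  v.erase(std::unique(v.begin(), v.end()), v.end());
  return v;
}

// Union-find root lookup with path halving.
static int Find(std::vector<int>& parent, int i) {
  while (parent[i] != i) i = parent[i] = parent[parent[i]];
  return i;
}

// Returns cells sorted top to bottom, then left to right.
std::vector<Cell> CellsFromRulings(const std::vector<HLine>& hs,
                                   const std::vector<VLine>& vs) {
  std::vector<int> xs, ys;
  for (const HLine& h : hs) { ys.push_back(h.y); xs.push_back(h.x0); xs.push_back(h.x1); }
  for (const VLine& v : vs) { xs.push_back(v.x); ys.push_back(v.y0); ys.push_back(v.y1); }
  xs = SortedUnique(xs);
  ys = SortedUnique(ys);
  if (xs.size() < 2 || ys.size() < 2) return {};
  const int cols = static_cast<int>(xs.size()) - 1;
  const int rows = static_cast<int>(ys.size()) - 1;

  std::vector<int> parent(rows * cols);
  std::iota(parent.begin(), parent.end(), 0);
  auto unite = [&](int a, int b) {
    int ra = Find(parent, a);
    int rb = Find(parent, b);
    parent[ra] = rb;
  };
  // True if a vertical ruling at x covers the span [y0, y1].
  auto v_wall = [&](int x, int y0, int y1) {
    for (const VLine& v : vs)
      if (v.x == x && v.y0 <= y0 && v.y1 >= y1) return true;
    return false;
  };
  // True if a horizontal ruling at y covers the span [x0, x1].
  auto h_wall = [&](int y, int x0, int x1) {
    for (const HLine& h : hs)
      if (h.y == y && h.x0 <= x0 && h.x1 >= x1) return true;
    return false;
  };

  // Merge neighbouring grid squares that are not separated by a ruling.
  for (int r = 0; r < rows; ++r) {
    for (int c = 0; c < cols; ++c) {
      const int id = r * cols + c;
      if (c + 1 < cols && !v_wall(xs[c + 1], ys[r], ys[r + 1])) unite(id, id + 1);
      if (r + 1 < rows && !h_wall(ys[r + 1], xs[c], xs[c + 1])) unite(id, id + cols);
    }
  }

  // Each merged group becomes one cell: the bounding box of its squares.
  std::vector<Cell> box(rows * cols);
  std::vector<bool> seen(rows * cols, false);
  for (int r = 0; r < rows; ++r) {
    for (int c = 0; c < cols; ++c) {
      const int root = Find(parent, r * cols + c);
      const Cell e{xs[c], ys[r], xs[c + 1], ys[r + 1]};
      if (!seen[root]) { box[root] = e; seen[root] = true; continue; }
      box[root].left = std::min(box[root].left, e.left);
      box[root].top = std::min(box[root].top, e.top);
      box[root].right = std::max(box[root].right, e.right);
      box[root].bottom = std::max(box[root].bottom, e.bottom);
    }
  }
  std::vector<Cell> cells;
  for (int i = 0; i < rows * cols; ++i)
    if (seen[i]) cells.push_back(box[i]);
  std::sort(cells.begin(), cells.end(), [](const Cell& a, const Cell& b) {
    return a.top != b.top ? a.top < b.top : a.left < b.left;
  });
  return cells;
}
```

    What I cannot tell is whether the table finder already computes 
something equivalent internally, and if so how to get at it.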


On Tuesday, June 19, 2012 at 4:26:33 PM UTC+8, Neo Song wrote:
>
> Dear All,
>
>     I am currently working on a table text extraction project, and we need 
> to identify the tables before any OCR is run. 
>     I looked into the relevant source code (checked-out version: r729) and 
> found that there is a table finder class inside Tesseract (tablefind.cpp). 
> The problem is that for irregular tables (e.g. tables whose rows have 
> different numbers of columns), even once I have all the ruling lines, I 
> cannot identify the individual table cells.
>     I have called the function "FindLinesCreateBlockList()" and can 
> iterate over all the text blocks, horizontal lines, and vertical lines in 
> the target image. However, I cannot do anything useful with these lines on 
> their own. What I need is something like a CELL_LIST that contains every 
> table cell in reading order, derived from the table ruling lines. I suspect 
> that the table finder already contains such an algorithm (I read the code, 
> but it is too complicated for me to follow) and that it is simply not 
> exposed through the Base API. Is that true?
>     Can someone help me with this? How can I obtain the table cells? An 
> example of such an irregular table is in the attachment. 
>

