Dear temp name,

Yes, I tried "table without border.png", because you wrote "This image is
also not recognized correctly" (and you attached the image table without
border.png).

With 3.03 you will get better results, e.g. "0 0121 A" for the image
"first.PNG".

I think a solution could be "custom segmentation" (with the tesseract API,
or with a uzn file if you use the tesseract executable). For example (tested
with version 3.03; I renamed your file to first.png ;-) ):

   - put the attached uzn file in the same directory as first.png
   - run 'tesseract first.png - -psm 7' and you will get the result "0|
   0121|A|D", which contains the text of all table cells separated by |.

Of course you cannot rely on this (a cell border being recognized as |), but
I think it is a good demonstration of what can be achieved with "custom
segmentation".
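
For anyone who wants to create such a zone file themselves, here is a
minimal sketch in Python. It assumes the uzn format is one zone per line,
written as "left top width height label" in pixels from the top-left corner
of the image. The cell coordinates and the helper names (CELLS, build_uzn,
write_uzn) are hypothetical and are not the values from the attached
first.uzn; the real boxes have to be measured on your image.

```python
from pathlib import Path

# Hypothetical cell boxes as (left, top, width, height) in pixels.
# These are placeholders, not the coordinates used in the attached first.uzn.
CELLS = [
    (0, 0, 40, 30),
    (40, 0, 80, 30),
    (120, 0, 40, 30),
    (160, 0, 40, 30),
]


def build_uzn(cells, label="Text"):
    """Return zone file content: one 'left top width height label' line per zone."""
    return "".join(
        f"{left} {top} {width} {height} {label}\n"
        for left, top, width, height in cells
    )


def write_uzn(image_path, cells):
    """Write the zone file next to the image, using the image's base name."""
    uzn_path = Path(image_path).with_suffix(".uzn")
    uzn_path.write_text(build_uzn(cells))
    return uzn_path
```

Calling write_uzn("first.png", CELLS) would write first.uzn next to the
image, after which the tesseract command shown above can be run in that
directory.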

Zdenko


On Tue, Mar 18, 2014 at 6:46 AM, temp name <[email protected]> wrote:

>
> Zdenko,
>
> you must have tried the psm modes with the image table_without_border. My
> query was about the image first.png. The image contains a table with a
> single row and multiple columns. Tesseract doesn't produce correct output
> for that image. I have tried all the page segmentation modes with this
> image. I have attached the image. Please suggest a solution.
>
> Thanks.
>
> On Monday, March 17, 2014 2:34:09 AM UTC+5:30, zdenop wrote:
>
>> Dear temp name,
>>
>> You wrote "I tried all the page segment mode, but it didn't solved my
>> problem.". I tried the correct page segmentation mode and got the attached
>> output. I tried it with tesseract v3.02.02 and v3.03 RC1 on openSUSE 13.1
>> 64bit.
>>
>> Are you sure that you tried ALL page segmentation modes?
>>
>> Zdenko
>>
>>
>> On Sun, Mar 16, 2014 at 8:56 PM, temp name <[email protected]> wrote:
>>
>>>  Zdenko,
>>>
>>> I tried all the page segment mode, but it didn't solved my problem.
>>>
>>> I tried outputting the page segment image. This image is also not
>>> recognized correctly.
>>>
>>> Please reply soon.
>>>
>>>
>>> On Thursday, March 13, 2014 2:49:56 AM UTC+5:30, zdenop wrote:
>>>
>>>> Did you read the wiki[1]?
>>>>
>>>> [1] https://code.google.com/p/tesseract-ocr/wiki/ImproveQuality#Segmentation_method
>>>>
>>>> Zdenko
>>>>
>>>>
>>>> On Thu, Feb 27, 2014 at 8:35 AM, temp name <[email protected]> wrote:
>>>>
>>>>>  Hello,
>>>>>
>>>>> I have an image which contains only a table. When I use tesseract for
>>>>> OCR, it doesn't recognize the text in the first and last columns.
>>>>> I have attached the image which I used.
>>>>>
>>>>> After a lot of trial and error, I modified the image manually and added
>>>>> some text above the table before feeding the image to tesseract, and I
>>>>> got correct results. I am not able to understand the logic here: why
>>>>> doesn't tesseract recognize the first image when it recognizes the
>>>>> second? I suspect it is because of table border analysis or page border
>>>>> analysis in the preprocessing phase of tesseract. If this is the case,
>>>>> then tesseract should have a parameter which can force it to skip page
>>>>> border analysis. I need some help urgently. Any ideas?
>>>>>
>>>>> P.S. I have attached the two images.
>>>>>
>>>>> --
>>>>> --
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To post to this group, send email to [email protected]
>>>>>
>>>>> To unsubscribe from this group, send email to
>>>>> [email protected]
>>>>>
>>>>> For more options, visit this group at
>>>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>>>
>>>>> ---
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>>> an email to [email protected].
>>>>>
>>>>> For more options, visit https://groups.google.com/groups/opt_out.
>>>>>
>>>>
>>
>


Attachment: first.uzn
Description: Binary data
