See Pix.RemoveLines() 
<https://github.com/charlesw/tesseract/blob/develop/src/Tesseract/Pix.cs>.


On Tuesday, August 30, 2016 at 8:09:55 AM UTC-5, shripad shirsat wrote:
>
> Thank you very much for your valuable suggestion. Can you just help me out 
> in how to remove the horizontal lines as I am processing this image in C# 
> code and is there any tool which i can use to remove the horizontal line or 
> any code snippet i can refer.
>
> On Saturday, August 27, 2016 at 10:04:10 PM UTC+5:30, Quan Nguyen wrote:
>>
>> Deskew, grayscale, remove lines, binarize produced the image:
>>
>>
>> <https://lh3.googleusercontent.com/-k4IAE2W2W7M/V8HAYJhIP5I/AAAAAAAAAqg/C85uxC7JDOMikMfAX_whlGB8UBU2Y1BiACLcB/s1600/Capture4.PNG>
>>
>> and OCRed text:
>>
>> l4|0|0l2|1l1>°l0|7l
>>
>> So if you could remove the vertical lines, it would improve further.
>>
>> On Saturday, August 27, 2016 at 10:29:52 AM UTC-5, shripad shirsat wrote:
>>>
>>>
>>> I am facing to issue to recognize the numbers from pdf which are printed 
>>> within the boxes. I have used tesseract in C# for my project. Kindly some 
>>> one help me out with any clue or hint or a snippet to how to go about to 
>>> find the solution for the same. Please find the attached pdf
>>>
>>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/731835d1-1e35-4341-b969-6064a808eeb9%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to