Re: [tesseract-ocr] Help extracting text from images.

newbie Thu, 08 Jan 2015 08:26:18 -0800

It worked YAY!, you have all my gratitude!. ok now I need to know how you 
did the resampling. I thought you said you took the cropped image and 
resampled. But this seems like the original png file(Arris2500.png) 
resampled. Let me know how you went about resampling and how I can acheive 
it programatically.


Thanks

On Thursday, January 8, 2015 11:06:33 AM UTC-5, Allistair C wrote:
>
> Hi,
>
> I've not used tess4j but the JavaDocs show that it should be possible to 
> set TessAPI.TessPageSegMode:
>
>
> http://tess4j.sourceforge.net/docs/docs-1.0/net/sourceforge/tess4j/TessAPI.html
>
>
> http://tess4j.sourceforge.net/docs/docs-1.2/net/sourceforge/tess4j/TessAPI1.TessPageSegMode.html
>
> The 3000 resampled image was:
>
> https://dl.dropboxusercontent.com/u/523401/ArrisVIP2500_3000.png
>
> Cheers
>
> On 8 January 2015 at 15:35, newbie <[email protected] <javascript:>> 
> wrote:
>
>> Allistair,
>>             Thanks for taking the time to respond . Do you know how to 
>> use psm 6 in tess4j(its probably an argument to the instantiator, need to 
>> look up the src code) ? I have not seen any examples of it being used by 
>> googling.. I tried to resample the cropped image to 3000 px(horizontall 
>>  using paint) like you suggested and ran it thro tess4j and it still did 
>> not recognize my model number. Gave me an output of "VIPZSOO". So I guess 
>> piping it thro psm 6 is the key. Also can u send me the image that was 
>> produced after you resampled it to 3000px, so that I know my resampling is 
>> right.
>>
>> I also like your idea of providing the white box in the camera view to 
>> use it as my input to cropping . Sure can do that. 
>> I think I am glad discussed the feature matching - that seems more like 
>> object recognition than text recognition. So probably is far fetched. I had 
>> used camFlow(an app) to see if it would recognize my equipment images and 
>> it always came back with "Black media player". So they probably are using 
>> feature matching of openCV.
>>
>> Thanks again and appreciate your taking time to respond.
>>
>>
>> On Wednesday, January 7, 2015 6:12:05 PM UTC-5, Allistair C wrote:
>>>
>>> It sort of depends on your hardware and how similar or different they 
>>> are. Reliable feature matching works on distinct features (so there need to 
>>> be enough points of interest (edges usually) that cover text, buttons, 
>>> other bits and pieces). If, for example, all your hardware was the same as 
>>> the example you originally posted and only the model number was changing 
>>> then this would be an issue most likely as the feature matching may match 
>>> several targets. 
>>>
>>> Also you mention the tech takes a picture on mobile. Does that need to 
>>> be looked up immediately? The issue is that feature matching is CPU heavy 
>>> and can take time on mobile and is a function of the photo resolution. 
>>> Luckily, feature matching appears to work better on lower resolution images 
>>> and most of the time works in black and white. Then there is the potential 
>>> number of hardware items you are trying to match. The most advanced mobile 
>>> augmented reality products (Metaio, Vuforia) that use feature matching only 
>>> allow up to 100 targets to be "tracked" or "looked for" at a time - every 
>>> piece of hardware you are looking for needs to be compared to the live 
>>> input camera view (or photo) and this is the part that hits the CPU hard. 
>>> If however there was an option to offload the image(s) to a backend cloud 
>>> server for feature match or if the tech did not need an instant or any kind 
>>> of result in the field, then you are in a better situation as you can stand 
>>> up serious computing power.
>>>
>>> It's not easy to recommend one or the other without all the facts - as 
>>> you begin to mention new things like mobile and techs in the field, this 
>>> changes things :) For instance I also used mobile - an Android tablet, with 
>>> Open CV and Tesseract OCR - the combination worked in the field - the tech 
>>> can position the camera face-on to the model number and take a close photo. 
>>> You could even provide a mini App for your techs that has a basic cropping 
>>> tool. The technique I used was to show the camera view in my app with a 
>>> little white transparent box over the camera view that allowed the user to 
>>> position the text to fit that white box. Then, when the photo was taken I 
>>> simply cropped that white box coordinate rectangle and I had a perfect 
>>> match. This was easy vs. feature matching :)
>>>
>>> On Wednesday, 7 January 2015 23:02:09 UTC, newbie wrote:
>>>>
>>>> Sorry for the barrage here.
>>>> The interesting thing is you mentioned feature matching with openCV(I 
>>>> dont know anything at all about it). But the one thing is I can have a 
>>>> repository of these images with me and I need to match it to one of the 
>>>> user generated image.
>>>>
>>>> A little background might help. I can(or come up with) have a 
>>>> repository of all the equipment images with me. A tech might head to the 
>>>> field, take a picture on his mobile device and  I need to match it(tech's 
>>>> picture) against my repository and come up with the model number.
>>>>
>>>> Is this easier with ocr or feature matching with openCV ?
>>>>
>>>> Thanks
>>>>
>>>> On Wednesday, January 7, 2015 5:35:47 PM UTC-5, newbie wrote:
>>>>>
>>>>> Thanks Allistair , my lucky day as you have responded to both my 
>>>>> queries. Let me try to address your questions below and then go ahead 
>>>>> with 
>>>>> a few of my own :-)
>>>>>
>>>>> *I also meant to ask whether your use case allows for cropping. If you 
>>>>> know you will have a certain format of image, cropping an area and 
>>>>> resampling should be easy.*
>>>>> Basically the image will be an user generated image, more like the 
>>>>> first png file, but we could ask the user to zoom in to the model number, 
>>>>> if that would help us indentify the model number.we could do anything 
>>>>> with 
>>>>> the image(cropping ,resampling etc). But the problem is the model number 
>>>>> probably will not be located at the same place for all equipments.
>>>>>
>>>>> 2. Preprocessing - as it should be programatically done would I be 
>>>>> using opencv in conjunction with tesseract? I did not see much in 
>>>>> tesseract 
>>>>> for image processing(I could be totally off).
>>>>> 3.*.I also use psm 6 for these types of image with various text 
>>>>> locations.*
>>>>>    what is this ?
>>>>>
>>>>> Another thing I probably can come up with is all the model #s or 
>>>>> images of all potential equipments, so I have repository to match 
>>>>> against. 
>>>>> Would that help in any way ?
>>>>>
>>>>> Thanks again for taking the time to respond. Appreciate it.
>>>>>
>>>>>
>>>>>
>>>>> On Wednesday, January 7, 2015 4:44:47 PM UTC-5, Allistair C wrote:
>>>>>>
>>>>>> I also meant to ask whether your use case allows for cropping. If you 
>>>>>> know you will have a certain format of image, cropping an area and 
>>>>>> resampling should be easy. You could also do some preprocessing that 
>>>>>> looks 
>>>>>> for certain icons in your image to get some context as to where the 
>>>>>> model 
>>>>>> number is likely to be (see feature matching on Open CV). However, I 
>>>>>> would 
>>>>>> need to know more about your use case.
>>>>>>
>>>>>> That said, resampling your full image to 3000px wide yielded a result 
>>>>>> with a full model number but the more you can crop the area the better 
>>>>>> the 
>>>>>> result:
>>>>>>
>>>>>> AT&T U verse ‘ §
>>>>>> LINK HD nzc ,
>>>>>> rowzn Q I ‘ .» . ‘ nsuu 4 0|: > I
>>>>>> / sj J \
>>>>>> VIP2500 °%' 7 A R R I s
>>>>>>
>>>>>>
>>>>>> On 7 January 2015 at 21:39, Allistair <[email protected]> wrote:
>>>>>>
>>>>>>> A common technique is to pre-process your input image. 
>>>>>>>
>>>>>>> Resizing produced good results.I also use psm 6 for these types of 
>>>>>>> image with various text locations.
>>>>>>>
>>>>>>> In this case I first used your cropped image:
>>>>>>>
>>>>>>> tesseract ArrisVIP2500_cropped.png out -l eng -psm 6 config
>>>>>>>
>>>>>>> and got:
>>>>>>>
>>>>>>> AT&T U verse
>>>>>>> rowsn
>>>>>>> O F3.
>>>>>>> vrrzsoo ’e'
>>>>>>>
>>>>>>> Then I resampled your image to 2000px wide:
>>>>>>>
>>>>>>> tesseract ArrisVIP2500_cropped_2000.png out2000 -l eng -psm 6 config 
>>>>>>>
>>>>>>> and got:
>>>>>>>
>>>>>>> AT&T U verse
>>>>>>> POWER © " ‘|
>>>>>>> / ‘j""'j"’..
>>>>>>> VIP2500 '%’
>>>>>>>
>>>>>>> Cheers
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On 7 January 2015 at 19:26, newbie <[email protected]> wrote:
>>>>>>>
>>>>>>>> I am using tess4j, a java wrapper around tesseract and Here are the 
>>>>>>>> images and results. The intent is to extract VIP2500(model number) 
>>>>>>>> from the 
>>>>>>>> image. An help is appreciated.
>>>>>>>>
>>>>>>>> Attached are the original png  file ( ArrisVIP2500.png),binarized 
>>>>>>>> file(ArrisVIP2500_bin.TIF) and then a zoomed and cropped 
>>>>>>>> file(ArrisVIP2500_cropped.png).
>>>>>>>>
>>>>>>>> *ArrisVIP2500.png*
>>>>>>>>
>>>>>>>>  é ATE-T U-verse
>>>>>>>>
>>>>>>>> rowan 0
>>>>>>>> / 
>>>>>>>>
>>>>>>>> *ArrisVIP2500_bin.TIF*
>>>>>>>>
>>>>>>>> AT&T U-verse
>>>>>>>>
>>>>>>>> rowan <3 3
>>>>>>>> / --
>>>>>>>>
>>>>>>>> vxvzsoo ‘Q’ 
>>>>>>>>
>>>>>>>> *ArrisVIP2500_cropped.png*
>>>>>>>>
>>>>>>>> ATE-T U-verse
>>>>>>>>
>>>>>>>> rowsn Q 
>>>>>>>>
>>>>>>>> VIPZSOO ‘e’                      This looks the closest to VIP2500 
>>>>>>>> , I need to get tess4j to reconginze digits, that said, this might not 
>>>>>>>> be a 
>>>>>>>> realistic scenario, as someone/something
>>>>>>>>
>>>>>>>>                                            Needs to zoom and crop 
>>>>>>>> the image before hand(preprocessing).
>>>>>>>>
>>>>>>>> -- 
>>>>>>>> You received this message because you are subscribed to the Google 
>>>>>>>> Groups "tesseract-ocr" group.
>>>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>>>> send an email to [email protected].
>>>>>>>> To post to this group, send email to [email protected].
>>>>>>>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>>>>>>>> To view this discussion on the web visit 
>>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/009ffbc7-
>>>>>>>> 90cc-417a-90c8-b4ac9b5bb203%40googlegroups.com 
>>>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/009ffbc7-90cc-417a-90c8-b4ac9b5bb203%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>>>> .
>>>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>  -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> To post to this group, send email to [email protected] 
>> <javascript:>.
>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/e6bd4bf6-ad6e-4bef-bff7-6397c924f42b%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/e6bd4bf6-ad6e-4bef-bff7-6397c924f42b%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/34aadbd6-f211-4ef6-87ac-fd6359c16be0%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Re: [tesseract-ocr] Help extracting text from images.

Reply via email to