Dmitri,

You're right, I need to get better-quality pictures first. I will work on it 
and hopefully come back with better ones.

Thanks!


On Monday, May 18, 2015 at 8:03:00 AM UTC-4, Dmitri Silaev wrote:
>
> Hi James,
>
> I think more effort should go into getting better source images here. In 
> principle, there are two alternatives: 
> - Get a good-quality source image. You'll then be able to handle it with 
> relatively simple preprocessing, maybe using ImageMagick, and you'll 
> probably be able to use Tesseract.
> - Let any arbitrary image into your pipeline. Be prepared to develop (or 
> order from a 3rd party) a complex image processor, full-fledged programming, 
> etc.
>
> If you choose the first option, I suggest improving the following to 
> simplify further OCR:
> - Don't use JPEG. It leaves a massive number of compression artifacts in 
> each of your images. Use lossless PNG instead.
> - Improve the lighting. Shots that are too dark result in overwhelming 
> noise. Use either an external light or the flash. Beware of flares, though. 
> Experiment to get the best shots.
> - Try to hold the camera square to the display when shooting 
> (fronto-parallel projection). Otherwise you'd need perspective correction 
> as a preprocessing step, or at least skew correction.
> - The LCD display area should occupy as much of the image as possible and 
> be centered. Otherwise you'd need background removal, ROI detection, or 
> heuristics for locating reference points in the image.
>
> If you fix all of the above, you'll probably be able to manage with 
> homemade ImageMagick scripts and Tesseract. You can send your sample images 
> again so that we can discuss what can be done further. 
>
> There are a number of training attempts for LCD display fonts on the 
> internet; look for them. They seem to address fonts similar to yours, but 
> in the end you'd probably need to train Tesseract yourself.
>
> Best regards,
> Dmitri Silaev
> www.CustomOCR.com
>
>
>
>
>
> On Thu, May 14, 2015 at 8:17 PM, James Okken <[email protected]> wrote:
>
>> Dmitri,
>>
>> Thanks very much for your response. Any help would be huge!
>> Anything you suggest for LCD segments would be huge too!
>>
>> I've attached more of the original images.
>>
>> Thanks
>>
>> On Thursday, May 14, 2015 at 3:41:14 AM UTC-4, Dmitri Silaev wrote:
>>>
>>> Hi James,
>>>
>>> I can suggest a number of steps regarding connected component analysis, 
>>> but it would be better if you showed the original photos. There are 
>>> probably easier ways to get the numbers from them. Be aware also that 
>>> Tesseract might not be the best way to read LCD segment displays. It can 
>>> work well for you, though; it depends on the specifics of the source 
>>> images. Attach several samples.
>>>
>>> Best regards,
>>> Dmitri Silaev
>>> www.CustomOCR.com
>>>
>>>
>>>
>>>
>>>
>>> On Wed, May 13, 2015 at 8:31 PM, James Okken <[email protected]> wrote:
>>>
>>>> Hi everyone.
>>>>
>>>> Can Tesseract pull the numbers off the attached thermostat picture? 
>>>> I've tried a number of things, including making the photo better quality, 
>>>> to no avail.
>>>>
>>>> Any help would be appreciated! Thanks!!
>>>>
>>>> -- 
>>>> You received this message because you are subscribed to the Google 
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>> an email to [email protected].
>>>> To post to this group, send email to [email protected].
>>>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>>>> To view this discussion on the web visit 
>>>> https://groups.google.com/d/msgid/tesseract-ocr/0de8f0b4-dff2-44f0-bd91-bd0403e4d130%40googlegroups.com
>>>>  
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/0de8f0b4-dff2-44f0-bd91-bd0403e4d130%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>> .
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>
>
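
As a rough illustration of the "relatively simple preprocessing" Dmitri 
mentions above, here is a minimal Python sketch of one common step: 
binarizing a grayscale image with Otsu's threshold. This is only an 
assumption about what such preprocessing might include; the thread does not 
specify it, and in practice ImageMagick or an imaging library would do this 
on the real photo. The function names `otsu_threshold` and `binarize` and 
the tiny sample grid are hypothetical.

```python
def otsu_threshold(pixels):
    """Return a threshold t in 0..255 that maximizes between-class variance.

    `pixels` is a list of rows of integer gray levels (0 = black, 255 = white).
    """
    # Histogram of gray levels.
    hist = [0] * 256
    for row in pixels:
        for value in row:
            hist[value] += 1

    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_between = 0, -1.0
    weight_dark = 0  # number of pixels with level <= t
    sum_dark = 0     # sum of gray levels of those pixels
    for t in range(256):
        weight_dark += hist[t]
        sum_dark += t * hist[t]
        if weight_dark == 0:
            continue  # no dark pixels yet
        weight_light = total - weight_dark
        if weight_light == 0:
            break     # every pixel is already in the dark class
        mean_dark = sum_dark / weight_dark
        mean_light = (sum_all - sum_dark) / weight_light
        between = weight_dark * weight_light * (mean_dark - mean_light) ** 2
        if between > best_between:
            best_between, best_t = between, t
    return best_t


def binarize(pixels, threshold):
    """Mark dark pixels (level <= threshold) as 1 and light pixels as 0.

    Assumes dark LCD segments on a lighter background.
    """
    return [[1 if value <= threshold else 0 for value in row] for row in pixels]


# Hypothetical 2x3 crop: two dark "segment" pixels on a light background.
sample = [
    [200, 200, 30],
    [210, 25, 205],
]
t = otsu_threshold(sample)
mask = binarize(sample, t)
```

A global threshold like this only behaves well on evenly lit, low-noise 
shots, which is one more reason to fix the lighting and the JPEG artifacts 
before doing anything else.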
