Scenario: We have 7000+ electric meters being changed out, and while changing them out we are taking a picture of the new meter beside the old meter to capture the previous reading. We are looking for a way to extract the meter number from all 7000 pictures programmatically. I have gotten as far as creating a batch script to run tesseract for all files in a folder, and create output txt files for all of the images. Within these images I see a bunch of jarbled text, and eventually I find the meter number. My question, can I extract just that meter number out of the images programmatically? I have a list of all 7000 meter numbers, and considered maybe making a dictionary file of just these. Would that possibly work? Can tesseract be set to ignore anything that isn't a dictionary match?
Sample meter file: http://deangrell.com/CIMG0005.tif The meter number we are trying to read is on the left, 76 207 799. Everything pulls across, even the "SANAGAMO" on the bottom of the right meter. This software is truly impressive, I just need to find a way to focus it on the meter numbers. Any help at all would be appreciated! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

