Hey all, Has anyone ever altered an R package for image analysis to do optical mark recognition? I'm trying to find a way to semi-automate data entry of several thousand paper health surveys that are predominantly composed of check boxes. All the boxes are uniform size and shape, so it seems as if it should be possible to alter a package to recognize the boxes and output a 0 or 1 to correspond to whether the box is empty or not. From there, I could write if/then statements to convert the output into the relevant answers to the questions in the survey, and export it for analysis.
I've been playing around with imageHTS and EBImage, but haven't been able to alter the configuration files that come with the packages for use in identifying cells, electrophorisis screens, etc. The surveys all have a line down the middle and an ID number on each page, which should serve as anchors for the program as it identifies the boxes. My question is whether or not anyone has ever tried anything like this, or if you even think it can be done. Having spent a week on this, I'm starting to doubt my initial assumption that it should be an easy alteration to make. Thanks for any input you might have, Kirsten Simmons -- Kirsten Simmons, MPH Polymath interested in productivity, how ideas spread, gardening, marketing, entrepreneurship and models of social networks and disease transmission [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.