I've experimented a little with this.

Basically, there are a few different tasks, which are not necessarily done in this order:

1. Segment the image into the areas you care about (the hand) and the areas you don't
2. Detect the position/motion
3. Determine coordinates
4. Output coordinates to something

There are a few techniques for #1. One is color, as you suggest, but that's probably going to be fairly problematic as people and clothing come in many colors. Another is shape... look for a hand-shaped thing, or require the user to hold their fingers in a "V". Another is to do step 2 first, then look for the "end" of the motion, which would presumably be the fingertips. Another is to ignore this step altogether and take average motion for the whole scene and hope it's exact enough.
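
As a rough illustration of the background-diff idea behind #1, here's the logic in plain Python rather than as a QC patch (grayscale frames as lists of rows; the threshold of 30 is an arbitrary assumption you'd tune for your lighting):

```python
def segment_by_background_diff(background, frame, threshold=30):
    """Return a binary mask: 1 where the frame differs from the background.

    `background` and `frame` are grayscale images as lists of rows of
    pixel values. Anything differing by more than `threshold` is assumed
    to be something that entered the scene (e.g. a hand).
    """
    mask = []
    for bg_row, fr_row in zip(background, frame):
        mask.append([1 if abs(b - f) > threshold else 0
                     for b, f in zip(bg_row, fr_row)])
    return mask
```

The same per-pixel compare is what you'd express as a CIKernel in practice; this just shows the shape of it.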

For #2, someone suggested the optical flow patch. The optical flow algorithm is pretty handy in computer vision, as it can do everything from detecting motion of things in the scene to determining whether things are moving closer or further away. Its limitation is that it can only detect motion in the direction of the brightness gradient. (So it can't easily detect a ball spinning or a pool cue moving toward a cue ball.) A simple diff also works for detecting motion -- especially if you don't care as much about the direction and speed of the motion as you do its location.
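
A minimal sketch of the simple-diff approach -- locate the motion by taking the centroid of the changed pixels between two frames (again Python for illustration; the threshold is an assumption you'd tune):

```python
def motion_centroid(prev, curr, threshold=20):
    """Diff two grayscale frames and return the centroid (x, y) of the
    pixels that changed, or None if nothing moved."""
    xs, ys = [], []
    for y, (p_row, c_row) in enumerate(zip(prev, curr)):
        for x, (p, c) in enumerate(zip(p_row, c_row)):
            if abs(p - c) > threshold:
                xs.append(x)
                ys.append(y)
    if not xs:
        return None
    return (sum(xs) / len(xs), sum(ys) / len(ys))
```

Note this gives you a location but no direction or speed, which is exactly the trade-off mentioned above.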

Another option for #2 is to actually correlate the points from one frame to the next using something like the KLT feature tracking algorithm (Kanade-Lucas-Tomasi). At one point I tried to write a QC plug-in to do that, but it's hacky and slow and I never refined it (http://www.samkass.com/blog/page2/page2.html ).
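
For flavor, here's a much cruder stand-in for KLT -- brute-force template matching by sum of squared differences over a small search window. (Real KLT uses image gradients to avoid this exhaustive search, and handles sub-pixel motion; this sketch also assumes the patch and the whole search window stay inside the image.)

```python
def track_patch(prev, curr, px, py, size=3, search=2):
    """Find where the size x size patch at (px, py) in `prev` moved to in
    `curr`, by brute-force SSD search within +/- `search` pixels.
    Assumes all candidate patches lie fully inside the image."""
    def patch(img, x, y):
        return [row[x:x + size] for row in img[y:y + size]]

    ref = patch(prev, px, py)
    best, best_ssd = (px, py), float("inf")
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            cand = patch(curr, px + dx, py + dy)
            ssd = sum((a - b) ** 2
                      for r_ref, r_cand in zip(ref, cand)
                      for a, b in zip(r_ref, r_cand))
            if ssd < best_ssd:
                best_ssd, best = ssd, (px + dx, py + dy)
    return best
```

Even this naive version conveys the idea: pick distinctive patches in one frame and find them again in the next.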

For #3, you talked about mucking with contrast and saturation to draw out the motion. Another option is to put everything in black and white with a threshold, do some noise reduction, then just look for edges. In any case, figuring out which are the "interesting" coordinates may end up being the hardest part. You could do a filter that looked for areas of higher curvature on the diff image and assume those are fingertips. Or use the average optical flow values to offset the "current" location to a "new" location... it won't be exact, but you'll be able to "sweep" your hand in a direction and the mouse will move that way.
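
A toy version of the threshold-then-find-fingertip idea: after thresholding the diff into a binary mask, just take the topmost moved pixel as the fingertip guess. That's far cruder than the curvature test described above, but it shows the shape of the logic (and works surprisingly often when the hand enters from the bottom of the frame).

```python
def fingertip_guess(mask):
    """Given a binary motion mask (1 = moved), return the topmost moved
    pixel as an (x, y) fingertip guess, or None if nothing moved."""
    for y, row in enumerate(mask):
        for x, v in enumerate(row):
            if v:
                return (x, y)
    return None
```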

For #4, you'll need a custom QC plug-in. Unless things have changed (I haven't done any serious QC hacking in quite a while), there's no way to output coordinates from an image input. So put your Objective-C hat on and take a look at the examples...

        --Sam


On Jan 17, 2008, at 10:35 AM, Johnson, Mark P. - Duluth wrote:

I'm trying to figure out the best way to track hands as though a person in
front of a kiosk were using a mouse (or two).

My strategy so far is to grab an image from a web cam when the Quartz composition launches. Using a trick I saw in Photo Booth, you can then compare the current frame of captured video to the background and find the difference --
that gives you a quick mask of something entering the screen.

Next, I would up contrast and saturation, to exaggerate differences. By comparing a frame from a moment before with a current frame, I could then
discover motion.

The trick is to get the "most moved" area and perhaps the smallest area -- assuming that a finger is going to be smaller than a head, and is going to have more movement (not always true -- but the user should get the hang of
it in short order).

So I would need to figure out not only the difference between the prior image and the current image for what has moved -- I need to use a false color to get an idea that that tan blob over there (a hand) has moved more than the blue jacket. I'm not completely sure how to do that. It sort of overlaps some of the other color-lookup discussions, where you don't want to check the color of each strand of hair, because lighting conditions can quickly change -- you want to "blob" the median values of a region. AFTER you discover the motion of a common region, you can track perhaps a highlight of the most-moved point (like a finger). Luminosity would be valuable for using the Outline sketch effects to define regions, and then again for highlighting that gives dimension.

I'm thinking out loud -- maybe someone has come up with a much easier strategy to turn a camera into a mouse. I've seen these with video displays at malls, and I'm pretty sure they are using infrared on the camera to ignore their own projection. I may need to get a real video camera and hack it to grab only infrared -- I'd love advice on that.

I know there is some patch that allows for finding the pixel in a row or column with the greatest or least value. With so many new patches I've lost track of it -- but after I can somehow visually create an image with the "most moved" regions, I can find an x and y position and use that as though it were a mouse.

I'm assuming CIKernel functions that do these operations would be faster -- so if anyone can point me to prior work or a way to do this, I'd love it.

_______________________________________________
Do not post admin requests to the list. They will be ignored.
Quartzcomposer-dev mailing list ([email protected] )
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/quartzcomposer-dev/samkass%40samkass.com

This email sent to [EMAIL PROTECTED]
