Monday
March 12
4:00 - 4:50 PM 
Kelley 1001

 

Trevor Darrell 
MIT CSAIL



Visual Recognition and Tracking for Perceptive Interfaces

 

Devices should be perceptive, and respond directly to their human user
and/or environment. In this talk I'll present new computer vision
algorithms for fast recognition, indexing, and tracking that make this
possible, enabling multimodal interfaces which respond to users'
conversational gesture and body language, robots which recognize common
object categories, and mobile devices which can search using visual cues
of specific objects of interest. As time permits, I'll describe recent
advances in real-time human pose tracking for multimodal interfaces,
including new methods which exploit fast computation of approximate
likelihood with a pose-sensitive image embedding. I'll also present our
linear-time approximate correspondence kernel, the Pyramid Match, and
its use for image indexing and object recognition, and very recent work
on transfer of predictive structures from previous category recognition
tasks. I'll show interface examples based on these techniques, including
prototypes for grounded multimodal conversation and mobile image-based
information retrieval.

 

Biography:

 

Trevor Darrell leads the Vision Interface Group at the MIT Computer
Science and Artificial Intelligence Laboratory. His interests include
computer vision, multimodal interfaces, and machine learning. Prior to
returning to MIT he worked as a Member of the Research Staff at Interval
Research in Palo Alto, CA, researching vision-based interface
algorithms. He received his PhD and SM from MIT in 1996 and 1991,
respectively, while working at the Media Laboratory, and the BSE from
the University of Pennsylvania in 1988, where he worked in the GRASP
Robotics Laboratory.

_______________________________________________
Colloquium mailing list
[email protected]
https://secure.engr.oregonstate.edu/mailman/listinfo/colloquium

Reply via email to