Interactive Image Search with Attributes

Friday, March 7, 2014 - 8:45am - 10:00am
KEC 1007

Adriana Kovashka
PhD candidate
Department of Computer Science
University of Texas at Austin

Abstract:
Search engines have come a long way, but searching for images is still primarily restricted to meta 
information such as keywords as opposed to the images' visual content. My thesis introduces a new form of 
interaction for image retrieval, where the user can give rich feedback to the system via semantic visual 
attributes (e.g., "metallic", "pointy", and "smiling"). The proposed 
WhittleSearch approach allows users to narrow down the pool of relevant images by comparing the properties of 
the results to those of the desired target.

Building on this idea, I develop a system-guided version of the method which 
actively engages the user in a 20-questions-like game where the answers are 
visual comparisons. This enables the system to obtain that information which it 
most needs to know. To ensure that the system interprets the user's 
attribute-based queries and feedback as intended, I further show how to 
efficiently adapt a generic model for an attribute to more closely align with 
the individual user's perception.

My work transforms the interaction between the image search system and its user 
from keywords and clicks to precise and natural language-based communication. I 
demonstrate the dramatic impact of this new search modality for effective 
retrieval on databases ranging from consumer products to human faces. This is 
an important step in making the output of vision systems more useful, by 
allowing users to both express their needs better and better interpret the 
system's predictions.

Biography: Adriana Kovashka is a PhD candidate in the Department of Computer Science at The University of Texas at Austin. Her advisor is Professor Kristen Grauman. Adriana received her B.A. in Computer Science and Media Studies from Pomona College, CA, in May 2008. Her research interests primarily lie in computer vision, with some overlap in machine learning, information retrieval, natural language processing, and human computation. Her focus is on enhancing the communication between computer vision systems and their human users, particularly for image retrieval. Her research has been published in the top computer vision conferences, such as Computer Vision and Pattern Recognition (CVPR) and the International Conference on Computer Vision (ICCV), as well as the annual conference of the Association for Computational Linguistics (ACL).


_______________________________________________
Colloquium mailing list
[email protected]
https://secure.engr.oregonstate.edu/mailman/listinfo/colloquium

Reply via email to