Towards Intelligent Agents That Can See, Talk, and Act is coming at 03/11/2019 - 10:00am
KEC 1007 Mon, 03/11/2019 - 10:00am Stefan Lee Research Scientist, School of Interactive Computing, Georgia Tech Abstract: For AI agents to fully step into the role of human collaborators, they must be able to perceive their environment and communicate about this understanding with humans in order to coordinate their actions to achieve mutual goals. The development of such holistic agents presents challenging problems for computer vision, natural language processing, and machine learning. Towards this end, I'll discuss a recent line of work developing agents that communicate in natural language regarding visual scenes including both static images and 3D environments. First, I will focus on work developing agents that engage in visually-grounded, question-answer based dialogs -- a task we call Visual Dialog. I will provide an overview of the Visual Dialog task and highlight some challenges faced by deep agents trained for this problem. Then I will discuss follow-up work in which we address some of these challenges by modeling Visual Dialog as a cooperative game between agents in a reinforcement learning setting -- learning dialog agent policies end-to-end, from pixels to multi-agent, multi-round dialog to game reward. Finally, I'll discuss EmbodiedQA, a recent effort to extend beyond static images and ground similar agents into simulated 3D environments. Bio: Read more: http://eecs.oregonstate.edu/colloquium/towards-intelligent-agents-can-se... [1] [1] http://eecs.oregonstate.edu/colloquium/towards-intelligent-agents-can-see-talk-and-act
_______________________________________________ Colloquium mailing list [email protected] https://secure.engr.oregonstate.edu/mailman/listinfo/colloquium
