Towards Intelligent Agents That Can See, Talk, and Act  is coming at
03/11/2019 - 10:00am

KEC 1007
Mon, 03/11/2019 - 10:00am

Stefan Lee
Research Scientist, School of Interactive Computing, Georgia Tech

Abstract:
For AI agents to fully step into the role of human collaborators, they must
be able to perceive their environment and communicate about this
understanding with humans in order to coordinate their actions to achieve
mutual goals. The development of such holistic agents presents challenging
problems for computer vision, natural language processing, and machine
learning. Towards this end, I'll discuss a recent line of work developing
agents that communicate in natural language regarding visual scenes including
both static images and 3D environments. First, I will focus on work
developing agents that engage in visually-grounded, question-answer based
dialogs -- a task we call Visual Dialog. I will provide an overview of the
Visual Dialog task and highlight some challenges faced by deep agents trained
for this problem. Then I will discuss follow-up work in which we address some
of these challenges by modeling Visual Dialog as a cooperative game between
agents in a reinforcement learning setting -- learning dialog agent policies
end-to-end, from pixels to multi-agent, multi-round dialog to game reward.
Finally, I'll discuss EmbodiedQA, a recent effort to extend beyond static
images and ground similar agents into simulated 3D environments.

Bio:

Read more:
http://eecs.oregonstate.edu/colloquium/towards-intelligent-agents-can-se... 
[1]


[1] 
http://eecs.oregonstate.edu/colloquium/towards-intelligent-agents-can-see-talk-and-act
_______________________________________________
Colloquium mailing list
[email protected]
https://secure.engr.oregonstate.edu/mailman/listinfo/colloquium

Reply via email to