[UAI] Call for papers: NIPS-08 Workshop on Model Uncertainty and Risk in Reinforcement Learning

Pascal Poupart Sun, 21 Sep 2008 08:51:59 -0700


Call for papers
NIPS-08 Workshop on Model Uncertainty and Risk in Reinforcement Learning
http://www.cs.uwaterloo.ca/~ppoupart/nips08-workshop.html
Whistler, BC, Canada
December 13, 2008



Important Dates
---------------
* Oct 30: submission deadline
* Nov 4: notification of acceptance


Overview
--------

Reinforcement Learning (RL) problems are typically formulated in terms ofStochastic Decision Processes (SDPs), or a specialization thereof,Markovian Decision Processes (MDPs), with the goal of identifying anoptimal control policy. In contrast to planning problems, RL problems arecharacterized by the lack of complete information concerning thetransition and reward models of the SDP. Hence, algorithms for solving RLproblems need to estimate properties of the system from finite data.Naturally, any such estimated quantity has inherent uncertainty. One ofthe interesting and challenging aspects of RL is that the algorithms havepartial control over the data sample they observe, allowing them toactively control the amount of this uncertainty, and potentially trade itoff against performance.

Reinforcement Learning as a field of research, has over the past few yearsseen renewed interest in methods that explicitly consider theuncertainties inherent to the learning process. Indeed, interest indata-driven models that take uncertainties into account, goes beyond RL tothe fields of Control Theory, Operations Research and Statistics. Withinthe RL community, relevant lines of research may be classified into thefollowing (partially overlapping) sub-fields:

1- Bayesian RL. Bayesian methods attempt to explicitly model uncertaintiesusing posterior probability distributions, computed using Bayes' rule.Such Bayesian modeling may be used in estimating the MDP's transition andreward distributions; or in estimating other quantities that are moredirectly related to performance, such as value function and policygradient.

2- Risk sensitive and robust dynamic decision making. These methods useinformation beyond the expected return, to compute policies that arerobust to inaccuracies in the estimated model. Such quantities includequantiles, as well as higher order moments of the return random variable.A closely related family of methods use expectations of non-linearmappings of the return, as their measures of performance.

3- RL with confidence intervals. This research is concerned with methodsthat employ Frequentist measures of model uncertainties, based onconfidence intervals. Much of this research is focused on on-linealgorithms, whose performance is evaluated concurrently with the learningprocess.

4- Applications of risk-aware and uncertainty-aware decision-making.Applications in mission critical tasks, finance, and other risk-sensitivedomains, where uncertainties have to be taken into account, in order toestablish a level of worst-case performance, or to guarantee a minimumlevel of performance that may be achieved with high probability.

This workshop is aimed at bringing together researchers working in theseand related fields, allow them to present their current research, anddiscuss possible directions for future work. We intend to focus onpossible interactions between the sub-fields listed above, as well as oninteractions with other related fields, which are outside of the currentRL mainstream.



Workshop format
---------------

This is a one-day workshop consisting of:

1- Invited talks

2- Contributed talks

3- Panel discussions

3.1- Models that work and those that don't: participants will discussspecific applications and theoretical models and share experience regardingthe effectiveness of different approaches.3.2- Benchmarks and challenges: discussion of some proposals for sampleproblems that encompass the core challenges of model uncertainty andrisk sensitive control that could serve as benchmarks and/or challenges.


4- Poster session


Call for Contributions
----------------------

Participants are invited to submit either a technical paper (eight pagesin the conference format) or an extended abstract (up to two pages)describing research relevant to the workshop. Submissions should be sentvia email to Pascal Poupart at [EMAIL PROTECTED] by Oct 30thin Postscript, PDF, or MS Word format. Previously published work that isreworded, summarized or extended may be submitted to the workshop.However, priority will be given to novel work. If the papers are ofsufficient quantity and quality, we will seek to publish them as anedited book or journal special issue.



Important Dates
---------------

Oct 30: submission deadline
Nov 4: notification of acceptance
Dec 13: workshop in Whistler


Workshop webpage
----------------

http://www.cs.uwaterloo.ca/~ppoupart/nips08-workshop.html


Organizing Committee
--------------------

1- Yaakov Engel ([EMAIL PROTECTED])
2- Mohammad Ghavamzadeh (INRIA - Team SequeL, [EMAIL PROTECTED]),
3- Shie Mannor (McGill University, [EMAIL PROTECTED])
4- Pascal Poupart (University of Waterloo, [EMAIL PROTECTED])


--
------------------------
Pascal Poupart
Assistant Professor
David R. Cheriton School of Computer Science
University of Waterloo
200 University Avenue West
Waterloo, Ontario
Canada N2L 3G1
------------------------
Web: http://www.cs.uwaterloo.ca/~ppoupart

Email: [EMAIL PROTECTED]Telephone: 1-519-888-4567x36239Fax: 1-519-885-1208

------------------------

_______________________________________________
uai mailing list
[email protected]
https://secure.engr.oregonstate.edu/mailman/listinfo/uai

[UAI] Call for papers: NIPS-08 Workshop on Model Uncertainty and Risk in Reinforcement Learning

Reply via email to