[Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

Shankar Satish Sat, 10 Mar 2012 23:48:02 -0800

Hello everyone,

I am a prospective GSOC-2012 student. I have some project ideas that I
would like to bounce off the community:


I would like to add online-learning functionality. To do so, we can implement
some reinforcement-learning algorithms. The problem is described in terms
of an "agent" that needs to choose between various possible actions, and it
gets feedback based on the results. Reinforcement learning techniques are
concerned with finding optimal policies for the agent under various
conditions.
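
To make the agent/feedback setting concrete, here is a minimal sketch of an
agent choosing among a few actions ("arms") and updating its estimates from
the rewards it observes. It uses a plain epsilon-greedy rule purely for
illustration; this is not one of the algorithms proposed in this mail. The
function name, the Bernoulli reward model and all parameter values are
assumptions made for the example.

```python
import numpy as np


def run_epsilon_greedy(true_means, n_steps=5000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy agent on a Bernoulli bandit (hypothetical helper).

    true_means: success probability of each arm, unknown to the agent.
    Returns the agent's reward estimates and how often each arm was pulled.
    """
    rng = np.random.default_rng(seed)
    n_arms = len(true_means)
    counts = np.zeros(n_arms, dtype=int)
    estimates = np.zeros(n_arms)

    for _ in range(n_steps):
        # With probability epsilon explore a random arm, otherwise
        # exploit the arm that currently looks best.
        if rng.random() < epsilon:
            arm = int(rng.integers(n_arms))
        else:
            arm = int(np.argmax(estimates))

        # Feedback from the environment: reward is 1 with probability true_means[arm].
        reward = float(rng.random() < true_means[arm])

        # Incremental update of the running mean reward for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


estimates, counts = run_epsilon_greedy([0.2, 0.5, 0.8])
```

The estimates are updated one observation at a time, which is what makes the
procedure "online": no batch of past data needs to be stored.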

Specifically, I have 2 algorithms in mind to start off with:

1. Computation of the Gittins index, which gives an optimal policy for the
discounted multi-armed bandit problem: http://en.wikipedia.org/wiki/Multi-armed_bandit
2. Value iteration/policy iteration:
http://en.wikipedia.org/wiki/Markov_decision_process#Value_iteration
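
As a rough idea of what the second item could look like, here is a minimal
sketch of value iteration for a small, fully known MDP. The function name,
the array layout (P indexed as action, state, next state; R as action, state)
and the default parameters are assumptions for illustration, not an existing
or proposed scikit-learn API.

```python
import numpy as np


def value_iteration(P, R, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Value iteration for a finite MDP (hypothetical helper).

    P: array of shape (n_actions, n_states, n_states), P[a, s, t] is the
       probability of moving from state s to state t under action a.
    R: array of shape (n_actions, n_states), expected immediate reward
       for taking action a in state s.
    Returns the estimated optimal state values and a greedy policy.
    """
    n_states = P.shape[1]
    V = np.zeros(n_states)

    for _ in range(max_iter):
        # Bellman optimality backup: Q[a, s] = R[a, s] + gamma * sum_t P[a, s, t] * V[t]
        Q = R + gamma * (P @ V)
        V_new = Q.max(axis=0)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break

    # Greedy policy with respect to the final value estimate.
    Q = R + gamma * (P @ V)
    return V, Q.argmax(axis=0)


# Toy 2-state example: state 1 pays reward 1 forever; from state 0,
# action 0 stays put and action 1 moves to state 1.
P = np.array([[[1.0, 0.0], [0.0, 1.0]],
              [[0.0, 1.0], [0.0, 1.0]]])
R = np.array([[0.0, 1.0],
              [0.0, 1.0]])
V, policy = value_iteration(P, R, gamma=0.5)
```

In this toy case the values converge to approximately V = [1, 2], and the
greedy policy in state 0 is action 1 (move to the rewarding state).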

So, what do you think about these ideas in general? Also, are they suitable
candidates for a GSOC project?

regards
shankar.


_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general