Re: [Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

2012-03-11 Thread Shankar Satish
Hello everyone, I have perused the past email archives for discussion on "partial_fit" and the online-learning APIs. As per my understanding, having a "partial_fit" kind of API essentially modifies a supervised learning classifier so that it can accept a continuous stream of data. In that case, m

Re: [Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

2012-03-11 Thread Shankar Satish
Hi Andy, An explanation of Gittin's index: Imagine you are in a casino which has N slot machines. Each machine can be played by inserting a 1-dollar coin in it and pulling a lever. Every time you pull a lever, the machine might spit out some amount of money, governed by an underlying (unknown) st

Re: [Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

2012-03-11 Thread Andreas Mueller
Hi Shankar. Can you explain in how far these two ideas are related to online learning? I am not familiar with Gittin's index but value iteration and policy iteration are dynamic programming algorithms that assume a model is given. I would implement those by either giving the full transition mod

[Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

2012-03-10 Thread Shankar Satish
Hello everyone, I am a prospective GSOC-2012 student. I have some project ideas that i would like to bounce-off the community: I would like add online-learning functionality. To do so, we can implement some reinforcement-learning algorithms. The problem is described in terms of an "agent" that ne