Hello everyone,
I have perused the past email archives for discussion on "partial_fit" and
the online-learning APIs. As per my understanding, having a "partial_fit"
kind of API essentially modifies a supervised learning classifier so that
it can accept a continuous stream of data.
In that case, m
Hi Andy,
An explanation of Gittin's index:
Imagine you are in a casino which has N slot machines. Each machine can be
played by inserting a 1-dollar coin in it and pulling a lever. Every time
you pull a lever, the machine might spit out some amount of money, governed
by an underlying (unknown) st
Hi Shankar.
Can you explain in how far these two ideas are related to online learning?
I am not familiar with Gittin's index but value iteration and policy
iteration
are dynamic programming algorithms that assume a model is given.
I would implement those by either giving the full transition mod
Hello everyone,
I am a prospective GSOC-2012 student. I have some project ideas that i
would like to bounce-off the community:
I would like add online-learning functionality. To do so, we can implement
some reinforcement-learning algorithms. The problem is described in terms
of an "agent" that ne