Re: [scikit-learn] Multi Armed Bandit Algorithms in Scikit-learn

Andreas Mueller Tue, 04 Sep 2018 10:56:33 -0700

See
http://scikit-learn.org/dev/faq.html#what-are-the-inclusion-criteria-for-new-algorithms
and
http://scikit-learn.org/dev/faq.html#why-is-there-no-support-for-deep-or-reinforcement-learning-will-there-be-support-for-deep-or-reinforcement-learning-in-scikit-learn

Bandit algorithms require a fundamentally different kind of interfacethan what's in scikit-learn right now, as they are sequential decisionmaking algorithms.


On 09/04/2018 01:23 PM, Touqir Sajed wrote:

Hi,
This email is intended to initiate a discussion on whether it is worthadding Multi-Armed Bandit (MAB) algorithms in Scikit-learn. For thoseof you who have not heard of MAB algorithms, they are the simplestform of decision-making algorithms applicable whenever data withlabels are not given beforehand and the objective is to try outdifferent decisions, whenever a sample is seen, and learn whichdecision is the best in the long run. They are the simplest form ofReinforcement Learning algorithms. While they are not applicable forevery decision-making tasks, they naturally fit into a number ofproblem settings where they are more sample efficient and simpler thanthe more advanced RL algorithms. For a number of applications :https://www.quora.com/In-what-kind-of-real-life-situations-can-we-use-a-multi-arm-bandit-algorithm. If<https://ml-trckr.com/link/https%3A%2F%2Fwww.quora.com%2FIn-what-kind-of-real-life-situations-can-we-use-a-multi-arm-bandit-algorithm.%C2%A0If/0oMKOPmYKsXmGqYdOxqi>you want to know more about their usage, how they work or theiradvantages, feel free to let me know!
I do feel that MAB algorithms should be a part of Scikit-learn since alot of the interesting problems that we face regarding learning isabout decision making. There are quite a few github repos with MABimplementations but their coverage is extremely limited and I do notknow of any dedicated library on MABs. Companies like Yahoo,Microsoft, Google use MABs for Ad recommendation and search engineoptimization but their code is not made public.
Cheers,
Touqir

--
Computing Science Master's student at University of Alberta, Canada,specializing in Machine Learning. Website :https://ca.linkedin.com/in/touqir-sajed-6a95b1126<https://ml-trckr.com/link/https%3A%2F%2Fca.linkedin.com%2Fin%2Ftouqir-sajed-6a95b1126/0oMKOPmYKsXmGqYdOxqi>
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

Re: [scikit-learn] Multi Armed Bandit Algorithms in Scikit-learn

Reply via email to