See
http://scikit-learn.org/dev/faq.html#what-are-the-inclusion-criteria-for-new-algorithms
and
http://scikit-learn.org/dev/faq.html#why-is-there-no-support-for-deep-or-reinforcement-learning-will-there-be-support-for-deep-or-reinforcement-learning-in-scikit-learn

Bandit algorithms require a fundamentally different kind of interface than what's in scikit-learn right now, as they are sequential decision making algorithms.

On 09/04/2018 01:23 PM, Touqir Sajed wrote:
Hi,

This email is intended to initiate a discussion on whether it is worth adding Multi-Armed Bandit (MAB) algorithms in Scikit-learn. For those of you who have not heard of MAB algorithms, they are the simplest form of decision-making algorithms applicable whenever data with labels are not given beforehand and the objective is to try out different decisions, whenever a sample is seen, and learn which decision is the best in the long run. They are the simplest form of Reinforcement Learning algorithms. While they are not applicable for every decision-making tasks, they naturally fit into a number of problem settings where they are more sample efficient and simpler than the more advanced RL algorithms. For a number of applications : https://www.quora.com/In-what-kind-of-real-life-situations-can-we-use-a-multi-arm-bandit-algorithm. If <https://ml-trckr.com/link/https%3A%2F%2Fwww.quora.com%2FIn-what-kind-of-real-life-situations-can-we-use-a-multi-arm-bandit-algorithm.%C2%A0If/0oMKOPmYKsXmGqYdOxqi> you want to know more about their usage, how they work or their advantages, feel free to let me know!

I do feel that MAB algorithms should be a part of Scikit-learn since a lot of the interesting problems that we face regarding learning is about decision making. There are quite a few github repos with MAB implementations but their coverage is extremely limited and I do not know of any dedicated library on MABs. Companies like Yahoo, Microsoft, Google use MABs for Ad recommendation and search engine optimization but their code is not made public.

Cheers,
Touqir

--
Computing Science Master's student at University of Alberta, Canada, specializing in Machine Learning. Website : https://ca.linkedin.com/in/touqir-sajed-6a95b1126 <https://ml-trckr.com/link/https%3A%2F%2Fca.linkedin.com%2Fin%2Ftouqir-sajed-6a95b1126/0oMKOPmYKsXmGqYdOxqi>


_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

Reply via email to