Re: [Scikit-learn-general] Active learning strategies with scikit-learn

2013-10-04 Thread Josh Wasserstein
Thank you all for the pointers. This gives me a great place to start. I will let you know if I find anything that is useful sharing with the list. Josh On Fri, Oct 4, 2013 at 9:42 AM, Emanuele Olivetti wrote: > Hi Josh, > > Some years ago I used to work on a similar problem, i.e. to decide whi

Re: [Scikit-learn-general] Links to github sources in docs?

2013-10-04 Thread Jaques Grobler
Hi Michal, There is an issue open for this: https://github.com/scikit-learn/scikit-learn/issues/1680?source=cc Feel free to join in Cheers, J 2013/10/4 Michal Romaniuk > I'm not sure if it's feasible but it would be nice to have links to > github sources in the online docs. When I'm writing

[Scikit-learn-general] Links to github sources in docs?

2013-10-04 Thread Michal Romaniuk
I'm not sure if it's feasible but it would be nice to have links to github sources in the online docs. When I'm writing my own transforms, I often browse the docs for something with a similar interface and look up the sources to see how it's implemented. A direct link would be useful :-) Cheers, M

Re: [Scikit-learn-general] Active learning strategies with scikit-learn

2013-10-04 Thread Emanuele Olivetti
Hi Josh, Some years ago I used to work on a similar problem, i.e. to decide which attributes of which instances should be measured in order to reach a given goal (in our case: to learn which features were important and which ones were not, with respect to class labels). Note that this formul

Re: [Scikit-learn-general] Active learning strategies with scikit-learn

2013-10-04 Thread Byron
This sounds a bit like *active class selection* rather than traditional active learning. See this paper: http://dl.acm.org/citation.cfm?id=1421731. You might also try the active learning mailing list: [email protected]. Best, -Byron On Thu, Oct 3, 2013 at 10:01 AM, Josh Wasser

Re: [Scikit-learn-general] Active learning strategies with scikit-learn

2013-10-04 Thread Olivier Grisel
2013/10/3 Josh Wasserstein : > Hello, > > I work in a classification problem where each instance has several > attributes (e.g. the age of an individual). However, collecting instances > (either labeled or unlabeled) is very expensive, since it requires asking > domain experts to spend a significan