[scikit-learn] top N accuracy classification metric

2017-01-20 Thread Johnson, Jeremiah
Hi all, It's common to use a top-n accuracy metric for multi-class classification problems, where for each observation the prediction is the set of probabilities for each of the classes, and a prediction is top-N accurate if the correct class is among the N highest predicted probability classes

[scikit-learn] Identify spectra with "marker"

2017-01-20 Thread Sebastian Illner
Hi guys, I'm new to NIR-measurement as wenn as chemometrics. My current project involvs the recognition of determined spectra (of a reference system) among others. The materials are currentlys not really set. So I try to give a predetermined mixture of substances into another matrix and group t

[scikit-learn] PR #8190: "Implement Complement Naive Bayes."

2017-01-20 Thread Michael Alcorn
Hi all, I would appreciate it if a couple of maintainers could take a look at my pull request (https://github.com/scikit-learn/scikit-learn/pull/8190) implementing the Complement Naive Bayes (CNB) classifier described in Rennie et al. (2003). CNB regularly outperforms the standard Multinomial Naiv

[scikit-learn] Pipeline conventions for wrappers

2017-01-20 Thread Aaditya Jamuar
Hi Guys, I am currently working on gensim ( https://github.com/RaRe-Technologies/gensim) , writing wrappers for Scikit-learn for easy integration of LDA ( https://github.com/RaRe-Technologies/gensim/pull/932/files). While I have covered most of the API conventions as specified on scikit-learn's w