[scikit-learn] [ANN] Scikit-learn 0.20.0

2018-10-03 Thread Alex Garel
Le 02/10/2018 à 16:46, Andreas Mueller a écrit : > Thank you for your feedback Alex! Thanks for answering ! > > On 10/02/2018 09:28 AM, Alex Garel wrote: >> >> * chunk processing (kind of handling streaming data) :  when >> dealing with lot of data, the abili

[scikit-learn] [ANN] Scikit-learn 0.20.0

2018-10-02 Thread Alex Garel
Le 26/09/2018 à 21:59, Joel Nothman a écrit : > And for those interested in what's in the pipeline, we are trying to > draft a > roadmap...  > https://github.com/scikit-learn/scikit-learn/wiki/Draft-Roadmap-2018 Hello, First of all thanks for the incredible work on scikit-learn. I found the RoadM

[scikit-learn] Outliers removal

2018-04-04 Thread Alex Garel
Hello, First, thanks for the fantastic scikit-learn library. I have the following use case: For a classification problem, I have a list of sentences and use word2vec and a method (eg. mean, or weigthed mean, or attention and mean) to transform sentences to vectors. Because my dataset is very nois

Re: [scikit-learn] sklearn - knn sklearn.neighbors kneighbors function producing unexpected result for text analysis?

2017-04-20 Thread Alex Garel
I'm not totally sure of what you're trying to do, but here are some remarks that may help you: 1. in modelfit = model.fit(count_vect, enc), the enc parameter is not used, only the count_vect matrix is used 2. when you use kneighbors you get vectors corresponding to wiki['text'] not to wiki['name']