date:20170416

[scikit-learn] sklearn - knn sklearn.neighbors kneighbors function producing unexpected result for text analysis?

2017-04-16 Thread Evaristo Caraballo via scikit-learn

I have been asked to implement a simple knn for text similarity analysis. I tried by using sklearn.neighbors module.The file to be analysed consisted on 2 relevant columns: "text" and "name".The knn model should be fitted with bag-of-words of a corpus of around 60,000 pre-treated text fragments

[scikit-learn] Ordinary Least Square Regression Under-determined system.

2017-04-16 Thread Joshua Mannheimer

Hi all, So I am trying to write a Principle Components Regression implementation in Python to match the PLS package in R. I am getting better results in R so I am trying to figure out where the discrepancy was. The data I am using is way undetermined where n_features ~ 50,000 and n_samples ~ 500 t