Re: [Scikit-learn-general] class label hashing

2015-05-01 Thread Andreas Mueller
nberg [michael.eickenb...@gmail.com]
Sent: Friday, May 01, 2015 11:13 AM
To: scikit-learn-general@lists.sourceforge.net
Subject: Re: [Scikit-learn-general] class label hashing

What do you expect a classifier to predict on a label that it has never seen during training? If there were structure in the t

Re: [Scikit-learn-general] class label hashing

2015-05-01 Thread Pagliari, Roberto
et<mailto:scikit-learn-general@lists.sourceforge.net>
Subject: Re: [Scikit-learn-general] class label hashing

Roberto, I am not sure if this causes problems regarding the implementation, but in any case, I'd recommend using the LabelEncoder to have your classes mapped to a fixed r

Re: [Scikit-learn-general] class label hashing

2015-05-01 Thread Michael Eickenberg
> fferent label numbers are really different classes ]
>
> From: Sebastian Raschka [se.rasc...@gmail.com]
> Sent: Thursday, April 30, 2015 11:08 PM
> To: scikit-learn-general@lists.sourceforge.net
> Subject: Re: [Scikit-learn-ge

Re: [Scikit-learn-general] class label hashing

2015-05-01 Thread Pagliari, Roberto
From: Sebastian Raschka [se.rasc...@gmail.com]
Sent: Thursday, April 30, 2015 11:08 PM
To: scikit-learn-general@lists.sourceforge.net
Subject: Re: [Scikit-learn-general] class label hashing

Roberto, I am not sure if this causes problems regarding the implementation, but in any case, I'd recommend

Re: [Scikit-learn-general] class label hashing

2015-04-30 Thread Sebastian Raschka
Roberto, I am not sure if this causes problems regarding the implementation, but in any case, I'd recommend using the LabelEncoder to map your classes to a fixed range, e.g., 0, 1, 2, 3, 4, 5. And having different class labels in training and test set that refer to the same clas
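
A minimal sketch of the LabelEncoder suggestion, using the label values from the original question. The variable names are illustrative only, and the behaviour shown is that of `sklearn.preprocessing.LabelEncoder` as I understand it: the encoder is fitted once on the training labels, labels it has seen always receive the same code, and transforming a label it has never seen raises a `ValueError`.

```python
import numpy as np
from sklearn.preprocessing import LabelEncoder

# Label values taken from the question; the arrays themselves are made up.
train_labels = np.array([0, 3, 4, 6, 7, 3, 0])
test_labels = np.array([0, 3, 4, 8, 10])

# Fit once on the training labels: classes_ holds the sorted unique labels,
# and each label is encoded as its index in classes_.
encoder = LabelEncoder().fit(train_labels)
encoded_train = encoder.transform(train_labels)

# Labels already seen in training get the same code in the test set.
seen_mask = np.isin(test_labels, encoder.classes_)
encoded_seen = encoder.transform(test_labels[seen_mask])

# Labels that were never seen (8 and 10 here) cannot be encoded.
try:
    encoder.transform(test_labels)
    unseen_error = None
except ValueError as exc:
    unseen_error = exc

print(encoder.classes_, encoded_train, encoded_seen, unseen_error)
```

Reusing the fitted encoder, rather than fitting a new one on the test labels, is what keeps the codes for 0, 3 and 4 consistent between the two datasets.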

[Scikit-learn-general] class label hashing

2015-04-30 Thread Pagliari, Roberto
Suppose I train a classifier with dataset1, which contains labels 0, 3, 4, 6, 7, and then predict over dataset2, which contains labels 0, 3, 4, 8, 10. Will the hashing be the same for labels 0, 3 and 4? And will scikit-learn get confused by seeing new labels such as 8 and 10? Thank you, -
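
The scenario in the question can be tried out directly. The following is a small sketch under stated assumptions: the one-feature toy data and the choice of `DecisionTreeClassifier` are mine, not from the thread. It relies on the fitted classifier exposing the training labels through its `classes_` attribute and on `predict` returning only labels drawn from that set.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Hypothetical toy data: one feature per sample, labels from the question.
X_train = np.array([[0.0], [3.0], [4.0], [6.0], [7.0]])
y_train = np.array([0, 3, 4, 6, 7])
X_test = np.array([[0.0], [3.0], [4.0], [8.0], [10.0]])
y_test = np.array([0, 3, 4, 8, 10])

clf = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)

# The classifier remembers the original label values seen during fit.
known_classes = clf.classes_

# predict() only receives features, so it never "sees" the test labels;
# every prediction is one of the labels learned during training.
predictions = clf.predict(X_test)
only_known = set(predictions.tolist()) <= set(known_classes.tolist())

# Samples whose true label is 8 or 10 can therefore never be scored correct.
unseen_mask = ~np.isin(y_test, known_classes)
unseen_correct = int(np.sum(predictions[unseen_mask] == y_test[unseen_mask]))

print(known_classes, predictions, only_known, unseen_correct)
```

In this sketch the labels 0, 3 and 4 mean the same thing at training and prediction time because predictions are reported in the original label values, while the new labels 8 and 10 only matter when predictions are compared against the true test labels.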