Hi Anna,

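You can set shuffle=False (it defaults to True in the make_classification 
function). Without shuffling, the columns of X are stacked in a fixed order: 
the n_informative informative features first, then the n_redundant redundant 
ones, then the n_repeated repeated ones, and finally the noise features. All 
useful features are therefore in X[:, :n_informative + n_redundant + n_repeated]. 
For example, if you set “n_features=1000” and “n_informative=20”, the first 20 
columns, X[:, :20], will be the informative ones.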
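
Here is a minimal sketch of that idea, assuming a recent scikit-learn and 
NumPy. The sample count, the random seeds, and the variable names 
(informative_idx, perm, informative_after, ...) are only illustrative 
choices, not anything make_classification returns by itself. The second half 
shows one possible way to shuffle the columns yourself while keeping track 
of where the informative ones end up.

```python
import numpy as np
from sklearn.datasets import make_classification

# Illustrative settings; n_redundant=2 and n_repeated=0 are the defaults.
n_informative, n_redundant, n_repeated = 20, 2, 0

X, y = make_classification(
    n_samples=200,
    n_features=1000,
    n_informative=n_informative,
    n_redundant=n_redundant,
    n_repeated=n_repeated,
    shuffle=False,      # keep the documented column order
    random_state=0,
)

# With shuffle=False the informative features are the leading columns,
# followed by the redundant ones (repeated ones would come next).
informative_idx = np.arange(n_informative)
redundant_idx = np.arange(n_informative, n_informative + n_redundant)
X_informative = X[:, informative_idx]

# Optional: shuffle columns manually and remember the permutation.
rng = np.random.default_rng(0)
perm = rng.permutation(X.shape[1])
X_shuffled = X[:, perm]  # column j of X_shuffled is column perm[j] of X

# Positions of the informative features after the manual shuffle.
informative_after = np.flatnonzero(perm < n_informative)
```

Note that shuffle=False also leaves the samples unshuffled, so you may want 
to shuffle the rows of X and y together before splitting into train and 
test sets.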

Best,
Sebastian

> On Aug 12, 2020, at 8:35 AM, Anna Jenul <anna.je...@nmbu.no> wrote:
> 
> Hi!
> I am generating my own datasets with sklearn.datasets.make_classification. 
> Unfortunately, I cannot figure out which of the generated features are the 
> informative ones. In my example I use “n_features=1000” and 
> “n_informative=20”. Is there any way to identify the informative features 
> after the dataset is generated?
> Thanks,
> Anna
> _______________________________________________
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
