Hi
As was recently mentioned in PR #18594, the problem with the Boston
housing dataset does not go away just because we remove it from
scikit-learn. On the contrary, it is a valuable dataset to show and
teach bias and discrimination - issue #16715 is still waiting for
someone to write an exa
Most of those are not talking about the ethical issues of the dataset.
Let's talk about the alternatives we have:
Keep the loader, but raise a warning:
- this will result in most people not changing their code/material and, IMO,
mostly ignoring the warning. Some
people may see the warning and care ab
On Tue, Oct 13, 2020 at 16:19, Adrin wrote:
>
> Isn't the Boston dataset available through openml? Maybe here:
> https://www.openml.org/d/531
>
> I'm happy to have the dataset out there on openml, and for any material that
> addresses some of the issues with it.
> But for educational purposes,
Isn't the Boston dataset available through openml? Maybe here:
https://www.openml.org/d/531
I'm happy to have the dataset out there on openml, and for any material
that addresses some of the issues with it.
But for educational purposes, we don't need to have the dataset in the
package as long as u
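
For anyone who wants to try the openml route mentioned above, here is a minimal sketch. It assumes that data id 531 (taken from the link above, which was itself only offered as a "maybe") really is the Boston housing dataset, and that network access is available when the function is called. The helper name `load_boston_from_openml` is made up for illustration; only `fetch_openml` is an actual scikit-learn function.

```python
def load_boston_from_openml(data_id=531):
    """Hypothetical helper: fetch the dataset from openml instead of the package.

    The default data_id comes from https://www.openml.org/d/531 and is an
    assumption; check the dataset description before using it in material.
    """
    # Imported lazily so that defining the helper needs neither scikit-learn
    # nor a network connection; both are needed only when it is called.
    from sklearn.datasets import fetch_openml

    # as_frame=True asks for pandas objects; the returned Bunch then exposes
    # .data, .target and .frame.
    return fetch_openml(data_id=data_id, as_frame=True)
```

Educational material could call this helper explicitly, next to a discussion of the dataset's problems, rather than relying on a loader shipped in the package.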
Thanks for your input, this is also an extension I was thinking of.
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn
I very much like your paragraph, Olivier. I might recommend additionally
raising it as a warning when calling the data creation function.
For reference, in scikit-image when we removed Lena we raised a warning and
returned an alternative (the now-famous `data.astronaut()`) for two versions,
bef
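
To make the "warn and return an alternative" idea concrete, here is a rough sketch using only the standard library. It is not the scikit-image or scikit-learn code; the function names, the warning text and the stand-in data are all hypothetical, and `FutureWarning` is just one reasonable choice of category.

```python
import warnings


def load_replacement_dataset():
    """Hypothetical stand-in for whatever alternative dataset is chosen."""
    return {"name": "replacement", "data": [[0.0, 1.0], [2.0, 3.0]]}


def load_removed_dataset():
    """Hypothetical loader for a dataset that is being phased out.

    For a transition period it warns on every call and returns the
    alternative dataset instead of the original one.
    """
    warnings.warn(
        "This dataset has been removed because of ethical concerns; "
        "an alternative dataset is returned instead. This loader will "
        "be removed entirely in a later release.",
        FutureWarning,
        stacklevel=2,  # attribute the warning to the caller's line
    )
    return load_replacement_dataset()
```

The warning fires when the data creation function is called, so existing tutorials keep running during the transition while their authors are told what changed.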