Hello, Pandas DataFrame output is now available for all sklearn transformers (in dev version 1.2)! This will make running pipelines on data frames much easier, and provides better ways to track feature names.
There is a 14-minute video with examples, some more information and some FAQs answered at the end [a]. This is one of the biggest improvements in scikit-learn in a long time and we'd love your feedback! Please try out the nightly built and give it a go. We'd love to hear both about whether this helps your use cases and any bugs you find. A special thanks to the maintainers: Thomas J. Fan, Guillaume LeMaitre, Christian Lorentzen ! [a] video https://youtu.be/J4KCu9WWDTo [b] example https://scikit-learn.org/dev/auto_examples/miscellaneous/plot_set_output.html#sphx-glr-auto-examples-miscellaneous-plot-set-output-py [c] LinkedIn post https://www.linkedin.com/feed/update/urn:li:activity:6987027021608460289/?actorCompanyId=79865351 --- Reshama Shaikh she/her
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn