[scikit-learn] Pandas DataFrame output is now available for all sklearn transformers

Reshama Shaikh reshama.stat at gmail.com
Thu Oct 27 15:39:58 EDT 2022


Hello,

Pandas DataFrame output is now available for all sklearn transformers (in
dev version 1.2)! This will make running pipelines on data frames much
easier and provide better ways to track feature names.

There is a 14-minute video with examples, some more information and some
FAQs answered at the end [a]. There is also a worked example in the
documentation [b] and a LinkedIn post [c].

This is one of the biggest improvements in scikit-learn in a long time and
we'd love your feedback! Please try out the nightly build and let us know
whether this helps your use cases and about any bugs you find.

A special thanks to the maintainers: Thomas J. Fan, Guillaume Lemaitre,
Christian Lorentzen!

[a] video
https://youtu.be/J4KCu9WWDTo

[b] example
https://scikit-learn.org/dev/auto_examples/miscellaneous/plot_set_output.html#sphx-glr-auto-examples-miscellaneous-plot-set-output-py

[c] LinkedIn post
https://www.linkedin.com/feed/update/urn:li:activity:6987027021608460289/?actorCompanyId=79865351

---
Reshama Shaikh
she/her