[scikit-learn] Categorical handling
georg.kf.heiler at gmail.com
Thu Aug 17 07:50:33 EDT 2017
How can I properly handle categorical values in scikit-learn? I am looking for:
- scikit-learn style fit/transform methods to encode the labels of
categorical features of X
- should handle unseen labels
- should be faster than running a label encoder manually for each fold
and manually checking whether the label was already seen in the training
data, i.e. what I currently do
(https://gist.github.com/geoHeil/5caff5236b4850d673b2c9b0799dc2ce)
- only some columns are categorical, and only these should be converted
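
To make the requirements above concrete, here is a minimal sketch of the kind of encoder I have in mind. It is pure Python and only follows the fit/transform convention. The class name CategoricalColumnEncoder, the columns argument, and the unknown_value sentinel are made up for illustration and are not part of scikit-learn. Labels that were not seen during fit are mapped to the sentinel instead of raising an error, and only the listed columns are touched.

```python
class CategoricalColumnEncoder:
    """Hypothetical helper (not a scikit-learn class) with fit/transform."""

    def __init__(self, columns, unknown_value=-1):
        self.columns = columns              # indices of the categorical columns
        self.unknown_value = unknown_value  # code assigned to unseen labels

    def fit(self, X, y=None):
        # Learn one label -> integer mapping per categorical column.
        self.mappings_ = {}
        for col in self.columns:
            labels = sorted({row[col] for row in X})
            self.mappings_[col] = {label: i for i, label in enumerate(labels)}
        return self

    def transform(self, X):
        # Copy each row and replace only the categorical entries;
        # labels not seen during fit become unknown_value.
        out = []
        for row in X:
            new_row = list(row)
            for col, mapping in self.mappings_.items():
                new_row[col] = mapping.get(row[col], self.unknown_value)
            out.append(new_row)
        return out

    def fit_transform(self, X, y=None):
        return self.fit(X).transform(X)


# Columns 0 and 2 are categorical, column 1 is numeric and left untouched.
X_train = [["red", 1.0, "S"], ["blue", 2.5, "M"], ["red", 0.3, "L"]]
X_test = [["blue", 4.0, "XL"], ["green", 1.2, "S"]]  # "XL" and "green" are unseen

encoder = CategoricalColumnEncoder(columns=[0, 2])
train_encoded = encoder.fit_transform(X_train)
test_encoded = encoder.transform(X_test)
```

Fitting once per training fold and looking labels up in a dict avoids the manual "was this label seen" check for every value. Whether a single sentinel code is the right treatment for unseen labels depends on the downstream model, so take that part as an assumption.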