[scikit-learn] Missing data and decision trees

Stuart Reynolds stuart at stuartreynolds.net
Thu Oct 13 14:14:17 EDT 2016


I'm looking for a decision tree and random forest (RF) implementation that
supports missing data (without imputation) -- ideally in Python, Java/Scala
or C++.

It seems that scikit-learn's decision tree algorithm doesn't allow this --
which is disappointing, because it's one of the few methods that should be
able to sensibly handle problems with a high proportion of missing values.
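
To make concrete what "handling missing data without imputation" could mean
for a tree, here is a minimal sketch of one possible approach: for each
candidate threshold on a feature, try sending the missing (NaN) rows to the
left child and then to the right child, and keep whichever gives the lower
weighted Gini impurity. This is only an illustration under those assumptions,
not scikit-learn's implementation; the function names (gini, weighted_gini,
best_split_with_missing) and the toy data are hypothetical.

```python
import math


def gini(labels):
    """Gini impurity of a list of class labels (0.0 for an empty list)."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def weighted_gini(left, right):
    """Impurity of a split, weighted by the size of each child."""
    n = len(left) + len(right)
    return (len(left) * gini(left) + len(right) * gini(right)) / n


def best_split_with_missing(values, labels):
    """Search one feature for the best (impurity, threshold, missing_goes_left).

    Missing values are represented as NaN. Thresholds are midpoints between
    consecutive observed values. Returns None if fewer than two distinct
    observed values exist.
    """
    observed = sorted({v for v in values if not math.isnan(v)})
    thresholds = [(a + b) / 2 for a, b in zip(observed, observed[1:])]
    best = None
    for t in thresholds:
        # Hypothetical policy: try routing all NaN rows left, then right.
        for missing_goes_left in (True, False):
            left, right = [], []
            for v, y in zip(values, labels):
                go_left = missing_goes_left if math.isnan(v) else v <= t
                (left if go_left else right).append(y)
            score = weighted_gini(left, right)
            if best is None or score < best[0]:
                best = (score, t, missing_goes_left)
    return best


# Toy data (made up for illustration): the NaN rows all belong to class 1.
nan = float("nan")
values = [1.0, 2.0, nan, 8.0, 9.0, nan]
labels = [0, 0, 1, 1, 1, 1]

impurity, threshold, missing_goes_left = best_split_with_missing(values, labels)
print(impurity, threshold, missing_goes_left)
```

In this toy case the search can use the missingness itself as signal: the
NaN rows are routed to the same child as the large values, with no imputed
value ever being invented for them.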

Are there plans to allow missing data in scikit-learn's decision trees?

Also, is there any particular reason why missing values weren't supported
originally (e.g. because they integrate poorly with other features)?

Regards
- Stuart