[scikit-learn] Multi-Output Decision Trees for mixed classification-regerssion problems

Evgeniya Korneva evgeniya.korneva at kuleuven.be
Mon Feb 12 06:40:03 EST 2018

Dear all,

For my research, I'm working with multi-output decision trees. In the current sklearn implementation, a tree can predict either several numerical or several categorical targets simultaneously, but not a mixture of those. However, predicting various targets jointly is often beneficial both in terms of speed and accuracy. Because of that, I'm willing to add this functionality.

It seems that the only thing to be done is to implement a new node splitting criteria that handles a mixture of nominal and numerical attributes, and then define a new class of models (such as DecisionTreeRegressor or

DecisionTreeClassifier, but for mixed output). However, since I'm not an experienced sklearn contributor, I am looking for any hints on how to implement this in effective way, re-using as much functionality already available as possible.

Your advice is very welcome.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20180212/98e7eb01/attachment.html>

More information about the scikit-learn mailing list