<html>

<head>

<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">

<style type="text/css" style="display:none"><!--P{margin-top:0;margin-bottom:0;} P{margin-top:0;margin-bottom:0;}--></style>

</head>

<body dir="ltr" style="font-size:12pt;color:#000000;background-color:#FFFFFF;font-family:Calibri,Arial,Helvetica,sans-serif;">

<p><br>

</p>

<div dir="ltr" style="font-size:12pt; color:#000000; background-color:#FFFFFF; font-family:Calibri,Arial,Helvetica,sans-serif">

<div id="divRplyFwdMsg" dir="ltr"></div>

Dear all,

<div>

<p><br>

</p>

<p>For my research, I'm working with multi-output decision trees. In the current sklearn implementation, a tree can predict either several numerical or several categorical targets simultaneously, but not a mixture of those. However, predicting various targets

 jointly is often beneficial both in terms of speed and accuracy. Because of that, I'm willing to add this functionality.<br>

</p>

<p><br>

</p>

<p>It seems that the only thing to be done is to implement a new node splitting criteria that handles a mixture of nominal and numerical attributes, and then define a new class of models (such as DecisionTreeRegressor or

</p>

<p>DecisionTreeClassifier, but for mixed output). However, since I'm not an experienced sklearn contributor, I am looking for any hints on how to implement this in effective way, re-using as much functionality already available as possible.

</p>

<p><br>

</p>

<p>Your advice is very welcome.<br>

</p>

<p><br>

</p>

<p>Best,<br>

Evgeniya<br>

</p>

</div>

</div>

</body>

</html>