[scikit-learn] Issue with DecisionTreeClassifier
Nelson Liu
nfliu at uw.edu
Mon Aug 29 23:22:41 EDT 2016
estimator.tree_.value gives the constant prediction of the tree at each
node. Think of it as what the tree would output if that node was a leaf.
I don't think we have a readily available way of checking the number of
training samples of each class in a given tree node. The closest thing
easily accessible is estimator.tree_.n_node_samples. Getting finer-grained
counts of the number of samples in each class would require modifying the
source code, I think.
On Mon, Aug 29, 2016 at 8:06 PM Ibrahim Dalal via scikit-learn <
scikit-learn at python.org> wrote:
> Hi,
>
> What does the estimator.tree_.value array represent? I looked up the
> source code but not able to get what it is. I am interested in the number
> of training samples of each class in a given tree node.
>
> Thanks
>
> On Mon, Aug 29, 2016 at 9:22 PM, Andreas Mueller <t3kcit at gmail.com> wrote:
>
>>
>>
>> On 08/28/2016 03:23 PM, Nelson Liu wrote:
>>
>> That should be:
>> node indicator = estimator.tree_.decision_path(X_test)
>>
>> PR welcome :)
>>
>> Was there a reason not to make this a "plot" example?
>> Would it take too long? Not having run examples by CI is a pretty big
>> maintenance burden.
>>
>> _______________________________________________
>> scikit-learn mailing list
>> scikit-learn at python.org
>> https://mail.python.org/mailman/listinfo/scikit-learn
>>
>>
> _______________________________________________
> scikit-learn mailing list
> scikit-learn at python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20160830/a6701107/attachment.html>
More information about the scikit-learn
mailing list