[scikit-learn] purpose of test: check_classifiers_train

Guillaume Lemaître g.lemaitre58 at gmail.com
Wed Oct 11 13:09:21 EDT 2017


Not sure 100% but this is an integration/sanity check since all classifiers
are supposed to predict quite well and data used to train.
This is true that 83% is empirical but it allows to spot any changes done
in the algorithms even if the unit tests are passing for some reason.

On 11 October 2017 at 18:52, Michael Capizzi <mcapizzi at email.arizona.edu>
wrote:

> I’m wondering if anyone can identify the purpose of this test:
> check_classifiers_train(), specifically this line:
> https://github.com/scikit-learn/scikit-learn/blob/ef5cb84a/sklearn/utils/
> estimator_checks.py#L1106
>
> My custom classifier (which I’m hoping to submit to scikit-learn-contrib)
> is failing this test:
>
>   File "/Users/mcapizzi/miniconda3/envs/nb_plus_svm/lib/python3.6/site-packages/sklearn/utils/estimator_checks.py", line 1106, in check_classifiers_train
>     assert_greater(accuracy_score(y, y_pred), 0.83)
> AssertionError: 0.31333333333333335 not greater than 0.83
>
> And while it’s disturbing that my classifier is getting 31% accuracy
> when, clearly, the test writer expects it to be in the upper-80s, I’m not
> sure I understand why that would be a test condition.
>
> Thanks for any insight.
>>
> _______________________________________________
> scikit-learn mailing list
> scikit-learn at python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>
>


-- 
Guillaume Lemaitre
INRIA Saclay - Parietal team
Center for Data Science Paris-Saclay
https://glemaitre.github.io/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20171011/5bc1edd4/attachment.html>


More information about the scikit-learn mailing list