[scikit-learn] How to tune parameters when using partial_fit

Brown J.B. jbbrown at kuhp.kyoto-u.ac.jp
Tue Jun 11 05:34:07 EDT 2019


>
> I'm curious about is there any suitable/general way to tune parameters
> batch by batch?
> Because the distribution is not easy to know when the dataset is too large
> to load into memory.
>

Repeated subsampling to estimate a distribution is one alternative.
Not guaranteed to match the global distribution, but you should get a
reasonable estimate with enough repetitions.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20190611/2803050b/attachment-0001.html>


More information about the scikit-learn mailing list