[scikit-learn] [GSoC 2017] First Draft, request for suggestions - Improve Online Learning of Linear Models.
Andreas Mueller
t3kcit at gmail.com
Wed Mar 15 10:48:23 EDT 2017
On 03/15/2017 04:48 AM, Karan Desai wrote:
> 4. About a tool to anneal learning rate: I suggest a new approach to
> look at this - as a callback. I searched through the documentation and
> I could not find this way of handling tidbits during training of
> models. We should be able to provide a callback to the constructor of
> a linear model which can do any dedicated job after every epoch, be it
> learning rate annealing, saving model checkpoint, getting custom
> verbose output, or as creative as uploading data to server for real
> time plots on any website.
There has been some effort on doing adagrad but it was ultimately
discontinued, I think.
There was quite a bit of complexity to handle.
The problem with callbacks is that for callbacks on each iteration to be
feasible, they need to be cython functions.
Otherwise they will be too slow. You could do python callbacks, but they
could not be called at every iteration, and so
they wouldn't be suitable to implement something like adagrad or adam.
Best,
Andy
More information about the scikit-learn
mailing list