This paper looks at how to measure the condition number of the training problem for deep networks, how this is affected by batch size and learning rate, and how this characterises performance in terms of convergence and generalisation.
[PAMI 2018 paper]
[PAMI 2018 paper]
No comments:
Post a Comment
Note: only a member of this blog may post a comment.