I wanted to highlight an intriguing paper I presented at a journal club
recently:
* Samuel L. Smith, Benoit Dherin, David Barrett, Soham De (2021). On the Origin
of Implicit Regularization in Stochastic Gradient Descent.
[https://openreview.net/forum?id=rq_Qr0c1Hyo]
There's actually a related paper that came