Tweeted By @PyTorch
Stochastic Weight Averaging: a simple procedure that improves generalization over SGD at no additional cost.
— PyTorch (@PyTorch) April 29, 2019
Can be used as a drop-in replacement for any other optimizer in PyTorch.
Read more: https://t.co/IRhz40AZKU
guest blogpost by @Pavel_Izmailov and @andrewgwils pic.twitter.com/yU0HKDYr7v