Bayesian model averaging mitigates double descent! We have just posted this new result in section 7 of our paper on Bayesian deep learning with @Pavel_Izmailov: https://t.co/midasGNPYn. The result highlights the importance of *multi-modal* marginalization with Multi-SWAG. 1/3 pic.twitter.com/ZbhxGdjW5I
— Andrew Gordon Wilson (@andrewgwils) April 28, 2020