Tweeted By @rasbt
ZeRO & DeepSpeed: New system optimizations enable training models with over 100 billion parametershttps://t.co/PbXx7zrtZm pic.twitter.com/W5xMFsfH0U
— Sebastian Raschka (@rasbt) February 11, 2020
ZeRO & DeepSpeed: New system optimizations enable training models with over 100 billion parametershttps://t.co/PbXx7zrtZm pic.twitter.com/W5xMFsfH0U
— Sebastian Raschka (@rasbt) February 11, 2020