ZeRO & DeepSpeed: New system optimizations enable training models with over 100 billion parameters https://t.co/PbXx7zrtZm pic.twitter.com/W5xMFsfH0U
— Sebastian Raschka (@rasbt) February 11, 2020
Nice work and the accompanying library/codebase for model-parallelism in PyTorch looks really sweet! 👉 https://t.co/yqzaf5glBa https://t.co/sg4igQV6xI
— Thomas Wolf (@Thom_Wolf) February 11, 2020