Tweeted By @huggingface
Want speedy transformers models w/o a GPU?! 🧐
Starting with transformers v3.1.0 your models can now run at the speed of light on commodity CPUs thanks to ONNX Runtime quantization!🚀. Check out our 2nd blog post with ONNX Runtime on the subject! 🔥 https://t.co/OzPZ8y6XuW
— Hugging Face (@huggingface) September 1, 2020