Tweeted By @huggingface
Who needs floats? I-BERT doesn't!
— Hugging Face (@huggingface) March 22, 2021
I-BERT: A quantized Transformer with int-8 *only*
Get the best parameters with Transformers and use in TensorRT for a 4x (!!) speedup!
Contributed by @sehoonkim418, @amir__gholami @ZheweiYao
Try it on the hub: https://t.co/00w4evcRUe pic.twitter.com/eM1LJKgGAX