by yvespeirsman on 2019-08-27 (UTC).

Transfer Learning works great for Natural Language Processing, but sometimes its large models can be hard to handle. At NLP Town we used model distillation to train @spacy_io text classifiers that rival BERT for sentiment analysis https://t.co/mLvJnYou0R #deeplearning #NLProc pic.twitter.com/lcl3bQ9S8F

— Yves Peirsman (@yvespeirsman) August 27, 2019
Tags: nlp, research
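The distillation approach mentioned in the tweet follows the general teacher-student recipe: a large model such as a fine-tuned BERT produces soft class probabilities for a pool of texts, and a much smaller classifier is trained to match those probabilities. The sketch below illustrates that recipe in PyTorch under stated assumptions; the StudentClassifier, distillation_loss, temperature value, and toy batch are illustrative placeholders, not NLP Town's actual spaCy pipeline.

```python
# Minimal sketch of knowledge distillation for text classification.
# Assumption: a large "teacher" (e.g. a fine-tuned BERT) already exists and
# provides logits; a small "student" is trained to match its soft predictions.
# All model and data names here are placeholders for illustration.
import torch
import torch.nn as nn
import torch.nn.functional as F

class StudentClassifier(nn.Module):
    """Small bag-of-embeddings classifier standing in for a compact model."""
    def __init__(self, vocab_size=30000, embed_dim=128, n_classes=2):
        super().__init__()
        self.embed = nn.EmbeddingBag(vocab_size, embed_dim)
        self.fc = nn.Linear(embed_dim, n_classes)

    def forward(self, token_ids, offsets):
        return self.fc(self.embed(token_ids, offsets))

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence between temperature-softened teacher and student distributions."""
    t = temperature
    soft_teacher = F.softmax(teacher_logits / t, dim=-1)
    log_student = F.log_softmax(student_logits / t, dim=-1)
    return F.kl_div(log_student, soft_teacher, reduction="batchmean") * (t * t)

student = StudentClassifier()
optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

# One training step on a toy batch; in practice teacher_logits come from
# running the large model over (possibly unlabelled) text.
token_ids = torch.randint(0, 30000, (20,))   # 20 tokens total
offsets = torch.tensor([0, 10])              # two documents of 10 tokens each
teacher_logits = torch.randn(2, 2)           # placeholder teacher outputs

optimizer.zero_grad()
loss = distillation_loss(student(token_ids, offsets), teacher_logits)
loss.backward()
optimizer.step()
```

In practice the student can also be trained against a mix of the teacher's soft labels and any available gold labels; the temperature controls how much of the teacher's uncertainty the student is asked to reproduce.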
by chipro on 2019-08-28 (UTC).

Now that we know it's possible to achieve comparable results to BERT using only 66M parameters, can someone find a way to train a 66M param model from scratch instead of distilling? https://t.co/ycJjMwSwsr

— Chip Huyen (@chipro) August 28, 2019
Tags: nlp, research, w_code
