Ceshine's Data Science Tweet Collection

by karpathy on 2021-08-27 (UTC).

Badly tuned LR decay schedules are an excellent way to silently shoot yourself in the foot. Models can often look like they are converging but it's just LR getting too low too fast. FixedLR (+optional warmup) with 1 manual decay of 10X on plateau is a safe strong baseline.
— Andrej Karpathy (@karpathy) August 27, 2021

tip
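
The baseline described in the tweet above (fixed LR, optional warmup, one manual 10x decay on plateau) could be sketched roughly as follows. This is only an illustration, not code from the tweet: the function name `learning_rate`, its parameters, and the default values are hypothetical, and the warmup is assumed to be linear. `decay_step` is meant to be set by hand once the validation loss has visibly plateaued.

```python
def learning_rate(step, base_lr=3e-4, warmup_steps=1000, decay_step=None):
    """Fixed LR with optional linear warmup and a single manual 10x decay.

    All names and defaults here are illustrative assumptions.
    decay_step is picked by hand after observing a plateau; None means
    the decay has not been applied.
    """
    # Optional warmup: ramp linearly up to base_lr over warmup_steps.
    if warmup_steps > 0 and step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps
    # One manual decay: divide the LR by 10 from decay_step onward.
    if decay_step is not None and step >= decay_step:
        return base_lr / 10.0
    # Otherwise the LR stays fixed.
    return base_lr
```

Because the rate never shrinks on its own, a flat loss curve before `decay_step` reflects the model rather than a schedule that decayed too early.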


by lpachter on 2021-08-27 (UTC).

It's time to stop making t-SNE & UMAP plots. In a new preprint w/ Tara Chari we show that while they display some correlation with the underlying high-dimension data, they don't preserve local or global structure & are misleading. They're also arbitrary. 🧵 https://t.co/XkAOTKlOcs pic.twitter.com/dmFzD5RR6R
— Lior Pachter (@lpachter) August 27, 2021

research dataviz

by random_walker on 2021-08-26 (UTC).

When Netflix started using machine learning to personalize thumbnails a few years ago, the algorithm learned to target by race. Now it's learned that trashy clickbait works. The only surprise is that it's taken so long. https://t.co/fhv8BRTsRm https://t.co/iU0tdx0y6z
— Arvind Narayanan (@random_walker) August 26, 2021

ethics bias misc

by yoavgo on 2021-08-26 (UTC).

This.
Not all uses of "NLP" in a socially sensitive area are automatically bad.
It really depends on the method you are using, how you are using it, and what you will do with the results. NLP is a *measurement tool* over text. What we measure, how we interpret: that's up to us. https://t.co/byiG2kb2YK
— (((ل()(ل() 'yoav))))👾 (@yoavgo) August 26, 2021

nlp thought

by rctatman on 2021-08-25 (UTC).

debiasing & similar approaches are about *mitigating* the impact of specific latent variables that are relevant to the current application and that's genuinely the best you can hope for
— Rachael Tatman (@rctatman) August 25, 2021

nlp thought

by JuliaAngwin on 2021-08-25 (UTC).

Back when we had an office @themarkup, our wifi password was “math+tears”

Today’s investigation explains why. @eh_mah_nwel and @lkirchner use math and human stories to reveal entrenched racism in mortgage lending. https://t.co/ApkvJ61ve6
/1
— Julia Angwin (@JuliaAngwin) August 25, 2021

misc bias

by ak92501 on 2021-08-25 (UTC).

One TTS Alignment To Rule Them All
pdf: https://t.co/yQu2GWB6uw
abs: https://t.co/cAAxzOuGGg

present an alignment framework that is broadly applicable to various TTS architectures, both autoregressive and parallel pic.twitter.com/e4tpCSNwWC
— AK (@ak92501) August 25, 2021

research

by karpathy on 2021-08-24 (UTC).

TIL 😳😵‍💫😱. This single line change sped up our data loader 10% pic.twitter.com/9gIzscrgTQ
— Andrej Karpathy (@karpathy) August 24, 2021

tip python

by stanfordnlp on 2021-08-24 (UTC).

🆕 Mistral: A framework for easy large-scale language model training, built with @huggingface 🤗 Transformers. Props to @laurel_orr1, @siddkaramcheti, and team mates. The initial technical work from the Stanford Center for Research on #FoundationModels https://t.co/RlmdMtij1D
— Stanford NLP Group (@stanfordnlp) August 24, 2021

nlp tool

by ak92501 on 2021-08-24 (UTC).

How Can Increased Randomness in Stochastic Gradient Descent Improve Generalization?
pdf: https://t.co/Jsj1hpi3vB
abs: https://t.co/nEGOGZ2Z8v pic.twitter.com/weTbqMbQbF
— AK (@ak92501) August 24, 2021

research cv

by seb_ruder on 2021-08-23 (UTC).

Challenges and Opportunities in NLP Benchmarking

Recent NLP models have outpaced the benchmarks to test for them. I provide an overview of challenges and opportunities in this blog post. https://t.co/NbVfcwGX8z
— Sebastian Ruder (@seb_ruder) August 23, 2021

nlp dataset misc

by seb_ruder on 2021-08-23 (UTC).

Our RemBERT model (ICLR 2021) is finally open-source and available in 🤗 Transformers.

RemBERT is a large multilingual Transformer that outperforms XLM-R (and mT5 with similar # of params) in zero-shot transfer.

Docs: https://t.co/AKwV0UF6cT
Paper: https://t.co/TXF7qlJtUY pic.twitter.com/ytIiMOqVks
— Sebastian Ruder (@seb_ruder) August 23, 2021

research nlp w_code
