Homepage
Close
Menu

Site Navigation

  • Home
  • Archive(TODO)
    • By Day
    • By Month
  • About(TODO)
  • Stats
Close
by seb_ruder on 2021-02-24 (UTC).

Recent Advances in Language Model Fine-tuning

New blog post that takes a closer look at fine-tuning, the most common way large pre-trained language models are used in practice.https://t.co/A5KYoq5zuw

β€” Sebastian Ruder (@seb_ruder) February 24, 2021
learningsurveynlp
by slashML on 2021-02-23 (UTC).

20 hours of new lectures on Deep Learning and Reinforcement Learning with lots of examples https://t.co/3cXv2E0rcd

β€” /MachineLearning (@slashML) February 23, 2021
rllearningvideo
by moyix on 2021-02-23 (UTC).

I made a silly game: try to guess if a C/C++ code snippet is real or GPT2-generated: https://t.co/Rgfk7Hw0h0

β€” Brendan Dolan-Gavitt (@moyix) February 23, 2021
nlpapplication
by danijarh on 2021-02-23 (UTC).

Excited to present Clockwork VAEs for video prediction!

Clockwork VAEs (CW-VAEs) leverage hierarchies of latent sequences, where higher levels tick slower. They learn long-term deps across 1000 frames, semantically separate content, and outperform strong video models.

πŸ‘‡ Thread pic.twitter.com/Cn11MAKjIB

β€” Danijar Hafner (@danijarh) February 23, 2021
researchcv
by hmason on 2021-02-22 (UTC).

This is the AutoML debate all over again. No, you can't replace your data scientists with AutoML code -- what are you going to do when it doesn't work?

Same for prompt engineering vs ML engineering. If you're building these systems you need to understand them. https://t.co/nj1kQTetiw

β€” Hilary Mason (@hmason) February 22, 2021
thoughtmisc
by huggingface on 2021-02-22 (UTC).

The new SOTA is in Transformers! DeBERTa-v2 beats the human baseline on SuperGLUE and up to a crazy 91.7% dev accuracy on MNLI task.

Beats T5 while 10x smaller!

DeBERTa-v2 contributed by @Pengcheng2020 from @MSFTResearch

Try it directly on the hub: https://t.co/HhlL5WrJxp pic.twitter.com/fcUUCiKE0z

β€” Hugging Face (@huggingface) February 22, 2021
researchtoolw_codenlp
by skyetetra on 2021-02-22 (UTC).

Want to use Hugo for your website but hate that you can’t get the themes to look *exactly* how you want them? Check out my new blog post: Hugo for Fussy People! In it I go through how to turn a blank theme into precisely what you want. Fuss away!https://t.co/hEGZA7rgoH

β€” Dr. Jacqueline Nolis (@skyetetra) February 22, 2021
tutorialtool
by fchollet on 2021-02-22 (UTC).

For best results, fall in the love with the process, not the result

β€” FranΓ§ois Chollet (@fchollet) February 22, 2021
miscthought
by randal_olson on 2021-02-21 (UTC).

Frequency of letters in English words and where they occur in the word. #dataviz

Source: https://t.co/2rPRLDI1Om pic.twitter.com/T1hI4CLqD3

β€” Randy Olson (@randal_olson) February 21, 2021
dataviz
by karpathy on 2021-02-21 (UTC).

Taming Transformers for High-Resolution Image Synthesis https://t.co/6zdyT0HaR0 impressive work/results! (also fun to see a shoutout and my minGPT code used for the transformer :)) pic.twitter.com/cApDT7Yf67

β€” Andrej Karpathy (@karpathy) February 21, 2021
researchcvw_code
by Al_Grigor on 2021-02-21 (UTC).

8 reasons machine learning projects fail - by @elenasamuylova

πŸ”Έ Doing ML for wrong reasons
πŸ”Έ ML not needed
πŸ”Έ Bad data
πŸ”Έ Poor problem framing
πŸ”Έ Model β‰  product
πŸ”Έ Bad infrastructure
πŸ”Έ No trust from stakeholders
πŸ”Έ Production failures

Solution? πŸ‘‰ https://t.co/mvs7sJyxDe pic.twitter.com/poTAzwWT4b

β€” Alexey Grigorev (@Al_Grigor) February 21, 2021
miscthoughttiplearning
by antgoldbloom on 2021-02-21 (UTC).

.@GradioML is a neat library for interacting with a trained model. It's useful for debugging and for giving collaborators the an easy way to interact with the model.

Here's a notebook to try it:https://t.co/cwzzdpCTbZ
(Hit "Copy and Edit" and then run the notebook.) pic.twitter.com/FsVKULbLaV

β€” Anthony Goldbloom (@antgoldbloom) February 21, 2021
learningtooldataviztutorial
  • Prev
  • 107
  • 108
  • 109
  • 110
  • 111
  • …
  • Next

Tags

learning tutorial misc nlp rstats gan ethics research dataviz survey python tool security kaggle video thought bayesian humour tensorflow w_code bias dataset pytorch cv tip application javascript forecast swift golang rl jax julia gnn causal surey diffusion
Β© Copyright Philosophy 2018 Site Template by Colorlib