Homepage
Close
Menu

Site Navigation

  • Home
  • Archive(TODO)
    • By Day
    • By Month
  • About(TODO)
  • Stats
Close
by DeepMind on 2021-02-03 (UTC).

Why does Stochastic Gradient Descent generalise well in deep networks?

Our team shows that if the learning rate is small but finite, the mean iterate of random shuffling SGD stays close to the path of gradient flow, but on a modified loss landscape https://t.co/JUAzPujWfP pic.twitter.com/NtlbVaMfLC

— DeepMind (@DeepMind) February 3, 2021
research
by hardmaru on 2021-02-03 (UTC).

Scaling Laws for Transferhttps://t.co/m7Vh5aOTjx pic.twitter.com/5lw9GyOAp5

— hardmaru (@hardmaru) February 3, 2021
research
by ch402 on 2021-02-02 (UTC).

Turns out we can reverse engineer chunks of neural network and then write out weights by hand that reimplement it. Not sure what higher standard there is for showing we understand something. https://t.co/JKcpEolUJu

— Chris Olah (@ch402) February 2, 2021
researchdataviz
by nelsonfliu on 2021-02-02 (UTC).

Large, natural datasets are invaluable for training accurate, deployable systems, but are they required for driving modeling innovation? Can we use small, synthetic benchmarks instead? Our new paper asks this: https://t.co/b9WYYonQxi

w/ Tony Lee, @robinomial, @percyliang

(1/8) pic.twitter.com/sdhRk5UT5L

— Nelson Liu (@nelsonfliu) February 2, 2021
research
by ak92501 on 2021-02-02 (UTC).

Speech Recognition by Simply Fine-tuning BERT
pdf: https://t.co/2kit83mnj9
abs: https://t.co/qyOosTp8Ey pic.twitter.com/brIQfVxWim

— AK (@ak92501) February 2, 2021
research
by distillpub on 2021-02-02 (UTC).

Curve Circuits — A new Distill article by @nickcammarata, @gabeeegoooh, @shancarter, @csvoss, @ludwigschubert, and @ch402. This is the sixth article in the circuits thread.https://t.co/MhDgvNVRW5 pic.twitter.com/e1QIa8qL7u

— Distill (@distillpub) February 2, 2021
learningdataviz
by PyTorch on 2021-02-01 (UTC).

Learn how to use the Determined AI platform to offload common infrastructure problems such as scaling training and hyperparameter tuning when developing #PyTorch models. https://t.co/tNYbSMZ3pz

— PyTorch (@PyTorch) February 1, 2021
pytorchlearningtooltutorial
by rcalo on 2021-02-01 (UTC).

I'm beyond excited to share a new book from the @TechPolicyLab. Telling Stories: On Culturally Responsive Artificial Intelligence is a collection of short stories from around the world on the social and cultural impacts of AI. https://t.co/lJHnF7XAfL pic.twitter.com/7QglaYDnWt

— Ryan Calo (@rcalo) February 1, 2021
learningethics
by _inesmontani on 2021-02-01 (UTC).

If you want to find out what the new @spacy_io v3 is all about, check out this video I recorded with @honnibal 😇 We're walking you through some of the most exciting new features!

📺 Watch it here: https://t.co/y0UYh7py3d pic.twitter.com/6HpnvmrAco

— Ines Montani 〰️ (@_inesmontani) February 1, 2021
nlpvideolearningtutorial
by liu_mingyu on 2021-02-01 (UTC).

My awesome colleagues have now released #PyTorch version of StyleGAN2-ADA. (The initial release was in #TensorFlow )

ADA uses a clever data augmentation to help address limit sample problems in #GAN training.https://t.co/75Yttri2KS

— Ming-Yu Liu (@liu_mingyu) February 1, 2021
researchw_codecv
by TimHarford on 2021-02-01 (UTC).

A useful distinction:

Misinformation: incorrect or misleading information.

Disinformation: false information deliberately and often covertly spread in order to influence public opinion or obscure the truth.

Source: Merriam-Webster dictionary.

— Tim Harford (@TimHarford) February 1, 2021
misc
by ChristophMolnar on 2021-02-01 (UTC).

What do you call a machine learning model that perfectly predicts the training data, but does not work for unseen data?

A database

— Christoph Molnar (@ChristophMolnar) February 1, 2021
humourmisc
  • Prev
  • 115
  • 116
  • 117
  • 118
  • 119
  • …
  • Next

Tags

learning tutorial misc nlp rstats gan ethics research dataviz survey python tool security kaggle video thought bayesian humour tensorflow w_code bias dataset pytorch cv tip application javascript forecast swift golang rl jax julia gnn causal surey diffusion
© Copyright Philosophy 2018 Site Template by Colorlib