nVidia sets World Record BERT Training Time - 47mins https://t.co/VaEdfNyN14
— /MachineLearning (@slashML) February 14, 2020
Here is a @Kaggle dataset of 28,752 Project Gutenberg books that has been used for the recent @DeepMind long-range memory research. https://t.co/e0OzBD1wlf #AI #ML #deeplearning #NLP @KaggleDatasets pic.twitter.com/hsTQciAY7C
— Bojan Tunguz (@tunguz) February 11, 2020
This is great work that collects corpora and evaluates models for two extremely low-resource languages spoken in Africa, Twi and Yoruba.
— Sebastian Ruder (@seb_ruder) February 11, 2020
Link to the paper: https://t.co/Cm5YhzTbBL https://t.co/mjAvuVyPjE
The search engine of the future will know "who Jason Mraz is engaged to" not by querying some semi-manually curated semantic triplet graph, but simply running a "fact answering neural network" on the raw question. https://t.co/3MfRxQVEfN
— Eric Jang (@ericjang11) February 11, 2020
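For contrast, the "semi-manually curated semantic triplet graph" route looks roughly like the toy lookup below; the store, the relation name, and the missing value are all hypothetical, and the "fact answering neural network" alternative is sketched after the T5 thread that follows.

```python
# Purely hypothetical illustration of the curated-triple-store approach:
# a fact is only answerable if someone has entered this exact triple by hand.
triples = {
    # ("Jason Mraz", "engaged_to"): "...",  # must be curated and kept up to date manually
}

def kg_answer(subject: str, relation: str) -> str:
    # Any fact missing from the curated store is simply unanswerable.
    return triples.get((subject, relation), "unknown")

print(kg_answer("Jason Mraz", "engaged_to"))  # "unknown" until a human adds the triple
```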
We evaluated on Natural Questions, WebQuestions, and TriviaQA, outperforming all previous open-domain systems on NQ and WQ.
— Adam Roberts (@ada_rob) February 11, 2020
(3/5) pic.twitter.com/Xv23zOBDm9
New preprint: How Much Knowledge Can You Pack into the Parameters of a Language Model?
— Adam Roberts (@ada_rob) February 11, 2020
We show that T5 outperforms all previous open-domain QA systems *without using any external knowledge or context*.
Joint work w/ @colinraffel & Noam Shazeer. https://t.co/Ojg3wSUDQq
(1/5) pic.twitter.com/3adQ59LFYr
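A minimal sketch of the closed-book setup with the transformers library: the question goes in with no supporting passage, so the answer has to come out of the model weights alone. The checkpoint id and the absence of a task prefix are assumptions on my part; check the repo linked in the tweet for the actual released checkpoints and their expected input format.

```python
# Hedged sketch of closed-book QA with a T5-style seq2seq model.
# "google/t5-small-ssm-nq" is an assumed checkpoint id; the paper's released
# checkpoints (and any required task prefix) may be named differently.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "google/t5-small-ssm-nq"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# No context passage: the model must answer from its parameters alone.
question = "who is the lead singer of the band queen?"
inputs = tokenizer(question, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```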
Turing-NLG: A 17-billion-parameter language model
— hardmaru (@hardmaru) February 11, 2020
“Any model with more than 1.3B parameters cannot fit into a single GPU (even one with 32GB memory)… The resulting T-NLG model has 78 Transformer layers with a hidden size of 4256 and 28 attention heads.” https://t.co/bRjEacrZma
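The headline number checks out with the usual back-of-the-envelope Transformer parameter count: roughly 4·H² parameters per layer for the attention projections plus 8·H² for a feed-forward block with the standard 4H inner dimension, ignoring embeddings, biases, and layer norms.

```python
# Back-of-the-envelope parameter count for T-NLG from the quoted architecture.
# Assumes the standard Transformer layout with a 4x feed-forward expansion;
# embeddings, biases, and layer norms are ignored.
layers, hidden, heads = 78, 4256, 28

attention_params = 4 * hidden * hidden       # Q, K, V and output projections
ffn_params = 2 * hidden * (4 * hidden)       # two feed-forward projections
per_layer = attention_params + ffn_params    # ~12 * hidden^2

print(hidden // heads)                       # 152-dimensional attention heads
print(f"{layers * per_layer / 1e9:.1f}B")    # ~17.0B, matching the headline figure
```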
Two major methods for learning multilingual embeddings are 1. monolingual training then alignment and 2. joint training. Our #ICLR2020 paper asks "why not do both?" Result: even jointly-trained embeddings still benefit significantly from alignment: https://t.co/HanXbBhDko pic.twitter.com/7ozKHjSmAz
— Graham Neubig (@gneubig) February 7, 2020
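The alignment step is classically an orthogonal Procrustes rotation fit on a bilingual seed dictionary. The numpy sketch below shows that generic recipe on stand-in data; it is not a reproduction of the paper's exact procedure.

```python
import numpy as np

# Generic orthogonal-Procrustes alignment of two embedding spaces (not the
# paper's exact setup). X and Y hold embeddings for n translation pairs.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 300))  # source-language embeddings (stand-in data)
Y = rng.normal(size=(1000, 300))  # target-language embeddings (stand-in data)

# Solve min_W ||X W - Y||_F over orthogonal W via the SVD of X^T Y.
U, _, Vt = np.linalg.svd(X.T @ Y)
W = U @ Vt

X_aligned = X @ W  # source embeddings rotated into the target space
```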
New dataset: 4.5B parallel sentences in 576 language pairs. https://t.co/nWY5egojho
— Yann LeCun (@ylecun) February 7, 2020
TyDi QA is a new multilingual dataset for information-seeking question answering featuring 11 Typologically Diverse languages and over 200k QA pairs. Learn more and start experimenting with the data and code β https://t.co/azeYPUvqXZ
— Google AI (@GoogleAI) February 6, 2020
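For quick experiments the data may also be mirrored on the Hugging Face datasets hub; the dataset id and config name below are assumptions, so fall back to the official links in the tweet if they do not resolve.

```python
# Hedged sketch: load TyDi QA via the datasets library.
# The id "tydiqa" and config "primary_task" are assumptions; the canonical
# data and baseline code live behind the link in the tweet above.
from datasets import load_dataset

tydiqa = load_dataset("tydiqa", "primary_task")
print(tydiqa)                       # splits and sizes
print(sorted(tydiqa["train"][0]))   # field names of one example
```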
New models from:
— Julien Chaumond (@julien_c) February 3, 2020
- @Wietsedv (Dutch BERT),
- @douwekiela at Facebook AI (MMBT, multi-modal model)
- @formiel, @laurent_besacie et al. (FlauBERT, French-trained XLM-like)
- @loretoparisi, @simofrancia et al. at @musixmatch (UmBERTo, Italian CamemBERT-like) pic.twitter.com/qgJ0fwqiwC
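All of these ship through the transformers model hub, so loading any of them is the same couple of lines. The model id below (for the Dutch BERT) is written from memory and may have moved, so check the hub listing first.

```python
# Hedged sketch: load one of the newly added community models with transformers.
# The model id is an assumption from memory; browse https://huggingface.co/models
# for the current Dutch BERT / FlauBERT / UmBERTo identifiers.
from transformers import AutoModel, AutoTokenizer

model_id = "wietsedv/bert-base-dutch-cased"  # assumed id for the Dutch BERT
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

inputs = tokenizer("Dit is een Nederlandse zin.", return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (1, seq_len, hidden_size)
```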
There is a little-known project called News Crawl from the folks at @CommonCrawl that has a giant real-time S3 archive in WARC format of articles from over 50K news publication feeds: https://t.co/RZUIx7D3Zo GitHub code is also here: https://t.co/DBSiIdR0BE
— Peter Skomoroch (@peteskomoroch) January 30, 2020
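Since the archive is plain WARC, any WARC reader works. Here is a minimal sketch with the warcio package; the file path is a placeholder, so grab a real archive key from the bucket linked in the tweet first.

```python
# Minimal sketch for iterating over a downloaded News Crawl WARC file with warcio.
# "news-crawl-sample.warc.gz" is a placeholder; list the real keys in the S3
# bucket linked in the tweet and download one of those archives first.
from warcio.archiveiterator import ArchiveIterator

with open("news-crawl-sample.warc.gz", "rb") as stream:
    for record in ArchiveIterator(stream):
        if record.rec_type == "response":  # fetched article pages
            url = record.rec_headers.get_header("WARC-Target-URI")
            html = record.content_stream().read()
            print(url, len(html), "bytes")
```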