Tag - dataset

by ak92501 on 2022-01-13 (UTC).

Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents
abs: https://t.co/ehdybJKTr5
project page: https://t.co/HnfvJnYcSM pic.twitter.com/6Q2vHRfCZm
— AK (@ak92501) January 13, 2022

dataset cv

by Google on 2021-09-30 (UTC).

We’re using KaoKore, a machine learning-friendly dataset, to decipher cursive and illustrations in historical Japanese art. Learn how machine learning can be used for humanities research and contribute to cultural preservation. https://t.co/nYSvuehWoR
— Boo-gle 👻 (@Google) September 30, 2021

dataset cv

by OurWorldInData on 2021-09-30 (UTC).

We make all our work publicly available so that researchers and journalists can do their best work every day.

→ https://t.co/qdK1aoGr9U

We bring together the COVID data from around the world every day. Thousands of articles get written based on our team's daily work. pic.twitter.com/OSxY8mzb3Y
— Our World in Data (@OurWorldInData) September 30, 2021

dataset

by ryanjgallag on 2021-09-27 (UTC).

Today I’m open sourcing my code for working with Twitter data

It's designed to make advanced studies of social media easier by coordinating multiple API queries (stream, search, convos, quotes, user timelines) and organizing them using PostgreSQL https://t.co/2yYhfr612v

1/
— Ryan J. Gallagher (on the job market!) (@ryanjgallag) September 27, 2021

tool dataset

by rasbt on 2021-09-16 (UTC).

What is your favorite tool for labeling data? Labelme (for image data) came to mind, but then going down the rabbit hole of this question, I learned that there is an entire "awesome-" GitHub repo of data labeling tools: https://t.co/w7ZApH9hT1
— Sebastian Raschka (@rasbt) September 16, 2021

dataset tool

by borisdayma on 2021-09-15 (UTC).

For downloading large image datasets (1M+), I highly recommend https://t.co/U27VlPBfUK from @rom1504

You can even monitor performance and download errors with @weights_biases pic.twitter.com/ZvrHh6B8O0
— Boris Dayma 🥑 (@borisdayma) September 15, 2021

dataset tool cv

by ak92501 on 2021-09-12 (UTC).

LAION-400M: open-source dataset of 400 million image-text pairs
project page: https://t.co/IA8aNpXZ6a pic.twitter.com/f5IoLESnRx
— AK (@ak92501) September 12, 2021

dataset cv

by ak92501 on 2021-09-08 (UTC).

Datasets: A Community Library for Natural Language Processing
abs: https://t.co/xEpY9oQ2a5
github: https://t.co/HvY6Nlf41c

650+ unique datasets, 250+ contributors, and has helped support a variety of novel cross-dataset research projects and shared tasks pic.twitter.com/AdlB21Hu2c
— AK (@ak92501) September 8, 2021

dataset nlp w_code

by seb_ruder on 2021-08-23 (UTC).

Challenges and Opportunities in NLP Benchmarking

Recent NLP models have outpaced the benchmarks to test for them. I provide an overview of challenges and opportunities in this blog post. https://t.co/NbVfcwGX8z
— Sebastian Ruder (@seb_ruder) August 23, 2021

nlp dataset misc

by seb_ruder on 2021-08-06 (UTC).

QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension

This is an excellent overview of the QA landscape in NLP with numerous insightful observations by @annargrs @nlpmattg @IAugenstein. https://t.co/xD4Dmm886o pic.twitter.com/4oHABAPe40
— Sebastian Ruder (@seb_ruder) August 6, 2021

nlp dataset

by GoogleAI on 2021-07-28 (UTC).

Today we are releasing the Open Buildings Dataset, a new open-source dataset containing the locations and footprints of >500M buildings with coverage across Africa, which can support numerous scientific and humanitarian applications. Read more at https://t.co/ZAFeD3mWQt pic.twitter.com/hy9PVKx0Hy
— Google AI (@GoogleAI) July 28, 2021

dataset

by topepos on 2021-07-15 (UTC).

There's a new version of the {modeldata} package on CRAN. https://t.co/p3DZGk3Xjn

There is a great data set (tate_text).

We're also going to remove the two OkCupid data sets in the next version. #rstats pic.twitter.com/sEU9M3srUU
— Max Kuhn (@topepos) July 15, 2021

dataset tool
