Tweeted By @dadabots
Just finished assembling #DadaGP v1.0 --- a tokenized symbolic music dataset of 26181 GuitarPro songs. Totaling 115M tokens, about as big as WikiText-103. Includes GuitarPro5 encoder/decoder. Who wants to train a generator? #nlp #mir #languagemodel #transformer @huggingface pic.twitter.com/ocyrZwYHOg
— dadabots (@dadabots) August 29, 2020