Tweeted By @HamelHusain
📢 @GitHub is releasing a large dataset for natural language processing and machine learning. It's a large parallel corpus of code and natural language, with benchmarks on IR tasks. Leaderboard is hosted by @weights_biases: https://t.co/rSKbvkfX4S
— Hamel Husain (@HamelHusain) September 26, 2019