Tweeted By @deliprao
The LM (a sequence model) used in ULMFit was trained on an English corpus with stopwords intact. So by throwing away the stopwords you’re creating (or worsening) a covariate shift.
— Delip Rao (@deliprao) November 30, 2018
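
A toy illustration of the point, not taken from the tweet and not involving ULMFiT itself: dropping stopwords changes the input distribution relative to text that keeps them. The stopword list, the example sentence, and the use of total variation distance between unigram distributions as a stand-in for "shift" are all assumptions made for this sketch.

```python
from collections import Counter

# Hypothetical, tiny stopword list used only for this illustration.
STOPWORDS = {"the", "a", "was", "on", "with", "by", "so", "you", "or", "an"}


def unigram_dist(tokens):
    """Return the relative frequency of each token."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {tok: c / total for tok, c in counts.items()}


def total_variation(p, q):
    """Total variation distance between two discrete distributions."""
    vocab = set(p) | set(q)
    return 0.5 * sum(abs(p.get(t, 0.0) - q.get(t, 0.0)) for t in vocab)


# Text resembling what a language model sees during pretraining (stopwords intact).
text = "the model was trained on a corpus with the stopwords intact"
tokens = text.split()

# The same text after stopword removal, as it would be fed in downstream.
filtered = [t for t in tokens if t not in STOPWORDS]

# A distance of 0 would mean identical unigram distributions.
shift = total_variation(unigram_dist(tokens), unigram_dist(filtered))
print(filtered)
print(round(shift, 3))
```

A sequence model is sensitive to more than unigram frequencies (word order and context change too), so this measure, if anything, understates the mismatch.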