Ceshine's Data Science Tweet Collection

by math_rachel on 2019-09-04 (UTC).

Essay grading software (used in 21 states) focuses on metrics like sentence length, vocab, spelling, & subject-verb agreement, but ignores hard-to-measure aspects like creativity.

Meaningless gibberish essays created with sophisticated words score well.https://t.co/ESgE8WSRO2 pic.twitter.com/sxC37KiH3Q
— Rachel Thomas (@math_rachel) September 4, 2019

nlp bias

by math_rachel on 2019-09-04 (UTC).

E-rater gave students from China high scores for essay length & sophisticated word choice, and higher overall grades than human graders gave.

E-rater gave African Americans low marks for grammar, style, & organization, and lower overall grades than expert human graders gave them pic.twitter.com/Vw12Rjt4QY
— Rachel Thomas (@math_rachel) September 4, 2019

nlp bias

Tags