by math_rachel on 2019-09-04 (UTC).

Essay grading software (used in 21 states) focuses on metrics like sentence length, vocab, spelling, & subject-verb agreement, but ignores hard-to-measure aspects like creativity.

Meaningless gibberish essays created with sophisticated words score well. https://t.co/ESgE8WSRO2 pic.twitter.com/sxC37KiH3Q

— Rachel Thomas (@math_rachel) September 4, 2019
nlp bias
by math_rachel on 2019-09-04 (UTC).

E-rater gave students from China high scores for essay length & sophisticated word choice, and higher overall grades than human graders gave.

E-rater gave African Americans low marks for grammar, style, & organization, and lower overall grades than expert human graders gave them. pic.twitter.com/Vw12Rjt4QY

— Rachel Thomas (@math_rachel) September 4, 2019
nlp bias
