Tweeted By @jaseweston
There's always something cringe on Twitter, here's a useful one!
— Jason Weston (@jaseweston) November 14, 2022
🚨 new paper 🚨
The CRINGE Loss: Learning what language not to model
Train your LM to not generate bad sequences.
Shows improvements on three tasks (safety, contradictions, open dialogue).https://t.co/yiAzzYQcbV pic.twitter.com/ZMmm31Xdil