Tweeted By @_inesmontani
🐍 TIL: Creating pretty visual diffs in Python is much easier than I thought!
— Ines Montani 〰️ (@_inesmontani) August 1, 2019
I needed a script to refine tokenization alignment between word pieces (transformer models) and linguistically motivated tokens (e.g. spaCy), so I wrote a simple diff printer.
Gist: https://t.co/ZpiLADpqsX pic.twitter.com/zwxkVIvdyE
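For readers who want a feel for the idea without opening the gist, here is a minimal sketch of a token-level diff printer. It is not the code from the linked gist: it assumes the standard library's `difflib.SequenceMatcher` as the alignment engine, and the helper names `diff_tokens` and `render_diff`, the `[-...-]` / `{+...+}` markers, and the sample token lists are all made up for illustration.

```python
import difflib

# ANSI escape codes for colored terminal output (assumes the terminal supports them).
RED, GREEN, RESET = "\033[91m", "\033[92m", "\033[0m"


def diff_tokens(a, b):
    """Return (op, tokens) pairs describing how token list `a` turns into `b`.

    op is "=" for unchanged tokens, "-" for tokens only in `a`,
    and "+" for tokens only in `b`.
    """
    matcher = difflib.SequenceMatcher(None, a, b, autojunk=False)
    ops = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            ops.append(("=", a[i1:i2]))
            continue
        # "replace" is reported as a deletion followed by an insertion.
        if i1 < i2:
            ops.append(("-", a[i1:i2]))
        if j1 < j2:
            ops.append(("+", b[j1:j2]))
    return ops


def render_diff(ops, color=True):
    """Join the diff into one line, marking deletions and insertions."""
    parts = []
    for op, tokens in ops:
        text = " ".join(tokens)
        if op == "=":
            parts.append(text)
        elif op == "-":
            marked = f"[-{text}-]"
            parts.append(f"{RED}{marked}{RESET}" if color else marked)
        else:
            marked = f"{{+{text}+}}"
            parts.append(f"{GREEN}{marked}{RESET}" if color else marked)
    return " ".join(parts)


if __name__ == "__main__":
    # Hypothetical example: word pieces vs. a linguistically motivated tokenization.
    wordpieces = ["I", "like", "token", "##ization", "."]
    tokens = ["I", "like", "tokenization", "."]
    print(render_diff(diff_tokens(wordpieces, tokens)))
```

With `color=False`, the sample above renders as `I like [-token ##ization-] {+tokenization+} .`, which makes it easy to spot where the two tokenizations disagree. A character-level diff would work the same way by passing strings instead of token lists.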