Tweeted By @GaelVaroquaux
New release of dirty-cat, v0.1 ✨: facilitating machine learning on non-curated data.
— Gael Varoquaux (@GaelVaroquaux) February 17, 2021
Big new feature: the GapEncoder, which encodes on interpretable latent categories inferred from recurrent substrings, and robust to typos and other variationshttps://t.co/Y5Azf61ERE pic.twitter.com/yy6JIVUsZo