Tweeted By @karpathy
Amusing! Object detection cast naively into language modeling framework + borrowing many of the tips&tricks.
— Andrej Karpathy (@karpathy) September 24, 2021
- random object ordering seems fine ✅
- coords, class labels flattened into a single softmax 😂
- sequence augmentation is the most gnarly part, almost as yucky as nms 😬 https://t.co/FxSz5UbpxY