Tweeted By @_akhaliq
Peripheral Vision Transformer
— AK (@_akhaliq) June 15, 2022
abs: https://t.co/c6R8BfNDPS
propose to incorporate peripheral position encoding to the multi-head self-attention layers to let the network learn to partition the visual field into diverse peripheral regions given training data pic.twitter.com/S78e7WXDKh