Tweeted By @diegovogeid
Did you see this paper (by @perez) where they proved something similar theoretically? (Transformers with a single attention head are Turing complete)
— Diego Francisco Valenzuela Iturra (@diegovogeid) May 28, 2019
- On the Turing Completeness of Modern Neural Network Architectures https://t.co/DaHP1biIa8