Tweeted By @hllo_wrld
If you squint, transformer is like a densely connected factor graph. Network depth approximates the number of rounds of loopy belief propagation.
— Victor Zhong (@hllo_wrld) August 25, 2018
If you squint, transformer is like a densely connected factor graph. Network depth approximates the number of rounds of loopy belief propagation.
— Victor Zhong (@hllo_wrld) August 25, 2018