Tweeted By @johnhewtt
Learned a lot about LSTM behavior -- in very different ways -- from two excellent @acl2018 papers: Sharp Nearby, Fuzzy Far Away... by @ukhndlwl, He He, Peng Qi, and @jurafsky, and LSTM as Dynamically Computed... by @omerlevy_, @kentonctlee, @nfitz, @lukezettlemoyer.
— John Hewitt (@johnhewtt) June 2, 2018