2020-08 Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention