Compressive Transformer
Compressive Transformers for Long-Range Sequence Modelling, 2019-11
https://deepmind.com/blog/article/A_new_model_and_dataset_for_long-range_memory
memory model
,
transformer
,
dnc
,
lstm
,
language model
,
nlp
,
Timothy P. Lillicrap
,
DeepMind
,
2019