Attention is all you need is a paper that introduced the transformer architecture, a model that has been used in many NLP tasks.