Medium-Blog
A great intuitive article by co-author of original paper, Reading it before the original paper would be really helpful
Attention is all you need
Research Paper published by Vaswani et al
Here is a great YouTube video explaining
this
paper. You may follow this channel for good paper explanations.
https://d2l.ai/chapter_attention-mechanisms/attention.html
A nice explanation of the attention mechanism along with a simple code implementation in PyTorch that drives home the point.
Hugging Face Article
A great deep dive explainer with great visualizations along with mathematical notations that paint a clear image
Fastai YouTube video on Transformers
Feel free to add any more resources that you find helpful during your foray into Transformers!