Articles to Read:
- Illustrated Transformer
@jay_alammar
: https://jalammar.github.io/illustrated-transformer/ - Illustrated Word2Vec
@jay_alammar
: https://jalammar.github.io/illustrated-word2vec/ - KV-Cache in Transformers: https://kipp.ly/transformer-inference-arithmetic/