Lectures
Readings
-
Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. “Attention is all you need.” In Advances in Neural Information Processing Systems, pp. 5998-6008. 2017.
-
Mary Phuong and Marcus Hutter. 2022. Formal Algorithms for Transformers. arXiv:2207.09238 [cs].
-
Jay Alammar, The Illustrated Transformer.
-
Lin, Tianyang, Yuxin Wang, Xiangyang Liu, and Xipeng Qiu. “A survey of transformers.” AI Open (2022).