Chapter 3: Coding Attention Mechanisms

01_main-chapter-code contains the main chapter code.

02_bonus_efficient-multihead-attention implements and compares different variants of multi-head attention.