This repo is a work in progress toward a minimal implementation of the transformer architecture with multi-head self-attention, built for my own curiosity and deeper understanding. The model is trained and evaluated on a toy dataset where the task is to reverse a sequence of integers.
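For concreteness, the toy task can be generated along the following lines. This is a minimal sketch, not the repo's actual data pipeline; the `make_batch` helper and its default batch size, sequence length, and vocabulary size are illustrative assumptions:

```python
import torch

def make_batch(batch_size=32, seq_len=10, vocab_size=20):
    # Random integer tokens; the sizes here are illustrative defaults,
    # not the settings used by this repo's training script.
    src = torch.randint(1, vocab_size, (batch_size, seq_len))
    # The target is simply the source sequence reversed.
    tgt = torch.flip(src, dims=[1])
    return src, tgt
```

For example, `make_batch()` yields a pair of `(32, 10)` integer tensors where each target row is its source row reversed.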
- Clone the repository:

  ```bash
  git clone https://github.com/naivoder/AttentionIsAllYouNeed.git
  cd AttentionIsAllYouNeed
  ```

- Install the dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Run the training script:

  ```bash
  python main.py
  ```
Special thanks to Aladdin Persson for his explanation of how `torch.einsum` can express the attention mechanism.
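For readers curious about that detail, here is a minimal sketch of multi-head self-attention written with `torch.einsum`, in the spirit of that explanation. The `SelfAttention` class, its layer names, and its dimension labels are illustrative assumptions rather than this repo's exact implementation:

```python
import torch
import torch.nn as nn

class SelfAttention(nn.Module):
    def __init__(self, embed_dim, num_heads):
        super().__init__()
        assert embed_dim % num_heads == 0, "embed_dim must divide evenly across heads"
        self.num_heads = num_heads
        self.head_dim = embed_dim // num_heads
        self.to_q = nn.Linear(embed_dim, embed_dim)
        self.to_k = nn.Linear(embed_dim, embed_dim)
        self.to_v = nn.Linear(embed_dim, embed_dim)
        self.out = nn.Linear(embed_dim, embed_dim)

    def forward(self, x):
        n, l, _ = x.shape  # (batch, seq_len, embed_dim)
        # Project and split into heads: (batch, seq_len, num_heads, head_dim)
        q = self.to_q(x).reshape(n, l, self.num_heads, self.head_dim)
        k = self.to_k(x).reshape(n, l, self.num_heads, self.head_dim)
        v = self.to_v(x).reshape(n, l, self.num_heads, self.head_dim)
        # "nqhd,nkhd->nhqk": for each head h, score query position q
        # against key position k by contracting over head_dim.
        scores = torch.einsum("nqhd,nkhd->nhqk", q, k) / self.head_dim ** 0.5
        attn = scores.softmax(dim=-1)
        # "nhqk,nkhd->nqhd": weight the values by attention,
        # summing over key positions.
        out = torch.einsum("nhqk,nkhd->nqhd", attn, v)
        # Concatenate heads and apply the output projection.
        return self.out(out.reshape(n, l, self.num_heads * self.head_dim))
```

For example, `SelfAttention(embed_dim=64, num_heads=4)(torch.randn(2, 10, 64))` returns a `(2, 10, 64)` tensor. The appeal of `einsum` here is that the subscript strings make the contraction dimensions explicit, avoiding a chain of transposes and batched matrix multiplies.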