benchmark/
├── common/
├── diffusion/
│ └── text-to-image/
└── llm_text_generation/
├── chat/
└── code/
The common
directory is for utilities that are common to all benchmarking tasks.
Other than that, there is one directory for each type of model and subdirectories for more specific tasks.