GitHub - ChocoOreal/VQAProject: A small VQA project on DAQUAR dataset

This is a Visual question answering system, with SAN[1] as the fusion module, ViT as the image feature extractor, and BERT as the text feature extractor.

To install all dependencies, run

pip install -r requirements.txt

To run the program, dataset and model's weight need to be downloaded from here. After download completed, move the dataset folder and model.py file to the project root folder.

Run this command to launch the program:

streamlit run main.py

[1]: Yang, Zichao, et al. "Stacked attention networks for image question answering." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
dataloader.py		dataloader.py
main.py		main.py
model.py		model.py
predict.py		predict.py
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

ChocoOreal/VQAProject

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages