This repository has been archived by the owner on Aug 16, 2023. It is now read-only.

ChocoOreal/VQAProject


This is a visual question answering (VQA) system that uses SAN [1] as the fusion module, ViT as the image feature extractor, and BERT as the text feature extractor.
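To make the role of the fusion module concrete, here is a minimal, dependency-free sketch of stacked attention as described in [1]: each hop scores the image regions against the current query vector, takes the attention-weighted sum of the regions, and adds it to the query to form a refined query. This is not the code from this repository. The function names, the dimensions, the random weights, and the idea that `regions` stands for ViT patch features and `question` for a BERT sentence vector projected to the same size are all assumptions for illustration.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_layer(regions, query, W_i, W_q, b_a, w_p, b_p):
    """One attention hop: h = tanh(W_i v + (W_q u + b_a)), p = softmax(w_p . h + b_p)."""
    q_proj = [a + b for a, b in zip(matvec(W_q, query), b_a)]
    scores = []
    for v in regions:
        h = [math.tanh(a + b) for a, b in zip(matvec(W_i, v), q_proj)]
        scores.append(sum(w * hj for w, hj in zip(w_p, h)) + b_p)
    p = softmax(scores)
    # Attention-weighted sum of region features.
    attended = [
        sum(p[i] * regions[i][j] for i in range(len(regions)))
        for j in range(len(query))
    ]
    # Refined query: attended image vector plus the previous query.
    refined = [a + q for a, q in zip(attended, query)]
    return refined, p


def stacked_attention(regions, question, layers):
    """Apply the attention hops in sequence, feeding each refined query forward."""
    u, all_weights = question, []
    for params in layers:
        u, p = attention_layer(regions, u, *params)
        all_weights.append(p)
    return u, all_weights


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


# Hypothetical sizes: feature dim d, attention dim k, m image regions, 2 hops.
rng = random.Random(0)
d, k, m = 8, 4, 5
regions = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(m)]  # stand-in for ViT patches
question = [rng.gauss(0, 1) for _ in range(d)]                     # stand-in for a BERT vector
layers = [
    (random_matrix(k, d, rng), random_matrix(k, d, rng), [0.0] * k,
     [rng.uniform(-0.1, 0.1) for _ in range(k)], 0.0)
    for _ in range(2)
]
u, weights = stacked_attention(regions, question, layers)
```

In a full model, the final vector `u` would typically feed a classifier over candidate answers. A real implementation would use a tensor library and learned weights rather than Python lists and random values.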

To install all dependencies, run

pip install -r requirements.txt

To run the program, the dataset and the model's weights need to be downloaded from here. After the download completes, move the dataset folder and the model.py file to the project root folder.
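Before launching, it can help to confirm that the downloaded files ended up in the right place. The snippet below is a small optional check, not part of the project. The folder name `dataset` is an assumption (the actual name of the downloaded folder may differ), and `missing_assets` is a hypothetical helper.

```python
import tempfile
from pathlib import Path

# Assumed names: adjust "dataset" to match the folder you actually downloaded.
REQUIRED = ("dataset", "model.py")


def missing_assets(root, required=REQUIRED):
    """Return the required entries that do not exist directly under root."""
    root = Path(root)
    return [name for name in required if not (root / name).exists()]


# Demonstration on a throwaway directory that only contains the dataset folder.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "dataset").mkdir()
    demo_missing = missing_assets(tmp)
```

Calling `missing_assets(".")` from the project root returns an empty list when both entries are present.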

Run this command to launch the program:

streamlit run main.py

[1]: Yang, Zichao, et al. "Stacked attention networks for image question answering." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.

About

A small VQA project on the DAQUAR dataset
