Multigoal Vision Language Navigation in Vizdoom

Using Fourier Transform for Improving Task-Oriented Language Grounding

Using Fourier Transform for Improving Task-Oriented Language Grounding
Nguyen T.Tin, Ngoc Duy Nguyen, Che Peng Lim, Asim Bhatti, Kim Yong Guk
Sejong University, Seoul, Korea
Deakin University, Australia

https://youtu.be/uF2CtWHsPfk

This repository contains:

Code for training an A3C-LSTM agent using Fourier Transform Attention in Single Goal or Multi Goals Env.

Dependencies

Usage

Using the Environment

For running a random agent:

python env_test.py

To play in the environment:

python env_test.py --interactive 1

To change the difficulty of the environment (easy/medium/hard):

python env_test.py -d easy

Training

For training a Stacked Attention A3C-LSTM agent with 4 threads:

python a3c_main.py --num-processes 4 --evaluate 0 --difficulty easy

For training a Stacked Attention and Auto-Encoder A3C-LSTM with agent with 4 threads:

python a3c_main.py --num-processes 4 --evaluate 0  --difficulty easy --auto-encoder

The code will save the best model at ./saved/.

Testing

To the test the pre-trained model for Multitask Generalization:

python a3c_main.py --evaluate 1 --load saved/pretrained_model

To the test the pre-trained model for Zero-shot Task Generalization:

python a3c_main.py --evaluate 2 --load saved/pretrained_model

To the visualize the model while testing add '--visualize 1':

python a3c_main.py --evaluate 2 --load saved/pretrained_model --visualize 1

To test the trained model, use --load saved/model_best in the above commands.


## Cite as
>Nguyen T.Tin, Ngoc Duy Nguyen, Che Peng Lim, Asim Bhatti, Kim Yong Guk.


## Acknowledgements
This repository uses ViZDoom API (https://github.com/mwydmuch/ViZDoom) and parts of the code from the API. This is a PyTorch implementation based on [this repo](https://github.com/devendrachaplot/DeepRL-Grounding).

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
ae		ae
analysis		analysis
attention/convolve_attention		attention/convolve_attention
data		data
env		env
language_model		language_model
maps		maps
models		models
utils		utils
.gitignore		.gitignore
README.md		README.md
a3c_main.py		a3c_main.py
a3c_test.py		a3c_test.py
a3c_train.py		a3c_train.py
threegoal_a3c_main.py		threegoal_a3c_main.py
threegoal_a3c_test.py		threegoal_a3c_test.py
threegoal_a3c_train.py		threegoal_a3c_train.py
twogoal_a3c_main.py		twogoal_a3c_main.py
twogoal_a3c_test.py		twogoal_a3c_test.py
twogoal_a3c_train.py		twogoal_a3c_train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multigoal Vision Language Navigation in Vizdoom

Using Fourier Transform for Improving Task-Oriented Language Grounding

This repository contains:

Dependencies

Usage

Using the Environment

Training

Testing

About

Releases

Packages

Languages

ngthanhtin/Multigoal_VLN_Vizdoom

Folders and files

Latest commit

History

Repository files navigation

Multigoal Vision Language Navigation in Vizdoom

Using Fourier Transform for Improving Task-Oriented Language Grounding

This repository contains:

Dependencies

Usage

Using the Environment

Training

Testing

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages