Q-learning-DM

This is an example of a dialogue manager based on reinforcement learning, trained with Q-learning. The system converts the task structure written in data/dialogueframe.txt into probability distributions that are used by the belief update and the user simulator. Details are given in the following paper.

Koichiro Yoshino, Shinji Watanabe, Jonathan Le Roux, John R. Hershey. Statistical Dialogue Management using Intention Dependency Graph. In Proc. IJCNLP 2013. http://aclweb.org/anthology//I/I13/I13-1127.pdf
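For readers new to Q-learning, the following is a minimal sketch of the standard tabular update rule, not the code in Qlearn.py. The three-state toy environment, the function names (q_update, epsilon_greedy, step, train), and the hyperparameter values are all assumptions made for illustration; the actual system works on belief states derived from data/dialogueframe.txt.

```python
import random

N_STATES = 3   # hypothetical toy task: states 0 and 1, plus terminal state 2
TERMINAL = 2
N_ACTIONS = 2  # hypothetical actions: 0 = stay, 1 = advance


def q_update(q_table, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the new value."""
    best_next = max(q_table[next_state])       # greedy value of the next state
    td_target = reward + gamma * best_next     # bootstrapped target
    q_table[state][action] += alpha * (td_target - q_table[state][action])
    return q_table[state][action]


def epsilon_greedy(q_table, state, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_table[state]))
    values = q_table[state]
    return values.index(max(values))


def step(state, action):
    """Toy environment (an assumption): action 1 advances, action 0 stays."""
    if action == 1:
        next_state = state + 1
        reward = 10.0 if next_state == TERMINAL else -1.0
    else:
        next_state, reward = state, -1.0
    return next_state, reward


def train(episodes=500, epsilon=0.2, max_turns=50, seed=0):
    """Run Q-learning on the toy environment and return the learned table."""
    rng = random.Random(seed)
    q_table = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_turns):             # cap the episode length
            action = epsilon_greedy(q_table, state, epsilon, rng)
            next_state, reward = step(state, action)
            q_update(q_table, state, action, reward, next_state)
            state = next_state
            if state == TERMINAL:
                break
    return q_table


q_table = train()
for s in range(N_STATES - 1):
    greedy = q_table[s].index(max(q_table[s]))
    print("state", s, "greedy action", greedy)
```

The per-turn penalty of -1 in this toy setup plays the role that a turn cost typically plays in dialogue management: it pushes the learned policy toward finishing the task in fewer turns.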

Required

Python (3.5 or later)
NumPy (1.13.3 or later)
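One way to confirm that an environment satisfies these minimums is a short check like the one below. This is only a sketch: the helper names version_tuple and meets_minimum are hypothetical and not part of this repository, and the parser only looks at the leading numeric part of each version component.

```python
import sys


def version_tuple(text, parts=3):
    """Parse the leading numeric components of a version string like '1.13.3'."""
    numbers = []
    for piece in text.split(".")[:parts]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break                  # stop at suffixes such as 'rc1'
            digits += ch
        numbers.append(int(digits) if digits else 0)
    while len(numbers) < parts:        # pad short versions such as '1.13'
        numbers.append(0)
    return tuple(numbers)


def meets_minimum(installed, minimum):
    """Return True if the installed version is at least the minimum."""
    return version_tuple(installed) >= version_tuple(minimum)


python_ok = sys.version_info >= (3, 5)
try:
    import numpy
    numpy_ok = meets_minimum(numpy.__version__, "1.13.3")
except ImportError:
    numpy_ok = False                   # NumPy is not installed at all

print("python ok:", python_ok, "| numpy ok:", numpy_ok)
```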

How to run

python Qlearn.py

To test the user simulator on its own, you can also run python simulator.py.
