Belief over policies changes rapidly #114
Unanswered
matteolimoncini
asked this question in
Q&A
Replies: 1 comment
-
Hi @matteolimoncini , Apologies for the delayed reply. It sounds like the fact that the beliefs about states and policies change rapidly from one trial to the next, is due to the construction of the generative model (your Depending on how these are constructed, it might be the case that |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm using pymdp to model an agent that attends an experiment composed of 160 trials, with 2 timestep for each trial.
The agent can do only two actions, and there are only two states.
During each trial the agent should do an observation, then it infers a policy, it chooses an action, it executes that action and does another observation; at the end of the trial the agent updates its beliefs about the hidden states.
The problem is that the belief over the states and the belief over policies change rapidly between all trials, and they are often around 0 or around 1, rarely assume values in between.
I tried to modify the gamma parameters in the Agent constructor without success (with gamma = 5, 2, 0.5, 0.2)
How can I do?
This is the pseucode of my problem
Beta Was this translation helpful? Give feedback.
All reactions