updated the step function inside MDPEnv class #6

vaishn99 · 2023-01-23T09:57:28Z

Found a bug and solved it.

I have noticed a bug in the library.

Modification:

Change is made for the step function, which is inside MDPEnv

explanation:

choosing "next_state and reward" is not synchronized, each of them is independently sampled previously, but they should be synchronized as per the definition.

updated the step function inside MDPEnv class

eba666b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

updated the step function inside MDPEnv class #6

updated the step function inside MDPEnv class #6

vaishn99 commented Jan 23, 2023

updated the step function inside MDPEnv class #6

Are you sure you want to change the base?

updated the step function inside MDPEnv class #6

Conversation

vaishn99 commented Jan 23, 2023

Found a bug and solved it.

Modification:

explanation: