Implement the REINFORCE and PEGASUS algorithms and apply them to the $4\times 3$ world, using a policy family of your own choosing. Comment on the results.
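As a possible starting point, here is a minimal sketch of both algorithms, not a reference solution. It assumes the usual textbook layout of the $4\times 3$ world (wall at $(2,2)$, terminals $+1$ at $(4,3)$ and $-1$ at $(4,2)$, step reward $-0.04$, start at $(1,1)$, motion that succeeds with probability 0.8 and slips sideways with probability 0.1 each way), no discounting, and an episode cap of 100 steps. The policy family chosen here is a tabular softmax, REINFORCE is run without a baseline, and the PEGASUS search step is plain random-perturbation hill climbing; all of these, and every function name, are illustrative choices rather than part of the exercise.

```python
import math
import random

# Assumed 4x3 world: columns 1..4, rows 1..3, wall at (2,2).
WALL = (2, 2)
TERMINALS = {(4, 3): 1.0, (4, 2): -1.0}
STATES = [(x, y) for x in range(1, 5) for y in range(1, 4) if (x, y) != WALL]
MOVES = [(0, 1), (1, 0), (0, -1), (-1, 0)]  # up, right, down, left
STEP_REWARD = -0.04


def step(state, action, u):
    """Apply `action` using the uniform(0,1) draw `u`: 0.8 intended direction,
    0.1 for each perpendicular direction. Blocked moves leave the agent in place."""
    if u < 0.8:
        a = action
    elif u < 0.9:
        a = (action + 1) % 4
    else:
        a = (action - 1) % 4
    nxt = (state[0] + MOVES[a][0], state[1] + MOVES[a][1])
    if nxt not in STATES:
        nxt = state
    return nxt, TERMINALS.get(nxt, STEP_REWARD)


def action_probs(theta, state):
    """Softmax over the four action preferences stored for `state`."""
    prefs = theta[state]
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    z = sum(exps)
    return [e / z for e in exps]


def rollout(theta, rng, max_steps=100):
    """Run one episode from (1,1); return a list of (state, action, reward)."""
    state, traj = (1, 1), []
    for _ in range(max_steps):
        action = rng.choices(range(4), weights=action_probs(theta, state))[0]
        nxt, reward = step(state, action, rng.random())
        traj.append((state, action, reward))
        if nxt in TERMINALS:
            break
        state = nxt
    return traj


def reinforce(episodes=2000, alpha=0.1, seed=0):
    """Monte Carlo policy gradient: theta += alpha * G_t * grad log pi(a_t|s_t)."""
    rng = random.Random(seed)
    theta = {s: [0.0] * 4 for s in STATES}
    for _ in range(episodes):
        g = 0.0
        for state, action, reward in reversed(rollout(theta, rng)):
            g += reward  # undiscounted return from this step onward
            probs = action_probs(theta, state)
            for a in range(4):
                # For a softmax, d log pi(action) / d theta[a] = 1[a == action] - pi(a).
                theta[state][a] += alpha * g * ((a == action) - probs[a])
    return theta


def pegasus_value(theta, seeds):
    """Average return over fixed scenarios. Each seed pins down every random
    draw of its episode, so the estimate is a deterministic function of theta."""
    total = 0.0
    for s in seeds:
        total += sum(r for _, _, r in rollout(theta, random.Random(s)))
    return total / len(seeds)


def pegasus_hill_climb(seeds, iters=200, sigma=0.5, seed=1):
    """Hill climbing on the deterministic estimate (search method is an assumption)."""
    rng = random.Random(seed)
    theta = {s: [0.0] * 4 for s in STATES}
    best = pegasus_value(theta, seeds)
    for _ in range(iters):
        cand = {s: [p + rng.gauss(0.0, sigma) for p in prefs]
                for s, prefs in theta.items()}
        value = pegasus_value(cand, seeds)
        if value > best:  # comparison is noise-free because scenarios are fixed
            theta, best = cand, value
    return theta, best


if __name__ == "__main__":
    train_seeds = list(range(30))
    test_seeds = list(range(1000, 1200))  # held-out scenarios for a fair comparison
    uniform = {s: [0.0] * 4 for s in STATES}
    print("uniform  :", pegasus_value(uniform, test_seeds))
    print("REINFORCE:", pegasus_value(reinforce(), test_seeds))
    print("PEGASUS  :", pegasus_value(pegasus_hill_climb(train_seeds)[0], test_seeds))
```

When commenting on results, it may be worth comparing both learned policies on held-out scenarios, as the script does, since the PEGASUS estimate on its own training scenarios is optimistically biased. The variance of REINFORCE updates across seeds and the effect of the number of PEGASUS scenarios are natural things to vary.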