Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Writing tutorial for deep reinforcement learning implementation with pytorch from scratch #1104

Closed
wants to merge 43 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
43 commits
Select commit Hold shift + click to select a range
de25e40
Defining tutorial structure, as well as starting with code implmeneta…
hahas94 Dec 10, 2023
465cba0
Continuing with implementation of replay memory and testing its corre…
hahas94 Dec 11, 2023
35e9a6c
Merge remote-tracking branch 'upstream/main' into tutorial
hahas94 Jan 5, 2024
eeaf42b
Adding the skeleton for the agent class.
hahas94 Jan 5, 2024
3d3946c
Merge remote-tracking branch 'upstream/main' into tutorial
hahas94 Feb 13, 2024
df387aa
Merge remote-tracking branch 'upstream/main' into tutorial
hahas94 Feb 16, 2024
3f6d2a6
Finishing the implementation of the agent class.
hahas94 Feb 16, 2024
9626439
Merge remote-tracking branch 'upstream/main' into tutorial
hahas94 Feb 20, 2024
e89873b
Starting with implementation of the training function.
hahas94 Feb 20, 2024
c5b9735
Merge remote-tracking branch 'upstream/main' into tutorial
hahas94 Mar 20, 2024
0c21314
Finishing with implementing the train function and training an agent …
hahas94 Mar 20, 2024
a03ed9a
Merge remote-tracking branch 'upstream/main' into tutorial
hahas94 Mar 26, 2024
8edad3b
Implemented functions for plotting performance of agents.
hahas94 Mar 26, 2024
76ea93a
Merge remote-tracking branch 'upstream/main' into tutorial
Apr 6, 2024
3e78de8
Performing some cleaning in documentation, typing etc.
Apr 6, 2024
d1a7a07
Merge remote-tracking branch 'upstream/main' into tutorial
Apr 16, 2024
5beef16
Merge remote-tracking branch 'upstream/main' into tutorial
Apr 17, 2024
ecaf16e
Further implementing visualization functions, as well as cleaning up …
Apr 17, 2024
1b63b5e
Merge remote-tracking branch 'upstream/main' into tutorial
Apr 18, 2024
2da7d07
Further developing visualization function for new metrics, as well as…
Apr 18, 2024
dd7692e
Merge remote-tracking branch 'upstream/main' into tutorial
Apr 26, 2024
f515e2b
Slightly refactoring Memory and Agent classes, adding functionality f…
Apr 26, 2024
a0c7d20
Merge remote-tracking branch 'upstream/main' into tutorial
May 24, 2024
b3a0bda
This commit contains a number of changes in the entire file. changed …
May 24, 2024
4ebabce
Merge remote-tracking branch 'upstream/main' into tutorial
Jun 2, 2024
a3d6902
Fixing one bug in the Agent class and making some small changes.
Jun 2, 2024
257c2ea
Made hyperparameters a dataclass instead of namedtuple for easier def…
Jun 3, 2024
b0dbdf6
Merge remote-tracking branch 'upstream/main' into tutorial
Jun 30, 2024
c6d6753
Correcting spelling in to_tensor() function and changing structure.
Jun 30, 2024
8bf71f3
Updating create_env function to do what it is supposed to do only.
Jun 30, 2024
ef78103
Updating docs in parallel_training() and visualize_performance() func…
Jun 30, 2024
846964a
Replacing replay memory class with a simpler one.
Jun 30, 2024
454a070
Decoupling agent and network, creating a separate class for the netwo…
Jun 30, 2024
eed44a6
Updating the train() function to match the changes in the agent class…
Jun 30, 2024
fe4dc6f
Updating docs for the random_agent_play() function.
Jun 30, 2024
b5c02c1
Adding new results, as well as some other smaller changes.
Jun 30, 2024
bdeee3e
Restructuring the file, adding text, and fixing small documentation p…
Jul 1, 2024
6fd69ce
Adding new results visualizations and updating old ones.
Jul 1, 2024
cdfaa46
Merge remote-tracking branch 'upstream/main' into tutorial
Jul 1, 2024
2026f7b
Merge remote-tracking branch 'upstream/main' into tutorial
Jul 9, 2024
141f27e
Addressing review comments.
Jul 9, 2024
94fc663
Merge remote-tracking branch 'upstream/main' into tutorial
Jul 13, 2024
4d7c17f
Addressing review comment.
Jul 13, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added docs/_static/img/tutorials/drl_Acrobot.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/img/tutorials/drl_CartPole.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/img/tutorials/drl_LunarLander.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/videos/tutorials/drl_Acrobot.gif
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/videos/tutorials/drl_CartPole.gif
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1,095 changes: 1,095 additions & 0 deletions docs/tutorials/training_agents/deep_rl_tutorial.py

Large diffs are not rendered by default.