Version 1.2.4 (Check the "Patch Note.md" file in the "doc" folder)
VOCdevShy committed Apr 12, 2024
1 parent 73413cf commit b90b176
Showing 6 changed files with 198 additions and 97 deletions.
29 changes: 16 additions & 13 deletions Doc/Bug list.md
@@ -1,34 +1,34 @@
<h2>This is the bug list. In this files, all the bugs/problems of the program are detailed. <br>(If you know how to fix a bug or if you found one, please open an issue)</br></h2>
<h2 align="center">This is the bug list. In this file, all the bugs/problems of the program are detailed. <br>(If you know how to fix a bug or if you found one, please open an issue on <a href="https://github.com/VOCdevShy/Q-Learning_Frozen_Lake/issues">GitHub</a>)</h2>

1. Sequence lag/Sequence recording lag (Linked to problems 2 and 3) (Fixed)
- explication: Add `time.sleep(0.7)` line 84 + delete of `action = env.action_space.sample()` before the while not done
- Explanation of the fix: Add `time.sleep(0.7)` at line 84 and delete `action = env.action_space.sample()` before the `while not done` loop, as in the sketch below.
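A minimal sketch of where that pause and the action choice end up (assuming the gym >= 0.26 API; the project's own file and its line 84 may differ slightly):

```
import time

import gym
import numpy as np

env = gym.make("FrozenLake-v1", render_mode="ansi")
qtable = np.zeros((env.observation_space.n, env.action_space.n))  # stand-in Q-table

state, _ = env.reset()
done = False
while not done:
    # The action is chosen inside the loop only; no leftover
    # env.action_space.sample() runs before the loop.
    action = np.argmax(qtable[state])
    state, reward, terminated, truncated, _ = env.step(action)
    done = terminated or truncated
    print(env.render())
    time.sleep(0.7)  # pause so each frame is printed before the next step
env.close()
```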

<p> </p>

2. Output broadcast lag (Linked to problems 1 and 3) (Fixed)
- explication: Add `time.sleep(0.7)` line 84 to reset the environment correctly
- Explanation of the fix: Add `time.sleep(0.7)` at line 84 so the environment resets correctly.

<p> </p>

3. Agent input starts too early (Linked to problems 1 and 2) (Fixed)
- explication: Add `time.sleep(0.7)` line 84 + delete of `action = env.action_space.sample()` before the while not done
- Explanation of the fix: Add `time.sleep(0.7)` at line 84 and delete `action = env.action_space.sample()` before the `while not done` loop.

<p> </p>

4. "Error" message after the presentation of the Virgin Q-Table:
`/home/runner/Q-Learning-Frozen-lake/.pythonlibs/lib/python3.10/site-packages/gym/utils/passive_env_checker.py:233: DeprecationWarning: np.bool8 is a deprecated alias for np.bool_. (Deprecated NumPy 1.24)
<br>`/home/runner/Q-Learning-Frozen-lake/.pythonlibs/lib/python3.10/site-packages/gym/utils/passive_env_checker.py:233: DeprecationWarning: np.bool8 is a deprecated alias for np.bool_. (Deprecated NumPy 1.24)
if not isinstance(terminated, (bool, np.bool8)):` (Not entirely fixed)
- explication: The problem is link between numpy and gymnasium package. With the Version of the 6/02/24 (1.23.4) of numpy the problem is not solve entierly and this message doesn't impact the code so i hide it with a warning.
- Explanation of the fix: The problem comes from the interaction between the numpy and gymnasium packages. With the numpy version in use as of 6/02/24 (1.23.4) the problem is not entirely solved, and since the message doesn't impact the code, it is hidden with a warning filter (sketched below).
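A minimal sketch of such a filter (an assumption about the exact call; the project may filter differently):

```
import warnings

# Hide the np.bool8 deprecation notice raised inside gym's passive_env_checker;
# it is cosmetic and does not affect the training itself.
warnings.filterwarnings(
    "ignore",
    message=r".*np\.bool8.* is a deprecated alias.*",
    category=DeprecationWarning,
)
```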

<p> </p>

5. The agent might be doing 2 actions per input instead of only 1 (Linked to problems 1, 2 and 3) (Fixed)
- explication: Add `time.sleep(0.7)` line 84 + delete of `action = env.action_space.sample()` before the while not done
- Explanation of the fix: Add `time.sleep(0.7)` at line 84 and delete `action = env.action_space.sample()` before the `while not done` loop.

<p> </p>

6. The agent takes action 0 (left) too many times when the epsilon test selects exploitation of the Q-Table (Fixed)
- explication: It is because np.argmax(qtable[state]) return [0. 0. 0. 0.] and those values are = 0 so this first action it take is the first one (LEFT) so to correct it we modified the code to:
- Explanation of the fix: `qtable[state]` starts as `[0. 0. 0. 0.]`, and when all values are equal `np.argmax` returns the first index, so the first action taken is always 0 (LEFT). To correct it, the code was modified to add:
```
if np.argmax(qtable[state]) == 0:
    # index 0 is argmax's default for an all-zero row, so fall back
    # to a random action instead of always going LEFT
    action = env.action_space.sample()
```
@@ -37,18 +37,21 @@
<p> </p>
7. Teleportation of the agent across a tile, as if it performed two actions in one, which shows up as a single action in the sequence (Linked to problems 3 and 5) (Fixed)
- explication: Add `time.sleep(0.7)` line 84 + delete of `action = env.action_space.sample()` before the while not done
- Explanation of the fix: Add `time.sleep(0.7)` at line 84 and delete `action = env.action_space.sample()` before the `while not done` loop.
<p> </p>
8. While the test of the Q-Table is running, the agent can get stuck going left and right infinitely on a tile (Fixed)
- explication: It is because np.argmax(qtable[state]) return [0. 0. 0. 0.] and those values are = 0 so this first action it take is the first one (LEFT) so to correct it we modified the code to:
- Explanation of the fix: `qtable[state]` starts as `[0. 0. 0. 0.]`, and when all values are equal `np.argmax` returns the first index, so the first action taken is always 0 (LEFT). To correct it, the code was modified to add:
```
if np.argmax(qtable[state]) == 0:
    action = env.action_space.sample()
```
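Note that `np.argmax(qtable[state]) == 0` also fires when LEFT is genuinely the best learned action. A stricter variant (a sketch, not the project's actual code) falls back to exploration only while the row is still untouched:

```
import gym
import numpy as np

env = gym.make("FrozenLake-v1")
qtable = np.zeros((env.observation_space.n, env.action_space.n))
state, _ = env.reset()

# explore only while every Q-value for this state is still zero
if np.all(qtable[state] == 0):
    action = env.action_space.sample()
else:
    action = np.argmax(qtable[state])
```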
<p> </p>
9. When the sequences with their episode's number they are displaced by -1. Example, if the agent reach the goal on the first episodes when he will be printed, his named will be `episode 0` and not `episode 1`. (not fix)
- explication:
9. The sequences are displayed with their episode number shifted by -1. For example, if the agent reaches the goal on the first episode, the printed header will be `episode 0` and not `episode 1`. (Fixed)
- Explanation of the fix: At the start of the loop, put `episode += 1`, and in the `print(f"Episode: {episodes +1}")` erase the `+1` (a sketch follows).
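A minimal sketch of that renumbering (the variable and episode count names are assumptions):

```
total_episodes = 3  # assumed stand-in for the project's episode count

episode = 0
for _ in range(total_episodes):
    # increment at the top of the loop so the first episode prints as 1,
    # and drop the old "+1" from the print itself
    episode += 1
    print(f"Episode: {episode}")
```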
<p> </p>
12 changes: 10 additions & 2 deletions Doc/Patch Note.md
@@ -1,5 +1,5 @@
### Version: 1.0.1 (minor update) date: 29/02/2024 at: 10:20 a.m (CET(UTC+1)):
- Half resolved the bug n°4 (Check the "Bug List.md files" to see the explication)
- Half resolved bug n°4 (check the `Bug List.md` file to see the explanation of the fix)

## Version: 1.1.0 (major update) date: 01/03/2024 at: 09:53 a.m (CET(UTC+1)):
- Implementation of the test of the Q-Table after training. For 100 episodes you can run your Q-Table to see whether the update is good or not. (A minimal test loop is sketched below.)
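A minimal sketch of such a test loop (assuming the gym >= 0.26 API and an already trained `qtable`; not the project's exact code):

```
import gym
import numpy as np

env = gym.make("FrozenLake-v1")
qtable = np.zeros((env.observation_space.n, env.action_space.n))  # stand-in for a trained table

successes = 0
for _ in range(100):
    state, _ = env.reset()
    done = False
    while not done:
        action = np.argmax(qtable[state])  # greedy only: no exploration during the test
        state, reward, terminated, truncated, _ = env.step(action)
        done = terminated or truncated
    successes += int(reward == 1.0)  # FrozenLake rewards 1 only on reaching the goal
print(f"Goal reached in {successes} of 100 test episodes")
```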
@@ -48,4 +48,12 @@
- Adding an example of Training + test results (Check the `4x4 Results.txt` file in the `4x4` folder in the `Q-Learning-Results` folder)
- Bug/Problem fixes (listed) (For more information about the bugs/problems, check the `Bug list.md` file in the `doc` folder):
- n°6
- n°8

### Version 1.2.4 (minor update) date: 12/04/2024 at: 5:00 p.m (CET(UTC+1)):
- Re-layout of the `README.md`, `Bug List.md`, `Patch Note.md` files
- Replaced the example of final results for the 4x4 map in `4x4 Results.txt` with another one + added a picture of the outcomes of the training + test in the `4x4` folder in the `Q-Learning-Results` folder
- Bug/Problem fixes (listed) (For more information about the bugs/problems, check the `Bug list.md` file in the `doc` folder):
- n°9
- Bug/Problem fix (not listed):
- `total_actions` counted all the actions of the training + test parts together. To change this, the value is reset at the start of the test part, as in the sketch below.
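A minimal sketch of that reset (assuming `total_actions` is the running counter named above):

```
total_actions = 0  # counts actions during training
# ... training episodes update total_actions here ...

# --- test part ---
total_actions = 0  # reset so the test count no longer includes training actions
```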