Skip to content

Commit

Permalink
Deploying to gh-pages from @ c5ffc33 🚀
Browse files Browse the repository at this point in the history
  • Loading branch information
ffelten committed Mar 4, 2024
1 parent 399b884 commit 999b93c
Show file tree
Hide file tree
Showing 3 changed files with 12 additions and 7 deletions.
2 changes: 1 addition & 1 deletion .buildinfo
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# Sphinx build info version 1
# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
config: 92f91e2f517770ebc017784489e6c06e
config: 479e3b211f8dbf0073d98f35c52899c9
tags: d77d1c0d9ca2f4c8421862c7c5a0d620
15 changes: 10 additions & 5 deletions environments/momultiwalker_stability/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -317,7 +317,7 @@ <h1>MO-MultiwalkerStability<a class="headerlink" href="#mo-multiwalkerstability"
<td><p>Box(-inf, inf, (31,), float32)</p></td>
</tr>
<tr class="row-odd"><td><p>Reward Space</p></td>
<td><p>Box([-2.100e+02 -1.567e-02], [0.46 0. ], (2,), float32)</p></td>
<td><p>Box([-2.100e+02 -1.567e-02], [-209.54 0. ], (2,), float32)</p></td>
</tr>
<tr class="row-even"><td><p>Import</p></td>
<td><p><code class="docutils literal notranslate"><span class="pre">momaland.envs.momultiwalker_stability_v0</span></code></p></td>
Expand All @@ -337,14 +337,19 @@ <h2>Action Space<a class="headerlink" href="#action-space" title="Link to this h
</section>
<section id="reward-space">
<h2>Reward Space<a class="headerlink" href="#reward-space" title="Link to this heading">#</a></h2>
<p>The reward space is a 2D vector where; the first value contains the sum of following rewards:</p>
<p>The reward space is a 2D vector where; the first value contains the following reward:</p>
<ul class="simple">
<li><p>Maximizing distance traveled towards the end of the level during one step. <code class="docutils literal notranslate"><span class="pre">[-0.46,</span> <span class="pre">0.46]</span></code></p></li>
<li><p>Penalty for agent falling. <code class="docutils literal notranslate"><span class="pre">[-110,</span> <span class="pre">0]</span></code></p></li>
<li><p>Penalty for the package falling. <code class="docutils literal notranslate"><span class="pre">[-100,</span> <span class="pre">0]</span></code>
and the second value contains:</p></li>
</ul>
<p>and the second value contains:</p>
<ul class="simple">
<li><p>A penalty based on the change of angle of the package, to avoid shaking the package. <code class="docutils literal notranslate"><span class="pre">[-0.01567,</span> <span class="pre">0]</span></code></p></li>
</ul>
<p>Both these objectives are penalized with:</p>
<ul class="simple">
<li><p>Penalty for agent falling. <code class="docutils literal notranslate"><span class="pre">[-110,</span> <span class="pre">0]</span></code></p></li>
<li><p>Penalty for the package falling. <code class="docutils literal notranslate"><span class="pre">[-100,</span> <span class="pre">0]</span></code></p></li>
</ul>
</section>
<section id="episode-termination">
<h2>Episode Termination<a class="headerlink" href="#episode-termination" title="Link to this heading">#</a></h2>
Expand Down
Loading

0 comments on commit 999b93c

Please sign in to comment.