Skip to content

Commit

Permalink
Deploying to gh-pages from @ e4966dd 🚀
Browse files Browse the repository at this point in the history
  • Loading branch information
rradules committed Jun 18, 2024
1 parent cefa05c commit 4b6c62a
Show file tree
Hide file tree
Showing 4 changed files with 7 additions and 6 deletions.
2 changes: 1 addition & 1 deletion .buildinfo
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# Sphinx build info version 1
# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
config: 1356913b104b7c99cad7dba39796c091
config: e381fbd463f240cdb4a063a1ddaf3f81
tags: d77d1c0d9ca2f4c8421862c7c5a0d620
8 changes: 4 additions & 4 deletions environments/mobeach/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -345,12 +345,12 @@ <h2>Action Space<a class="headerlink" href="#action-space" title="Link to this h
</section>
<section id="reward-space">
<h2>Reward Space<a class="headerlink" href="#reward-space" title="Link to this heading"></a></h2>
<p>The reward space is a 2D vector containing rewards for two different schemes (‘local’ or ‘global’) for:</p>
<p>The reward space is a 2D vector containing rewards for two different modes (‘individual’ or ‘team’) for:</p>
<ul class="simple">
<li><p>the occupation level</p></li>
<li><p>the mixture level
If the scheme is ‘local’, the reward is given for the currently occupied section.
If the scheme is ‘global’, the reward is summed over all sections.</p></li>
If the mode is ‘individual’, the reward is given for the currently occupied section.
If the mode is ‘team’, the reward is summed over all sections.</p></li>
</ul>
</section>
<section id="starting-state">
Expand All @@ -373,7 +373,7 @@ <h2>Arguments<a class="headerlink" href="#arguments" title="Link to this heading
<ul class="simple">
<li><p>‘num_timesteps (int)’: number of timesteps in the domain. Default: 1</p></li>
<li><p>‘num_agents (int)’: number of agents in the domain. Default: 100</p></li>
<li><p>reward_scheme (str)’: the reward scheme to use (‘local’, or ‘global’). Default: local</p></li>
<li><p>reward_mode (str)’: the reward mode to use (‘individual’, or ‘team’). Default: individual</p></li>
<li><p>‘sections (int)’: number of beach sections in the domain. Default: 6</p></li>
<li><p>‘capacity (int)’: capacity of each beach section. Default: 7</p></li>
<li><p>‘type_distribution (tuple)’: the distribution of agent types in the domain. Default: 2 types equally distributed (0.3, 0.7).</p></li>
Expand Down
1 change: 1 addition & 0 deletions environments/moitem_gathering/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -368,6 +368,7 @@ <h2>Arguments<a class="headerlink" href="#arguments" title="Link to this heading
<li><p>‘num_timesteps’: number of timesteps to run the environment for. Default: 10</p></li>
<li><p>‘initial_map’: map of the environment. Default: 8x8 grid, 2 agents, 3 objectives (Källström and Heintz, 2019)</p></li>
<li><p>‘randomise’: whether to randomise the map, at each episode. Default: False</p></li>
<li><p>‘reward_mode’: reward mode for the environment (‘individual’ or ‘team’). Default: ‘individual’</p></li>
<li><p>‘render_mode’: render mode for the environment. Default: None</p></li>
</ul>
</section>
Expand Down
2 changes: 1 addition & 1 deletion searchindex.js

Large diffs are not rendered by default.

0 comments on commit 4b6c62a

Please sign in to comment.