-
Notifications
You must be signed in to change notification settings - Fork 51
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
7 changed files
with
104 additions
and
2 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
.. _dom-rand: | ||
Randomization Handler | ||
----------------------------------- | ||
.. automodule:: loco_mujoco.utils.domain_randomization | ||
:members: | ||
:undoc-members: | ||
:show-inheritance: | ||
:special-members: __call__ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,3 @@ | ||
.. _env-label: | ||
Basics | ||
================================= | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,21 @@ | ||
.. _dom-rand-tutorial: | ||
|
||
Domain Randomization | ||
================================= | ||
|
||
In this tutorial, we will show how to use the domain randomization feature. This feature is useful to train a | ||
robot to be robust to changes in the environment, such as joint friction, mass, or inertia. Before starting, make sure | ||
to get familiar with the :ref:`dom-rand`, where you find a detailed documentation. | ||
|
||
Consider the following domain randomization file for the Talos robot: | ||
|
||
.. literalinclude:: ../../../examples/domain_randomization/domain_randomization_talos.yaml | ||
:language: yaml | ||
|
||
Once a configuration file is created, we can pass it to the environment and start training as usual. | ||
Here is an example of how to use the domain randomization feature with the Talos robot: | ||
|
||
.. literalinclude:: ../../../examples/domain_randomization/example_talos.py | ||
:language: python | ||
|
||
.. note:: We provide more examples in respective directory in the main LocoMuJoCo repository. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,17 @@ | ||
Reinforcement Learning | ||
================================= | ||
|
||
Even though LocoMuJoCo focuses on imitation learning, it can be also used for plain reinforcement learning. The challenge | ||
here is to define a reward function that produces the desired behavior. Here is a minimal example for defining a reinforcement | ||
learning example: | ||
|
||
.. note:: This is for didactic purposes only! It will not produce any useful gait. | ||
|
||
.. literalinclude:: ../../../examples/reinforcement_learning/example_unitree_h1.py | ||
:language: python | ||
|
||
Right now, LocoMuJoCo only supports Markovian reward functions (i.e., functions only depending on the current | ||
state transition). We are thinking about providing support for non-Markovian reward functions as well by providing access | ||
to the environment in the reward function. Open an issue or drop me a message if you think this is something | ||
we should really do! | ||
|