Tutorial for how to use the jacobian of mjx.step #1601

Andrew-Luo1 · 2024-04-17T10:22:01Z

A notebook showing how vanilla policy gradients, computed by differentiating through mjx.step, can accelerate policy learning thanks to MJX's differentiability. It demonstrates imitation learning and locomotion on a quadruped. Requires Brax PR 476.
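As a rough illustration of the idea (not the notebook's code), the sketch below backpropagates a rollout loss through a differentiable step function with JAX. Everything here is an assumption made for illustration: `toy_step` is a hypothetical 1-D point mass standing in for mjx.step, and the linear policy, cost, horizon, and learning rate are made up.

```python
import jax
import jax.numpy as jnp


def toy_step(state, action, dt=0.05):
    """Hypothetical stand-in for mjx.step: a 1-D point mass (semi-implicit Euler)."""
    pos, vel = state[0], state[1]
    vel = vel + dt * action
    pos = pos + dt * vel
    return jnp.stack([pos, vel])


def rollout_loss(params, state0, target=1.0, horizon=20):
    """Mean squared distance to `target` over a short rollout."""

    def body(state, _):
        # Linear feedback policy on (position error, velocity).
        obs = jnp.stack([target - state[0], state[1]])
        action = jnp.dot(params, obs)
        next_state = toy_step(state, action)
        cost = (next_state[0] - target) ** 2
        return next_state, cost

    _, costs = jax.lax.scan(body, state0, None, length=horizon)
    return jnp.mean(costs)


# Gradient of the rollout loss w.r.t. the policy parameters, obtained by
# differentiating through every simulated step.
value_and_grad = jax.jit(jax.value_and_grad(rollout_loss))

params = jnp.zeros(2)
state0 = jnp.zeros(2)
initial_loss, grads = value_and_grad(params, state0)

learning_rate = 0.5  # illustrative value, not tuned
for _ in range(30):
    _, grads = value_and_grad(params, state0)
    params = params - learning_rate * grads

final_loss, _ = value_and_grad(params, state0)
print("initial loss:", float(initial_loss), "final loss:", float(final_loss))
```

With a real MJX model, the toy step would be replaced by the simulator step, and the gradient would flow through the physics in the same way.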

Supporting changes

Add an Anymal C model, cleaned up for faster simulation in MJX, similar to the MJX Barkour model
Add an image for the notebook

Andrew-Luo1 · 2024-04-17T12:29:03Z

Just as a heads up:

Right now I hard-code the XML path rather than using epath as in the other MJX tutorial; I could not get epath working locally.
For the PPO benchmarks, I ran into NaN rewards, which disappeared when I turned on the debug NaNs flag.
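For readers who want to try the same check, here is a minimal sketch of enabling NaN debugging. It assumes the flag mentioned above is JAX's `jax_debug_nans` config option (the comment does not name it exactly); `reward` is a hypothetical function that deliberately produces a NaN.

```python
import jax
import jax.numpy as jnp

# Assumption: the "debug nans flag" is JAX's jax_debug_nans option.
jax.config.update("jax_debug_nans", True)


def reward(x):
    # Hypothetical reward term: log of a negative number yields NaN.
    return jnp.log(x)


caught = False
try:
    jax.jit(reward)(jnp.float32(-1.0))
except FloatingPointError:
    # With the flag on, JAX raises as soon as a NaN is produced,
    # which helps locate the offending operation.
    caught = True
finally:
    # Restore the default so later code runs without the extra checks.
    jax.config.update("jax_debug_nans", False)

print("NaN caught:", caught)
```

The flag adds overhead, so it is usually switched off again once the source of the NaN is found.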

erikfrey · 2024-04-18T22:09:23Z

Exciting! How many changes did you make to anymal? Can you try this:

Make a PR to mujoco menagerie with anymal_c_mjx.xml and scene_mjx.xml with your anymal modifications
Somewhere at the top of your colab, do:

!git clone https://github.com/google-deepmind/mujoco_menagerie.git

Then you should be able to remove the anymal assets from this PR. Let me know if that works!
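A possible way to load the model after cloning, sketched under assumptions: the directory name `anybotics_anymal_c` and the working-directory layout are not stated in this thread, and `load_anymal_for_mjx` is a hypothetical helper. It uses a plain `pathlib` path rather than epath.

```python
from pathlib import Path

# Assumed layout after running `git clone` in the working directory.
MENAGERIE_ROOT = Path("mujoco_menagerie")
# Assumed model directory; scene_mjx.xml is the file name mentioned above.
ANYMAL_SCENE = MENAGERIE_ROOT / "anybotics_anymal_c" / "scene_mjx.xml"


def load_anymal_for_mjx(scene_path=ANYMAL_SCENE):
    """Hypothetical helper: load the scene and copy the model to the device."""
    scene_path = Path(scene_path)
    if not scene_path.exists():
        raise FileNotFoundError(
            f"{scene_path} not found; clone mujoco_menagerie first"
        )
    # Imported lazily so the path check works even without MuJoCo installed.
    import mujoco
    from mujoco import mjx

    mj_model = mujoco.MjModel.from_xml_path(str(scene_path))
    mjx_model = mjx.put_model(mj_model)
    return mj_model, mjx_model
```

In a Colab cell this would be called as `mj_model, mjx_model = load_anymal_for_mjx()` after the clone command has run.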

Andrew-Luo1 · 2024-04-19T04:55:30Z

Hi Erik, I've proposed Menagerie PR 50 with the Anymal C updates, and have documented the modifications in the readme. I've updated this notebook as well.

The formatting of this notebook isn't quite the same as the current mujoco notebooks. In the case that this PR goes through, of course feel free to modify it as you see fit: adding the Mujoco banner, adding a colab instance, etc!

And let me know if there's anything I can do to bring the notebook more up to standards.

erikfrey

OK, I see your change is incorporated into menagerie. Looking great! One last nit and we will merge.

erikfrey · 2024-04-22T23:15:41Z

mjx/pyproject.toml

@@ -44,4 +44,4 @@ mjx-viewer = "mujoco.mjx.viewer:main"
 Homepage = "https://github.com/google-deepmind/mujoco/tree/main/mjx"
 Documentation = "https://mujoco.readthedocs.io/en/3.1.5"
 Repository = "https://github.com/google-deepmind/mujoco/tree/main/mjx"
-Changelog = "https://mujoco.readthedocs.io/en/3.1.5/changelog.html"
+Changelog = "https://mujoco.readthedocs.io/en/3.1.5/changelog.html"


I suspect this is an unneeded change - can you revert? Otherwise, let me know what this is for.

erikfrey

Wonderful, thank you!

Andrew-Luo1 added 12 commits April 15, 2024 08:12

add diagram for apg notebook

df26240

add anymal_c model foor mjx demo

27cddcb

all demos in tutorial work locally

f19e7be

slightly stabler quadruped training

daaf81f

upload videos

7db18c4

benchmark against ppo

7a7c33a

Merge branch 'google-deepmind:main' into main

4ae47dd

update text

9b22745

Merge branch 'main' of https://github.com/Andrew-Luo1/mujoco_apg_tuto…

ad86492

…rial into main

revert pyproject

8043e95

Merge branch 'google-deepmind:main' into main

b2a8ff9

fix typos

bfad82d

erikfrey self-assigned this Apr 17, 2024

Andrew-Luo1 added 2 commits April 18, 2024 04:50

refine text

568e1f2

change default option on brax apg

5d2986f

erikfrey pushed a commit to google/brax that referenced this pull request Apr 18, 2024

Updated basic APG algorithm (#476)

b45760c

Updates APG to learn useful policies, see: google-deepmind/mujoco#1601

Andrew-Luo1 added 2 commits April 19, 2024 06:30

git clone mujoco menagerie rather than import anymal_c locally

3e12452

update comments

ab7d8fe

Andrew-Luo1 added 3 commits April 21, 2024 17:15

Merge branch 'google-deepmind:main' into main

f3a9d77

further clean up text, add comments, remove irrelevant observations i…

eb6624b

…n get_obs, regenerate apg outputs

comments

001d2df

erikfrey requested changes Apr 22, 2024

Andrew-Luo1 added 5 commits April 23, 2024 06:12

Merge branch 'google-deepmind:main' into main

e6b95f3

revert pyproject

a1d5371

update apg diagram

938a62f

clean up text

a8aa15a

add a reference

9530aa1

Merge branch 'main' of https://github.com/Andrew-Luo1/mujoco_apg_tuto…

554771f

…rial into main

erikfrey approved these changes Apr 24, 2024

copybara-service bot merged commit 68e33f4 into google-deepmind:main Apr 24, 2024
1 of 2 checks passed
