-
Notifications
You must be signed in to change notification settings - Fork 825
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tutorial for how to use the jacobian of mjx.step #1601
Conversation
Just as a heads up:
|
Updates APG to learn useful policies, see: google-deepmind/mujoco#1601
Exciting! How many changes did you make to anymal? Can you try this:
!git clone https://github.com/google-deepmind/mujoco_menagerie.git Then you should be able to remove the anymal assets from this PR. Let me know if that works! |
Hi Erik, I've proposed Menagerie PR 50 with the Anymal C updates, and have documented the modifications in the readme. I've updated this notebook as well. The formatting of this notebook isn't quite the same as the current mujoco notebooks. In the case that this PR goes through, of course feel free to modify it as you see fit: adding the Mujoco banner, adding a colab instance, etc! And let me know if there's anything I can do to bring the notebook more up to standards. |
…n get_obs, regenerate apg outputs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OK, I see your change is incorporated into menagerie. Looking great! One last nit and we will merge.
mjx/pyproject.toml
Outdated
@@ -44,4 +44,4 @@ mjx-viewer = "mujoco.mjx.viewer:main" | |||
Homepage = "https://github.com/google-deepmind/mujoco/tree/main/mjx" | |||
Documentation = "https://mujoco.readthedocs.io/en/3.1.5" | |||
Repository = "https://github.com/google-deepmind/mujoco/tree/main/mjx" | |||
Changelog = "https://mujoco.readthedocs.io/en/3.1.5/changelog.html" | |||
Changelog = "https://mujoco.readthedocs.io/en/3.1.5/changelog.html" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suspect this is an unneeded change - can you revert? Otherwise, let me know what this is for.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wonderful, thank you!
68e33f4
into
google-deepmind:main
A notebook showing how to use vanilla policy gradients to accelerate policy learning, thanks to MJX's differentiability. Demonstrations are given for imitation learning and locomotion, for a quadruped. Requires Brax PR 476.
Supporting changes