-
Notifications
You must be signed in to change notification settings - Fork 338
Pull requests: pytorch/rl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Lint] pyupgrade
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
formatting
Code formatting changes
#2819
opened Feb 28, 2025 by
vmoens
Loading…
[Feature] LLMEnv and DataLoadingPrimer
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
#2818
opened Feb 28, 2025 by
vmoens
Loading…
[BugFix] Fix batch_locked check in check_env_specs + error message callable
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2817
opened Feb 28, 2025 by
vmoens
Loading…
[Feature] NonTensor batched arg
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
#2816
opened Feb 28, 2025 by
vmoens
Loading…
[BugFix] Fix env.full_done_spec~s~
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2815
opened Feb 28, 2025 by
vmoens
Loading…
[DEBUG] ppo compile
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2814
opened Feb 27, 2025 by
IvanKobzarev
Loading…
10 tasks
[Feature,Deprecation] Split KLRewardTransform in more modules
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2813
opened Feb 27, 2025 by
vmoens
Loading…
[DRAFT, Example] Add MCTS example
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Examples
#2796
opened Feb 19, 2025 by
kurtamohler
•
Draft
[DRAFT] ppo chess with llm and ConditionalPolicySwitch to sunfish bot
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2763
opened Feb 5, 2025 by
mikaylagawarecki
•
Draft
[Feature] TensorDictPrimer with single default_value callable
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
#2732
opened Jan 30, 2025 by
vmoens
Loading…
[Feature] ConditionalPolicySwitch transform
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
#2711
opened Jan 21, 2025 by
vmoens
Loading…
[Example] Self-play chess PPO example
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Examples
#2709
opened Jan 21, 2025 by
vmoens
Loading…
[WIP] Compute lp during loss execution
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2688
opened Jan 10, 2025 by
vmoens
Loading…
[CI] Fix conda on windows
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2676
opened Dec 20, 2024 by
vmoens
Loading…
10 tasks
[Tutorial] MCTS
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2673
opened Dec 19, 2024 by
vmoens
Loading…
First draft for modular Hindsight Experience Replay Transform
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
[Tutorial] Beam search with GPT models
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
tutorials
#2623
opened Dec 2, 2024 by
vmoens
Loading…
[Feature] PPOTrainer
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2550
opened Nov 11, 2024 by
vmoens
Loading…
[Feature] habitat env from config
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
#2539
opened Nov 6, 2024 by
vmoens
Loading…
10 tasks
[CI] Fix windows upload wheels
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2507
opened Oct 21, 2024 by
vmoens
Loading…
[Feature] Gymnasium 1.0 compatibility
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Environments
Adds or modifies an environment wrapper
#2473
opened Oct 9, 2024 by
vmoens
Loading…
[Examples] boiler plate code for multi-turn reward for RLHF
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
#2467
opened Oct 5, 2024 by
rghosh08
Loading…
3 of 10 tasks
[Algorithm] Update scripts with compile
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2449
opened Sep 23, 2024 by
vmoens
Loading…
[Feature] RB compability with compile
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
#2426
opened Sep 9, 2024 by
vmoens
Loading…
[CI] Add benchmarks to test runs
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2410
opened Sep 2, 2024 by
vmoens
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.