ReplayBuffer size with on-policy algorithm #1161
-
Hi, |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
@jvasso essentially yes, it should not be smaller. If it's larger that's not a problem, since the buffer will be reset before the next learning step. This structure (of having to deal with the buffer size at all for an on-policy algo) is one of Tianshou's main technical debts, and we're planning to address this with a major refactoring that will be part of Tianshou 2.0.0 |
Beta Was this translation helpful? Give feedback.
@jvasso essentially yes, it should not be smaller. If it's larger that's not a problem, since the buffer will be reset before the next learning step.
This structure (of having to deal with the buffer size at all for an on-policy algo) is one of Tianshou's main technical debts, and we're planning to address this with a major refactoring that will be part of Tianshou 2.0.0