Skip to content

TD3 basis implemented.

Latest
Compare
Choose a tag to compare
@CUN-bjy CUN-bjy released this 17 Jan 10:10

What is differences from DDPG

  1. Overestimation Bias Problem Solved

    • double Q-network
    • clipped double q-update
  2. Addressing Variance

    • delayed policy update
    • target policy smoothing