[Slides]
ANSANGHO _ FAULT-TOLERANT CONTROLOFA QUADRUPED ROBOT.pdf
1. OverviewDeep Deterministic Policy Gradient (DDPG) is a model-free, off-policy reinforcement learning algorithm based on the Actor-Critic architecture. It combines the success of Deep Q-Networks (DQN) with the ability to handle continuous action spaces, making it a standard choice for high-dimensional robotic control tasks.
2. Why DDPG for Quadruped Robots? The primary reason for selecting DDPG for this project is its capability to handle Continuous Action Spaces.
3. Key Architecture: Actor-Critic DDPG employs two neural networks that work in tandem: