reinforcement-learning 12
- [Unitree Go2 part 5] Sim2Real 성공
- [Unitree Go2 part 4] Feed-forward Torque 실험
- [Unitree Go2 part 3] Reward 수정과 Real Gap
- [Unitree Go2 part 2] 발을 떼지 않는 문제 분석
- [Unitree Go2 part 1] Sim2Real 첫 도전
- [IsaacLab Part 3] 강화학습으로 Go2 걷게하기
- 6. Policy Gradient DRL
- 5. DRL (Deep Reinforcement Learning)
- 4. RL (Reinforcement Learning)
- 3. DP (Dynamic Programming)
- 2. Bellman Equation
- 1. MDP (Markov Decision Process)