RL - stereoboy/Study GitHub Wiki
https://github.com/stereoboy/deep-reinforcement-learning
03 Policy-BasedContinuous Control
Project: Continuous Control
- PPO: Proximal Policy Optimization Algorithms
- A3C: Asynchronous Methods for Deep Reinforcement Learning
- D4PG: DISTRIBUTED DISTRIBUTIONAL DETERMINISTIC POLICY GRADIENTS
06 Benchmark Implementation
-
Benchmarking Deep Reinforcement Learning for Continuous Control
-
Blog Article: Proximal Policy Optimization
-
Deep Reinforcement Learning Doesn't Work Yet
04 Multi-Agent Reinforcement Learning
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Keywords
- Markov Games
- Cooperation, Competition, Mixed Environments
03 Case Study: AlphaZero
Mastering the Game of Go without Human Knowledge
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Project
https://github.com/stereoboy/deep-reinforcement-learning/tree/master/p3_collab-compet