Master Reinforcement Learning through mathematics, theory, implementations, visualizations, research papers, and real-world projects.
This repository is currently under active development.
Reinforcement Learning (RL) is the field of Artificial Intelligence that enables agents to learn optimal behavior through interaction with an environment.
From game-playing agents to autonomous robots and self-driving systems, RL powers some of the most advanced intelligent systems ever built.
MLVerse Reinforcement Learning aims to provide a complete open-source learning ecosystem covering everything from foundational concepts to state-of-the-art research.
- Agents and Environments
- States and Actions
- Rewards
- Policies
- Value Functions
- Exploration vs Exploitation
- Markov Property
- State Transition Dynamics
- Bellman Equations
- Policy Evaluation
- Policy Improvement
- Dynamic Programming
- Monte Carlo Methods
- Temporal Difference Learning
- SARSA
- Q-Learning
- Deep Q Networks (DQN)
- Double DQN
- Dueling DQN
- Rainbow DQN
- REINFORCE
- Actor-Critic
- A2C
- A3C
- PPO
- TRPO
- DDPG
- TD3
- SAC
- Cooperative Systems
- Competitive Systems
- Swarm Intelligence
- Distributed RL
- Robot Navigation
- Path Planning
- Autonomous Vehicles
- Industrial Robotics
reinforcement-learning
โ
โโโ fundamentals
โโโ markov-decision-processes
โโโ dynamic-programming
โโโ monte-carlo-methods
โโโ temporal-difference-learning
โโโ q-learning
โโโ sarsa
โโโ deep-q-networks
โโโ policy-gradient-methods
โโโ actor-critic-methods
โโโ ppo
โโโ a2c
โโโ a3c
โโโ sac
โโโ td3
โโโ multi-agent-rl
โโโ robotics
โโโ projects
โโโ research-papers
โโโ resources
Topics include:
- Probability Theory
- Statistics
- Linear Algebra
- Calculus
- Optimization
- Markov Chains
- Bellman Equations
- Dynamic Programming
Every topic will follow the MLVerse standard:
Topic
โ
โโโ README.md
โโโ Theory.md
โโโ Mathematics.md
โโโ Python-Implementation.ipynb
โโโ Visualization.ipynb
โโโ Applications-in-AI.md
โโโ Interview-Questions.md
โโโ Research-Papers.md
โโโ References.md
Build real-world Reinforcement Learning applications:
- CartPole Agent
- MountainCar Agent
- Lunar Lander
- Autonomous Navigation
- Stock Trading Agents
- Game Playing Agents
- Multi-Agent Systems
- Robotics Simulations
This repository will include:
- Landmark RL Papers
- Paper Reproductions
- Benchmark Studies
- OpenAI Gym Projects
- DeepMind Research Implementations
- Multi-Agent Research
- RL Fundamentals
- MDPs
- Dynamic Programming
- Monte Carlo Methods
- Temporal Difference Learning
- Q-Learning
- SARSA
- DQN
- PPO
- A2C
- A3C
- SAC
- TD3
- Multi-Agent RL
- Robotics Projects
- Research Paper Implementations
To build one of the world's most comprehensive open-source Reinforcement Learning ecosystems for learners, engineers, researchers, and innovators.
Shivam Singh
Founder of MLVerse-Math
Building the future of open-source AI education, research, and engineering.