Skip to content

AI University — Body of Knowledge

Reinforcement Learning

Reinforcement Learning¶

Learning to act by maximizing cumulative reward through interaction with an environment.

Reinforcement Learning is one of the core areas in the AI University map of AI. Explore the diagram, then dive into each topic — every subtopic grows into its own deep-dive over time.

flowchart LR
  A([Agent]) -- action --> E([Environment])
  E -- state + reward --> A

Key topics¶

Markov decision processes

The formal framework: states, actions, rewards, transitions, and policies.
Value & policy methods

Q-learning, policy gradients, actor-critic, and when to use each.
Deep RL

Combining RL with deep networks (DQN, PPO) for high-dimensional problems.
RLHF

Reinforcement learning from human feedback — how LLMs are aligned to preferences.
Multi-agent RL

Many agents learning together — cooperation, competition, and emergent behaviour.

Machine Learning · AI Agents & Autonomy · Robotics & Embodied AI

Learn this properly

Want hands-on training in reinforcement learning? Explore AI University courses and AI School camps for kids.