Terminology

Important Terms used in Reinforcement Learning:

Agent
Agent is the "decision-maker" while solving a sequential decision making problem. It can also be called "learner" i.e. the one who learns to make sequential decisions.
Environment
The world with which the agent interacts.
State (s)
The current situation of the agent/environment.
Action (a)
The move the agent chooses.
Reward (r)
Feedback signal for the agent’s action.
Policy (\pi)
The strategy the agent uses to decide actions.
Trajectory
A sequence of states, actions, and rewards.

Interaction Cycle

Loop: Observe state \(s_t\) Choose action (a_t) based on policy π(s_t) Receive reward (r_t) and next state (s_{t+1}) from the environment Update policy to improve future decisions

Important Terms used in Reinforcement Learning:

Interaction Cycle

MDP