Terminology
Important Terms used in Reinforcement Learning:
- Agent
Agent is the "decision-maker" while solving a sequential decision making problem. It can also be called "learner" i.e. the one who learns to make sequential decisions.
- Environment
The world with which the agent interacts.
- State (s)
The current situation of the agent/environment.
- Action (a)
The move the agent chooses.
- Reward (r)
Feedback signal for the agent’s action.
- Policy (\pi)
The strategy the agent uses to decide actions.
- Trajectory
A sequence of states, actions, and rewards.
Interaction Cycle
Loop: Observe state \(s_t\) Choose action (a_t) based on policy π(s_t) Receive reward (r_t) and next state (s_{t+1}) from the environment Update policy to improve future decisions