Basic Concepts
Basic Concepts
- State
describes the agent's status with respect to the environment. In the grid world example, the state corresponds to the agent's location.
- Action
- State transtion
- Policy
A policy tells the agent which actions to take at every state
- Reward
Reward is one of the most unique concepts in reinforcement learning
- Trajectories
A trajectory is a state-action-reward chain. For example, given the policy shown in Fingure 1.6(a), if the agent can move along a trajectory as follows:
s1r=0a2s2r=0a3s5r=0a3s8r=1a2s9.
- Returns
- episodes