Basic Concepts
# Basic Concepts
- State
describes the agent's status with respect to the environment. In the grid world example, the state corresponds to the agent's location.
- Action
- State transtion
- Policy
A policy tells the agent which actions to take at every state
- Reward
Reward is one of the most unique concepts in reinforcement learning
- Trajectories
A trajectory is a state-action-reward chain. For example, given the policy shown in Fingure 1.6(a), if the agent can move along a trajectory as follows:
- Returns
- episodes
更新时间: 6/10/2024, 8:25:06 PM