Quiz: State-Value Functions
In this quiz, you will calculate the value function corresponding to a particular policy.
Each of the nine states in the MDP is labeled as one of $\mathcal{S}^+ = \{s_1, s_2, \ldots, s_9 \}$, where $s_9$ is a terminal state.
Consider the (deterministic) policy that is indicated (in orange) in the figure below.





댓글을 달려면 로그인해야 합니다.