1-7-7. Quiz: State-Value Functions

Quiz: State-Value Functions

In this quiz, you will calculate the value function corresponding to a particular policy.

Each of the nine states in the MDP is labeled as one of $\mathcal{S}^+ = \{s_1, s_2, \ldots, s_9 \}$, where $s_9$​ is a terminal state.

Consider the (deterministic) policy that is indicated (in orange) in the figure below.

이 사이트는 스팸을 줄이는 아키스밋을 사용합니다. 댓글이 어떻게 처리되는지 알아보십시오.

%d 블로거가 이것을 좋아합니다: