Research Article

Kernel Temporal Differences for Neural Decoding

Figure 6

A -state Markov chain. In states from to , each state transition has probability , and state has transition probability to the absorbing state . Note that optimal state value functions can be represented as a nonlinear function of the states, and corresponding reward values are assigned to each state.
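
The chain structure described in the caption can be illustrated with a minimal Python sketch. Everything numeric here is an assumption rather than a value taken from the figure: the chain length `N_STATES`, the probability `P_STEP`, and the per-step reward of -1 are hypothetical placeholders, and the choice that each non-terminal state moves one or two states toward the absorbing state is borrowed from common chain benchmarks, not stated in the caption. State 0 is taken as the absorbing state.

```python
def transition_matrix(n_states, p):
    """Row-stochastic transition matrix for a chain whose state 0 is absorbing."""
    P = [[0.0] * n_states for _ in range(n_states)]
    P[0][0] = 1.0  # absorbing state: stays in place forever
    P[1][0] = 1.0  # state 1 always moves to the absorbing state
    for s in range(2, n_states):
        P[s][s - 1] = p        # assumed: one step toward the absorbing state
        P[s][s - 2] = 1.0 - p  # assumed: two steps toward the absorbing state
    return P


def state_values(P, rewards):
    """Undiscounted values V(s) = r(s) + sum_t P[s][t] * V(t), with V(0) = 0."""
    n = len(P)
    V = [0.0] * n
    # Every transition leads to a lower-indexed state, so a single sweep
    # in increasing index order solves the Bellman equations exactly.
    for s in range(1, n):
        V[s] = rewards[s] + sum(P[s][t] * V[t] for t in range(s))
    return V


# Hypothetical parameters, chosen only for illustration.
N_STATES, P_STEP = 13, 0.5
P = transition_matrix(N_STATES, P_STEP)
rewards = [0.0] + [-1.0] * (N_STATES - 1)  # assumed cost of -1 per transition
V = state_values(P, rewards)
```

With these placeholder rewards, `V[s]` is the negative of the expected number of transitions needed to reach the absorbing state from state `s`. Substituting a different reward vector changes the shape of the value function without changing the transition structure.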