Extended Data Fig. 1: Mechanism of distributional TD. | Nature

Extended Data Fig. 1: Mechanism of distributional TD.

From: A distributional code for value in dopamine-based reinforcement learning

Extended Data Fig. 1

a, The degree of asymmetry in positive to negative scale determines the equilibrium where positive and negative errors balance. Equal scaling equilibrates at the mean, whereas a larger positive (negative) scaling produces an equilibrium above (below) the mean. b, Distributional prediction emerges through experience. Quantile (sign function) version is displayed here for clarity. Model is trained on arbitrary task with trimodal reward distribution. c, Same as b, viewed in terms of cumulative distribution (left) or learned value for each predictor (quantile function) (right).

Back to article page