2014年2月23日日曜日

Dopamine prediction error responses integrate subjective value from different reward dimensions

Armin Lak, William R. Stauffer, and Wolfram Schultz
PNAS vol. 111 no. 6 2343–2348, doi: 10.1073/pnas.1321596111

ドーパミン神経細胞の活動は「様々な種類の刺激/報酬(色々な種類のジュース、不確実性を含んだ選択肢など)」の主観的価値に関する予測誤差をコードしている。

Prediction error signals enable us to learn through experience. These experiences include economic choices between different rewards that vary along multiple dimensions. Therefore, an ideal way to reinforce economic choice is to encode a prediction error that reflects the subjective value integrated across these reward dimensions. Previous studies demonstrated that dopamine prediction error responses reflect the value of singular reward attributes that include magnitude, probability, and delay. Obviously, preferences between rewards that vary along one dimension are completely determined by the manipulated variable. However, it is unknown whether dopamine prediction error responses reflect the subjective value integrated from different reward dimensions. Here, we measured the preferences between rewards that varied along multiple dimensions, and as such could not be ranked according to objective metrics. Monkeys chose between rewards that differed in amount, risk, and type. Because their choices were complete and transitive, the monkeys chose “as if” they integrated different rewards and attributes into a common scale of value. The prediction error responses of single dopamine neurons reflected the integrated subjective value inferred from the choices, rather than the singular reward attributes. Specifically, amount, risk, and reward type modulated dopamine responses exactly to the extent that they influenced economic choices, even when rewards were vastly different, such as liquid and food. This prediction error response could provide a direct updating signal for economic values.

0 件のコメント:

コメントを投稿