The primary value learned value (PVLV) model is a possible explanation for the reward-predictive firing properties of dopamine (DA) neurons. It simulates behavioral and neural data on Pavlovian conditioning and the midbrain dopaminergic neurons that fire in proportion to unexpected rewards. It is an alternative to the temporal-differences (TD) algorithm.
It is used as part of Leabra.
- O'Reilly, R.C.; Frank, M.J.; Hazy, T.E. & Watz, B. (2007). "PVLV: The Primary Value and Learned Value Pavlovian Learning Algorithm". Behavioral Neuroscience. 121 (1): 31–4. PMID 17324049. doi:10.1037/0735-7044.121.1.31.
- "Leabra PBWM". CCNLab.
|This neuroscience article is a stub. You can help Wikipedia by expanding it.|