Jump to content

PVLV

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Omnipaedista (talk | contribs) at 01:34, 30 April 2014 (per MOS:BOLDSYN). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The primary value learned value (PVLV) model is a possible explanation for the reward-predictive firing properties of dopamine (DA) neurons.[1] It simulates behavioral and neural data on Pavlovian conditioning and the midbrain dopaminergic neurons that fire in proportion to unexpected rewards. It is an alternative to the temporal-differences (TD) algorithm.[2]

It is used as part of Leabra.

References