Conditioning
Bear with me. Bare with me. Beer with me. Stay focused.
Learning
Typically this subsides as this is learned.
A. Two-process learning (Rescorla-Solomon 67)
   fast: fear and arousal
   slow: adaptive behavioral responses
B. Three-process learning A
Classical conditioning: delivery of the reinforcer is contingent on the occurrence of a stimulus (the CS).
Operant conditioning: delivery of the reinforcer is contingent on the occurrence of a designated response.
Classical and Operant
Classical conditioning (CC) predicts that the animal will produce the UR/CR while performing
the desired action, but it does not explain why the animal learns to
select the action.
Behavioral discrepancy is the change in an ongoing
behavior produced by the eliciting stimulus.
Example: presentation of food produces salivation, which
would not otherwise occur.
Whenever a behavioral discrepancy occurs, an environment-behavior relation is selected that consists -- other things being equal -- of all those stimuli occurring immediately before the discrepancy and all those responses occurring immediately before and at the same time as the elicited response.
Under this principle there is no difference between
Classical and Operant conditioning as far as learning goes.
S1 and S2 processing should happen at roughly the same time, so almost all models suggest a multiplicative relationship between the levels of S1 and S2.
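A minimal sketch of what a multiplicative relationship implies, assuming a simple Hebbian-style rule in which the change in association is proportional to the product of the two stimulus levels. The function name, the `rate` parameter, and the numeric values are illustrative assumptions, not taken from any particular model in these slides.

```python
def multiplicative_update(weight, s1, s2, rate=0.1):
    """Return the association after one step: weight + rate * s1 * s2.

    `rate` is a hypothetical learning-rate parameter.
    """
    return weight + rate * s1 * s2


# Both stimuli active at the same time: the association grows.
w_both = multiplicative_update(0.0, s1=1.0, s2=1.0)

# S2 absent: the product is zero, so nothing is learned.
w_s1_only = multiplicative_update(0.0, s1=1.0, s2=0.0)
```

Because the update is a product, learning occurs only when both stimuli are active together, which is why temporal overlap matters.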
association strength update
sum of all the associative strengths at a given time
Sutton-Barto model
discounted prediction of the future reward (V for predicted values of S)
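A discounted prediction of future reward can be sketched with a tabular temporal-difference, TD(0), update. This is an illustration under assumptions, not the exact formulation on the slide: the state names, the learning rate `alpha`, and the discount factor `gamma` are all hypothetical.

```python
def td_update(values, state, next_state, reward, alpha=0.1, gamma=0.9):
    """Apply one TD(0) step in place and return the prediction error.

    The target is the immediate reward plus the discounted value
    predicted for the next state.
    """
    target = reward + gamma * values[next_state]
    delta = target - values[state]
    values[state] += alpha * delta
    return delta


# Hypothetical trial: CS -> delay -> reward delivered, then the trial ends.
values = {"cs": 0.0, "delay": 0.0, "end": 0.0}
for _ in range(500):
    td_update(values, "cs", "delay", reward=0.0)
    td_update(values, "delay", "end", reward=1.0)
```

With these assumed settings, the value of the state just before reward approaches 1, and the value of the CS approaches `gamma` times that, so earlier states carry a discounted prediction.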
This results in exactly the RW model with ML.
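For reference, the RW (Rescorla-Wagner) update itself can be sketched as follows. This is a simplified version under stated assumptions: a single learning rate `alpha` stands in for the separate CS and US rate parameters, and the stimulus names and trial counts are hypothetical. The error term is the reward minus the sum of the associative strengths of all stimuli present.

```python
def rescorla_wagner_trial(strengths, present, reward, alpha=0.2):
    """Update, in place, the strengths of the stimuli present on one trial."""
    total = sum(strengths[s] for s in present)  # summed prediction
    error = reward - total                      # reward minus summed prediction
    for s in present:
        strengths[s] += alpha * error           # same error shared by all present
    return error


# Blocking: A is trained alone, then A and B are trained in compound.
v = {"A": 0.0, "B": 0.0}
for _ in range(100):
    rescorla_wagner_trial(v, ["A"], reward=1.0)
for _ in range(100):
    rescorla_wagner_trial(v, ["A", "B"], reward=1.0)
```

Because A already predicts the reward by the compound phase, the shared error is close to zero and B acquires almost no strength.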
This is EM. Similar to comparator models of conditioning
(whatever they are). Has problems with inhibitory conditioning.
Dayan & Long’s model. Models the conditioning phenomena.
Does not consider associability (eligibility in SB) and attention.
No distinction between preparatory and consummatory conditioning
Warning: a personal opinion!
rates of reinforcement
cumulative number of reinforcements in the presence of Sn
cumulative duration of the conjunction of S1 and Sn
cumulative duration of Sn
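A sketch of how these quantities could combine, for two stimuli only. The raw rate for a stimulus is its cumulative number of reinforcements divided by its cumulative duration. The partition step is an assumption not stated on the slide: it supposes that the rates credited to each stimulus add while the stimuli overlap. Function and variable names are hypothetical.

```python
def raw_rate(n_reinforcements, duration):
    """Reinforcements per unit time while the stimulus was present."""
    return n_reinforcements / duration


def partition_rates(n1, t1, nn, tn, t1n):
    """Split raw rates between S1 and Sn, assuming rates add during overlap.

    Solves the two-equation system
        n1 / t1 = r1 + (t1n / t1) * rn
        nn / tn = (t1n / tn) * r1 + rn
    where t1n is the cumulative duration of the conjunction of S1 and Sn.
    """
    b = t1n / t1
    c = t1n / tn
    y1 = raw_rate(n1, t1)
    yn = raw_rate(nn, tn)
    det = 1.0 - b * c
    if abs(det) < 1e-12:
        raise ValueError("stimuli always co-occur; rates cannot be separated")
    r1 = (y1 - b * yn) / det
    rn = (yn - c * y1) / det
    return r1, rn


# Hypothetical session: background Sn on for 1000 s, S1 on for 100 s of that,
# and all 10 reinforcements delivered while S1 was present.
r_s1, r_sn = partition_rates(n1=10, t1=100.0, nn=10, tn=1000.0, t1n=100.0)
```

In this hypothetical session the whole rate is credited to S1 and essentially none to the background, because reinforcement never occurs when S1 is absent.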