1 / 24

Jay McClelland

Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing. Jay McClelland Department of Psychology and Center for Mind Brain and Computation Stanford University.

turnerdavid
Download Presentation

Jay McClelland

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Dynamical Models of Decision MakingOptimality, human performance, andprinciples of neural information processing Jay McClelland Department of Psychology and Center for Mind Brain and ComputationStanford University

  2. Is the rectangle longer toward the northwest or longer toward the northeast?

  3. Longer toward the Northeast! 1.99” 2.00”

  4. An Abstract Statistical Theoryof the time course of decision makingwith uncertain information • How do you decide if an urn contains more black balls or white balls? • We assume you can only draw balls one at a time and want to stop as soon as you have enough evidence to achieve a desired level of accuracy. • Optimal policy: • Reach in and grab a ball • Keep track of difference between the # of black balls and # of white balls. • Respond when the difference reaches a criterion value C. • Accuracy can be improved by increasing C but this slows down the decision. • Policy produces fastest decisions for specified level of accuracy at a given level of discriminability, (b-w)/(b+w)

  5. The Drift Diffusion Model • Continuous version of the SPRT • At each time step a small random step is taken. • Mean direction of steps is +m for one direction, –m for the other. • When criterion is reached, respond. • Alternatively, in ‘time controlled’ tasks, respond when signal is given.

  6. Two Problems with the DDM Easy • Accuracy should gradually improve toward ceiling levels, even for very hard discriminations, but this is not what is observed in human data. • The model predicts correct and incorrect RT’s will have the same distribution, but incorrect RT’s are generally slower than correct RT’s. Prob. Correct Hard Errors Correct Responses RT Hard -> Easy

  7. The search for emergent regularity from complex neural processes • Human behavior is often characterized by simple regularities at an overt level, yet this simplicity arises from a highly complex underlying neural mechanism. • Can we understand how these simple regularities could arise?

  8. Usher and McClelland (2001)Leaky Competing Accumulator Model • Addresses the process of decidingbetween two alternatives basedon external input (r1 + r2 = 1) with leakage, self-excitation, mutual inhibition, and noise: dy1/dt = r1-l(y1)+af(y1)–bf(y2)+x1 dy2/dt = r2-l(y2)+af(y2)–bf(y1)+x2

  9. Wong & Wang (2006) ~Usher & McClelland (2001)

  10. Dynamics of winning and loosing populations in the brain and the LCAM

  11. Roles of (k = l–a) and b

  12. Testing the model • Quantitative test: • Predicted shapes of ‘time-accuracy curves’ • Qualitative test: • Understanding the dynamics of the model leads to a novel prediction

  13. Time-accuracy curves for different |k-b| |k-b| = 0 |k-b| = .2 |k-b| = .4

  14. Assessing Integration Dynamics • Participant sees stream of S’s and H’s • Must decide which is predominant • 50% of trials last ~500 msec, allow accuracy assessment • 50% are ~250 msec, allow assessment of dynamics • Equal # of S’s and H’sBut there are clusters bunched together at the end (0, 2 or 4). Inhibition-dom. Leak-dominant Favored early Favored late

  15. S3 Subjects show both kindsof biases; the less the bias,the higher the accuracy,as predicted.

  16. Extension to N alternatives • Extension to n alternatives is very natural (just add units to pool). • Model accounts quite well for Hick’s law (RT increases with log n alternatives), assuming that threshold is raised with n to maintain equal accuracy in all conditions. • Use of non-linear activation function increases efficiency in cases where there are only a few alternatives ‘in contention’ given the stimulus.

  17. Neural Basis of Decision Making in Monkeys (Roitman & Shadlen, 2002) RT task paradigm of R&T. Motion coherence anddirection is varied fromtrial to trial.

  18. Neural Basis of Decision Making in Monkeys: Results Data are averaged over many different neurons that areassociated with intended eye movements to the locationof target.

  19. Wong & Wang (2006)

  20. Behavioral Data

  21. Integration of reward and motioninformation (Rorie & Newsome) Monkey’s choices reflect a beautifully regular combination of the effects of stimulus coherence and reward.

  22. Population response of LIP Neurons in two reward conditions Choosein Chooseout

  23. Some Open Questions • How localized vs. distributed are the neural populations involved in decisions? • What does it mean in the brain to pass the threshold for response initiation? • How are brain dynamics tuned to allow a practiced participant to optimize decisions, combining prior and current information in just the right mix to achieve (near) optimality?

More Related