reinforcement learning example l.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
Reinforcement learning example PowerPoint Presentation
Download Presentation
Reinforcement learning example

Loading in 2 Seconds...

play fullscreen
1 / 13

Reinforcement learning example - PowerPoint PPT Presentation


  • 601 Views
  • Uploaded on

Reinforcement learning example. Arrows indicate strength between two problem states Start maze …. Start. S 2. S 4. S 3. S 8. S 7. S 5. Goal. The first response leads to S2 … The next state is chosen by randomly sampling from the possible next states

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'Reinforcement learning example' - Leo


Download Now An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
reinforcement learning example
Reinforcement learning example

Arrows indicate strength between two problem states

Start maze …

Start

S2

S4

S3

S8

S7

S5

Goal

slide2

The first response leads to S2 …

The next state is chosen by randomly sampling from the possible next states

weighted by their associative strength

Associative strength = line width

Start

S2

S4

S3

S8

S7

S5

Goal

slide4

At S3, choices lead to either S2, S4, or S7.

S7 was picked (randomly)

Start

S2

S4

S3

S8

S7

S5

Goal

slide6

Next response is S4

Start

S2

S4

S3

S8

S7

S5

Goal

slide8

And the goal is reached …

Start

S2

S4

S3

S8

S7

S5

Goal

slide9

Goal is reached,

strengthen the associative connection between goal state and last response

Next time S5 is reached, part of the associative strength is passed back to S4...

Start

S2

S4

S3

S8

S7

S5

Goal

slide10

Start maze again…

Start

S2

S4

S3

S8

S7

S5

Goal

slide12

S5 is likely to lead to GOAL through strenghtened route

In reinforcement learning, strength is also passed back to the last state

This paves the way for the next time going through maze

Start

S2

S4

S3

S8

S7

S5

Goal