140 likes | 213 Views
Explore how intelligent agents learn through reinforcement to find optimal policies. Understand pros and cons of this approach. Solid foundation for RL concepts, with a focus on empirical learning. Distinguish between observations and states, various ML types (supervised, unsupervised, RL), value-based vs policy search, and dynamic programming vs model-based learning.
E N D
Cpt_S 580-03Intelligent Agents (Reinforcement Learning)
Actions: Wave, Stand, Clap • Observations: Colors, Rewards • Goal: find an optimal policy
Formalize the Problem • Knowns • Unknowns
(UP) Wave • (RIGHT) Stand • (DOWN) Clap
Agent can change policy • Representation and Reward as given • Pros: • Just specify goals • can be much less work than programming • unexpected situations • Cons: • Can be slow • need to pick representation & rewards
Goals for Our Class • Solid foundation for RL • More focus on empirical than theoretical
Reduced Formalism • Knowns • Unknowns
Observations vs. State • Types of ML: Supervised, Unsupervised, RL • Value-based vs. Policy Search • Dynamic Programming vs. Model Free vs. Model-based (Model-learning) • Explore vs. Exploit