Reinforcement Learning
Reinforcement Learning. Agenda. Online learning Reinforcement learning Model-free vs. model-based Passive vs. active learning Exploration-exploitation tradeoff. Incremental (“Online”) Function Learning. Data is streaming into learner x 1 ,y 1 , …, x n ,y n y i = f(x i )
956 views • 44 slides