
RoboCup: A Case Study in Multiagent System



  1. Vahid Mokhtari RoboCup: A Case Study in Multiagent System

  2. Contents • Trends • What is an Agent? • Multiagent System • Case Study in RoboCup

  3. Trends in History of Computing

  4. From Programming Perspective

  5. What is an Agent?

  6. Agent and Environment

  7. Environment • Accessible vs. Inaccessible - can the agent “see” everything? • Deterministic vs. Non-deterministic - do actions have guaranteed effect? • Static vs. Dynamic - does the environment change on its own? • Discrete vs. Continuous - is the number of actions and percepts finite?

  8. What is an Intelligent Agent?

  9. Examples of Intelligent Agents • Assistant agent in MS Office • Trading agents • Web spiders • Computer viruses • Characters in computer games

  10. Agents vs. Objects

  11. Agent Oriented Architectures

  12. Agent Types (1)

  13. Agent Types (2)

  14. Multiagent System (MAS)

  15. The Fully General Multiagent System

  16. Why MAS? • Some domains require it • Parallelism • Robustness • Scalability • Simpler programming • To study intelligence • Geographic distribution • Cost effectiveness

  17. MAS Research Area • Distributed Computing: Processors share data, but not control. Focus on low-level parallelization and synchronization. • Distributed AI: Control as well as data is distributed. Focus on problem solving, communication, and coordination. • Distributed Problem Solving (DPS): Task decomposition and/or solution synthesis. • Multiagent Systems (MAS): Behavior coordination or behavior management.

  18. MAS Taxonomy

  19. Issues in Building MAS

  20. Homogeneous Non-Communicating Multiagent Systems • Several agents with identical structure (sensors, effectors, domain knowledge, and decision functions). • Different sensor inputs and effector outputs. • Situated differently in the environment; each makes its own decisions about which actions to take.

  21. Heterogeneous Non-Communicating Multiagent Systems • Agents are situated differently in the environment • Different sensory inputs and different actions

  22. Homogeneous Communicating Multiagent Systems • Agents are identical except that they are situated differently in the environment • Agents can communicate with each other directly

  23. Heterogeneous Communicating Multiagent Systems • Different sensory data, goals, actions, and domain knowledge

  24. Learning Opportunities

  25. Importance of MAS • Research in “Distributed AI” started over 30 years ago, but only in the mid-1990s did it become a major research trend in AI. • The main conference (AAMAS) now attracts around 800 submissions (of which 20-25% are accepted) each year. • In addition, there are dozens of smaller workshops and conferences. It is a large, young, and dynamic research community.

  26. RoboCup: A Case Study in Multiagent System

  27. What is RoboCup?

  28. Domain Characteristics

  29. The Standard Problem

  30. RoboCup Soccer • Distributed • Multiagent • Teammates and adversaries • Partial world view • Noisy sensors and actuators • Real-time

  31. Applied Machine Learning

  32. Reinforcement Learning

  33. The Agent-Environment Interface • Agent and environment interact at discrete time steps: t = 0, 1, 2, … • Agent observes state at step t: st ∈ S • Produces action at step t: at ∈ A(st) • Gets resulting reward: rt+1 ∈ ℝ • And resulting next state: st+1
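The interaction loop above can be sketched in Python. The environment and agent here are hypothetical stand-ins (the slides name no concrete task): a one-dimensional walk in which the agent moves left or right and is rewarded for reaching position 3.

```python
import random

class Environment:
    """Toy environment for illustrating the agent-environment loop."""

    def reset(self):
        self.state = 0               # s_0
        return self.state

    def step(self, action):          # action in {-1, +1}
        # Clamp the state to [-3, 3]; reward 1 only at the goal state 3.
        self.state = max(-3, min(3, self.state + action))
        reward = 1.0 if self.state == 3 else 0.0
        done = self.state == 3
        return self.state, reward, done

def random_agent(state):
    """A placeholder policy: a_t drawn uniformly from A(s_t)."""
    return random.choice([-1, +1])

env = Environment()
state = env.reset()
total_reward = 0.0
for t in range(100):                 # discrete time steps t = 0, 1, 2, ...
    action = random_agent(state)             # a_t
    state, reward, done = env.step(action)   # s_{t+1}, r_{t+1}
    total_reward += reward
    if done:
        break
```

Any concrete MAS domain (including RoboCup soccer) replaces `Environment` and the agent's policy, but the loop of observe, act, and receive reward stays the same.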

  34. General Process of RL

  35. Action Selection Policies

  36. Exploration and Exploitation
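A common action selection policy that trades off exploration and exploitation is ε-greedy; the slides do not specify which policy was used, so this is only an illustrative sketch.

```python
import random

def epsilon_greedy(q_values, epsilon=0.1):
    """With probability epsilon pick a random action (exploration),
    otherwise pick the action with the highest Q value (exploitation)."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

Setting `epsilon=0` gives a purely greedy (exploiting) policy; `epsilon=1` explores uniformly at random.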

  37. Subtask of RoboCup Soccer

  38. SARSA (State-Action-Reward-State-Action) • SARSA is a learning algorithm in the reinforcement learning area of machine learning. • It is an on-policy learning method. • It learns state-action values (Q values).
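The core of SARSA is the update Q(s,a) ← Q(s,a) + α[r + γQ(s′,a′) − Q(s,a)]. A minimal tabular sketch (the dict-based Q table and the names `alpha`/`gamma` are assumptions, not from the slides):

```python
from collections import defaultdict

def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """Q(s,a) <- Q(s,a) + alpha * [r + gamma * Q(s',a') - Q(s,a)]

    alpha is the learning rate, gamma the discount factor.
    SARSA is on-policy: a_next is the action the policy actually
    takes in s_next, not the greedy maximum (as in Q-learning).
    """
    td_target = r + gamma * Q[(s_next, a_next)]
    Q[(s, a)] += alpha * (td_target - Q[(s, a)])

# Usage: Q maps (state, action) pairs to values, defaulting to 0.
Q = defaultdict(float)
sarsa_update(Q, "s0", "a1", 1.0, "s1", "a0")
```
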

  39. SARSA Algorithm

  40. Mapping Keepaway to SARSA

  41. Hand-coded Algorithm

  42. Result • With the hand-coded policy, keepers hold the ball for about 8.2 seconds on average. • With the learned (SARSA) policy, keepers hold the ball for about 12 seconds on average.

  43. Summary

  44. References

  45. Other Articles

  46. Thanks For Your Attention
