Multi-Agent Systems and Strategic Decision Making: Module CS4760

1. Reinforcement learning involves an agent learning how to maximize rewards through interaction with an environment without an explicit teacher.
2. The key components of a reinforcement learning system are the policy, which defines the agent's behavior; the reward signal, which defines the goal; the value function, which estimates the long-term desirability of states and actions; and the environment model, which predicts the results of actions.
3. Reinforcement learning differs from other machine learning methods in that it involves sequential decision making where feedback may be delayed and actions have long-term consequences.

Uploaded by

Tùng Đào

Module CS4760
Multi-Agent Systems and Strategic Decision Making

Lecture 1: Introduction to Reinforcement Learning
Maria Chli

Based on the lecture series by D. Silver and by Fei-Fei Li, J. Johnson & S. Yeung, and on Sutton & Barto, 2nd ed. (2018)

Learning Outcomes
By the end of this lecture you should be able to
• define a Reinforcement Learning System and outline its major components
• recognise decision-making problems that lend themselves to RL
• differentiate RL from other strands of Machine Learning

What is an Agent?
An agent is a computer system capable of flexible autonomous action in some environment in order to meet its design objectives. [Wooldridge and Jennings, 1995]

[Diagram: the agent's cycle of perception, decision, and action]

What is Reinforcement Learning?

• We consider the problem of learning how to act, through experience and without an explicit teacher.
• An RL agent must interact with its world and from that learn how to maximise some cumulative reward over time.

Reinforcement Learning
[Diagram: the Environment sends an Observation and a Reward to the Agent; the Agent sends an Action back to the Environment]

Goal: Learn how to take actions in order to maximise reward

What is Reinforcement Learning?
[Diagram: RL at the intersection of several fields and their related sub-areas]
• Computer Science: Machine Learning
• Engineering: Control
• Neuroscience: Reward System
• Psychology: Classical/Operant Conditioning
• Economics: Bounded Rationality
• Mathematics: Operational Research

The problem: decision making

Machine learning - branches
• Supervised Learning
• Unsupervised Learning
• Reinforcement Learning

Characteristics of RL
What makes RL different from other machine learning paradigms?
• There is no supervisor, only a reward signal
• Feedback may be delayed, not instantaneous
• Time really matters (sequential, non-i.i.d. data)
• The agent senses the environment, and its actions affect the environment

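Reinforcement Learning
[Diagram: at time step t the Environment sends State s_t and Reward R_t to the Agent; the Agent sends Action a_t to the Environment, which then moves to Next state s_{t+1}]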

Goal: Learn how to take actions in order to maximise reward
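
The interaction loop in the diagram can be sketched in Python. This is a minimal illustration and not part of the lecture: `CorridorEnv`, `random_policy`, and `run_episode` are hypothetical names, and the corridor task (reach the last cell to earn a reward of 1) is an assumed toy problem.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment: positions 0..length-1, goal is the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        """Start a new episode and return the initial state s_0."""
        self.state = 0
        return self.state

    def step(self, action):
        """Apply action a_t (-1 = left, +1 = right); return (s_{t+1}, R_t, done)."""
        self.state = min(max(self.state + action, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def random_policy(state, rng):
    """A policy with no learning at all: pick left or right at random."""
    return rng.choice([-1, 1])


def run_episode(env, policy, rng, max_steps=100):
    """Run one episode of the agent-environment loop and return the total reward."""
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state, rng)                   # agent chooses a_t from s_t
        state, reward, done = env.step(action)        # environment returns s_{t+1}, R_t
        total_reward += reward
        if done:
            break
    return total_reward


if __name__ == "__main__":
    print(run_episode(CorridorEnv(), random_policy, random.Random(0)))
```

A learning agent would replace `random_policy` with a policy that is updated from the observed states and rewards; the loop itself stays the same.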

Examples: Cart pole problem
Objective: Balance a pole on top of a movable cart

State: angle, angular speed, position, horizontal velocity

Action: horizontal force applied on the cart
Reward: 1 at each time step if the pole is upright
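
As a rough sketch, the state and reward listed above could be represented as follows. The names `CartPoleState`, `UPRIGHT_LIMIT`, and `reward` are hypothetical, and the 12-degree threshold for "upright" is an assumption made for illustration; the slide does not give a value.

```python
import math
from dataclasses import dataclass


@dataclass
class CartPoleState:
    angle: float          # pole angle from vertical, in radians
    angular_speed: float  # rate of change of the pole angle
    position: float       # horizontal position of the cart
    velocity: float       # horizontal velocity of the cart


# Assumed threshold: the pole counts as upright within 12 degrees of vertical.
UPRIGHT_LIMIT = math.radians(12)


def reward(state):
    """Return 1 for a time step in which the pole is upright, otherwise 0."""
    return 1.0 if abs(state.angle) <= UPRIGHT_LIMIT else 0.0
```

With this reward, the total reward of an episode is simply the number of time steps for which the pole stayed upright.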

Examples: Robot Locomotion
Objective: Make the robot move forward

State: angle and position of the joints

Action: Torques applied on joints
Reward: 1 at each time step for staying upright and moving forward

Examples: Atari Games
Objective: Complete the game with the highest score

State: Raw pixel inputs of the game state

Action: Game controls, e.g. Left, Right, Up, Down
Reward: Score increase/decrease at each time step

Examples: Go
Objective: Win the game!

State: Position of all pieces

Action: Where to put the next piece down
Reward: 1 if win at the end of the game, 0 otherwise

Many more examples:

• Fly stunt manoeuvres in a helicopter
• Defeat the world champion at Backgammon
• Manage an investment portfolio
• Control a power station

Sequential Decision Making
• Goal: select actions to maximise total future reward
• Actions may have long-term consequences
• Reward may be delayed
• It may be better to sacrifice immediate reward to gain more long-term reward. Examples:
– A financial investment (may take months to mature)
– Blocking opponent moves (might help winning chances many moves from now)
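
A small worked example may make the trade-off concrete. The reward sequences below are made-up numbers chosen for illustration, and `total_reward` is a hypothetical helper; the point is only that the sequence with the worse first step can have the larger total.

```python
def total_reward(rewards):
    """Total (undiscounted) reward collected over a sequence of time steps."""
    return sum(rewards)


# Hypothetical reward sequences over five time steps.
greedy = [1, 0, 0, 0, 0]     # takes a small reward immediately, nothing afterwards
patient = [-1, 0, 0, 0, 5]   # pays a cost up front, large reward arrives late

if __name__ == "__main__":
    print("greedy total:", total_reward(greedy))
    print("patient total:", total_reward(patient))
```

Here the patient sequence starts with a lower immediate reward (-1 versus 1) but ends with a higher total (4 versus 1), which is the situation described in the last bullet.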

Major Components of an RL system
1. Policy: defines the agent’s behaviour function
Learning agent’s way of behaving at a given time
2. Reward signal: defines the goal of an RL problem
Agent’s objective: maximise the total reward it receives in the long run
3. Value function: how good is each state and/or action
Whereas rewards determine the immediate desirability of states, values indicate their long-term desirability
4. Model: agent’s representation of the environment
E.g. given a state and action, the model predicts the resultant next state and reward. Models are used for deciding on a course of action by considering possible future situations before they are actually experienced.
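
The four components can be illustrated on a tiny example. This is a sketch under stated assumptions, not the lecture's own formulation: the four-state chain, the fixed "always move right" policy, and the names `policy`, `reward_signal`, `model`, and `value` are all hypothetical, and the value is computed as a plain undiscounted sum.

```python
# Hypothetical 4-state chain: states 0..3, where state 3 is terminal.
TERMINAL = 3


def policy(state):
    """Policy: the agent's behaviour. This fixed policy always moves right (+1)."""
    return +1


def reward_signal(state, next_state):
    """Reward signal: 1 for arriving at the terminal state, 0 otherwise."""
    return 1.0 if next_state == TERMINAL and state != TERMINAL else 0.0


def model(state, action):
    """Model: predicts the next state and reward for a given state and action."""
    next_state = min(max(state + action, 0), TERMINAL)
    return next_state, reward_signal(state, next_state)


def value(state, max_steps=10):
    """Value function: total reward obtained from `state` when following `policy`.

    Computed by simulating the future with the model, without acting for real.
    """
    total = 0.0
    for _ in range(max_steps):
        if state == TERMINAL:
            break
        state, reward = model(state, policy(state))
        total += reward
    return total


if __name__ == "__main__":
    for s in range(TERMINAL + 1):
        print(s, value(s))
```

Note how states 0, 1, and 2 all receive an immediate reward of 0 for their first step, yet each has a value of 1 because the policy eventually reaches the rewarding state: rewards are immediate, values are long-term.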
