
Announcements

§ Guest Lectures Announced!
  § Tuesday, Nov 19: Catherine Olsson (Anthropic) on LLM development & interpretability
  § Thursday, Dec 3: Miles Brundage (formerly OpenAI) on AI policy and social impacts
CS 188: Artificial Intelligence
Hidden Markov Models

University of California, Berkeley


[These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials are available at http://ai.berkeley.edu.]
Reasoning over Time or Space

§ Often, we want to reason about a sequence of observations
  § Speech recognition
  § Robot localization
  § User attention
  § Medical monitoring
  § Language understanding

§ Need to introduce time (or space) into our models and update beliefs based on:
  § Getting more evidence (we did this with BNs)
  § World changing over time/space (new this week)
Motivating Example: Pacman Sonar
Today’s Topics

§ Quick probability recap

§ Markov Chains & their Stationary Distributions
  § How beliefs about state change with passage of time

§ Hidden Markov Models (HMMs) formulation
  § How beliefs change with passage of time and evidence

§ Filtering with HMMs
  § How to infer beliefs from evidence
Probability Recap

§ Conditional probability: P(x|y) = P(x,y) / P(y)

§ Marginal probability: P(x) = Σ_y P(x,y)

§ Product rule: P(x,y) = P(x|y) P(y)

§ Chain rule: P(x1, …, xn) = Π_i P(xi | x1, …, xi-1)
Probability Recap

§ X, Y independent if and only if: for all x, y: P(x,y) = P(x) P(y)

§ X and Y are conditionally independent given Z if and only if: for all x, y, z: P(x,y|z) = P(x|z) P(y|z)

§ Proportionality: P(X) ∝ f(X), also written P(X) ∝_X f(X), means P(X) = k f(X) (for some constant k that doesn’t depend on X). Equivalent to: P(X) = f(X) / Σ_x f(x)

§ Example:
  X     f(X)    P(X)
  x1    0.4     0.4 / (0.4 + 0.2)
  x2    0.2     0.2 / (0.4 + 0.2)
Markov Models

§ Value of X at a given time is called the state

X1 X2 X3 X4

§ Parameters: called transition probabilities or dynamics, specify how the state evolves over time (also, initial state probabilities)
§ Stationarity assumption: transition probabilities the same at all times
§ Same as MDP transition model, but no choice of action
§ A “growable” BN (can always use BN methods if we truncate to fixed length)
Conditional Independence

X1 X2 X3 X4

§ Basic conditional independence:
  § Past and future independent given the present
  § Each time step only depends on the previous
  § This is called the (first order) Markov property
Example Markov Chain: Weather

§ States: X = {rain, sun}

§ Initial distribution: 1.0 sun

§ CPT P(Xt | Xt-1) — the slide also shows two new ways of representing the same CPT (a state-transition diagram and a dynamic Bayes net fragment); only the table is reproduced here:

  Xt-1   Xt     P(Xt|Xt-1)
  sun    sun    0.9
  sun    rain   0.1
  rain   sun    0.3
  rain   rain   0.7
Example Markov Chain: Weather

§ Initial distribution: 1.0 sun

§ We know the CPT P(Xt | Xt-1):

  Xt-1   Xt     P(Xt|Xt-1)
  sun    sun    0.9
  sun    rain   0.1
  rain   sun    0.3
  rain   rain   0.7

§ What is the probability distribution after one step?

  P(X2 = sun) = Σ_{x1} P(x1, X2 = sun) = Σ_{x1} P(X2 = sun | x1) P(x1)
              = 0.9 · 1.0 + 0.3 · 0.0 = 0.9
Mini-Forward Algorithm

§ Question: What’s P(X) on some day t?

X1 X2 X3 X4 … Xt ?

§ We know P(X1) and P(Xt | Xt-1)

  P(xt) = Σ_{xt-1} P(xt-1, xt)
        = Σ_{xt-1} P(xt | xt-1) P(xt-1)

Forward simulation
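A minimal sketch of forward simulation in Python, using the weather chain from the previous slides (the dictionary encoding of the CPT is an illustrative assumption, not course code):

  P_init = {"sun": 1.0, "rain": 0.0}                # P(X1)
  P_trans = {"sun": {"sun": 0.9, "rain": 0.1},      # P(Xt | Xt-1 = sun)
             "rain": {"sun": 0.3, "rain": 0.7}}     # P(Xt | Xt-1 = rain)

  def mini_forward(P_init, P_trans, t):
      # Push P(X1) through t-1 transitions:
      # P(xt) = sum over xt-1 of P(xt | xt-1) P(xt-1)
      P = dict(P_init)
      for _ in range(t - 1):
          P = {x2: sum(P_trans[x1][x2] * P[x1] for x1 in P) for x2 in P}
      return P

  for t in (1, 2, 3, 10):
      print(t, mini_forward(P_init, P_trans, t))
  # P(X2 = sun) = 0.9, and the distribution drifts toward (0.75, 0.25)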
Example Run of Mini-Forward Algorithm

§ From initial observation of sun

  P(X1)  P(X2)  P(X3)  P(X4)  …  P(X∞)

§ From initial observation of rain

  P(X1)  P(X2)  P(X3)  P(X4)  …  P(X∞)

§ From yet another initial distribution P(X1):

  P(X1)  …  P(X∞)

[Demo: L13D1,2,3]
Video of Demo Ghostbusters Basic Dynamics
Video of Demo Ghostbusters Circular Dynamics
Video of Demo Ghostbusters Whirlpool Dynamics
Stationary Distributions

§ For most chains:
  § Influence of the initial distribution gets less and less over time.
  § The distribution we end up in is independent of the initial distribution

§ Stationary distribution:
  § The distribution we end up with is called the stationary distribution P∞ of the chain
  § It satisfies:

    P∞(X) = P∞+1(X) = Σ_x P(X|x) P∞(x)
Example: Stationary Distributions

§ Question: What’s P(X) at time t = infinity?

X1 X2 X3 X4 … X∞ ?

  P∞(X) = P∞+1(X) = Σ_x P(X|x) P∞(x)

With the weather CPT:

  Xt-1   Xt     P(Xt|Xt-1)
  sun    sun    0.9
  sun    rain   0.1
  rain   sun    0.3
  rain   rain   0.7

  P∞(sun) = P(sun|sun) P∞(sun) + P(sun|rain) P∞(rain) = 0.9 P∞(sun) + 0.3 P∞(rain)
  P∞(rain) = P(rain|sun) P∞(sun) + P(rain|rain) P∞(rain) = 0.1 P∞(sun) + 0.7 P∞(rain)

  So P∞(sun) = 3 P∞(rain), i.e. P∞(rain) = 1/3 P∞(sun)

Also: P∞(sun) + P∞(rain) = 1, giving:

  P∞(sun) = 3/4
  P∞(rain) = 1/4

§ Alternatively: run simulation for a long (ideally infinite) time
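A minimal sketch of the simulation approach in Python (same illustrative dictionary encoding as before): repeatedly apply the update P(X) ← Σ_x P(X|x) P(x) until it stops changing.

  P_trans = {"sun": {"sun": 0.9, "rain": 0.1},
             "rain": {"sun": 0.3, "rain": 0.7}}

  P = {"sun": 1.0, "rain": 0.0}   # any initial distribution converges here
  for _ in range(100):            # "a long time"
      P = {x2: sum(P_trans[x1][x2] * P[x1] for x1 in P) for x2 in P}

  print(P)  # ~{'sun': 0.75, 'rain': 0.25}, matching the closed-form answer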
Application of Stationary Distribution: Web Link Analysis

§ PageRank over a web graph
  § Each web page is a state
  § Initial distribution: uniform over pages
  § Transitions:
    § With prob. c, uniform jump to a random page (dotted lines, not all shown)
    § With prob. 1-c, follow a random outlink (solid lines)

§ Stationary distribution
  § Will spend more time on highly reachable pages
  § E.g. many ways to get to the Acrobat Reader download page
  § Somewhat robust to link spam
  § Google 1.0 returned the set of pages containing all your keywords, in decreasing rank; now all search engines use link analysis along with many other factors (rank actually getting less important over time)
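A minimal PageRank sketch under the transition model above: with probability c, jump to a uniformly random page; with probability 1-c, follow a random outlink. The four-page link graph and c = 0.15 are illustrative assumptions, not data from the lecture.

  links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
  pages = list(links)
  c = 0.15

  rank = {p: 1 / len(pages) for p in pages}  # uniform initial distribution
  for _ in range(100):
      new = {}
      for p in pages:
          # Mass from random jumps, plus mass from pages linking to p
          r = c / len(pages)
          r += (1 - c) * sum(rank[q] / len(links[q])
                             for q in pages if p in links[q])
          new[p] = r
      rank = new

  print(rank)  # the most reachable page (here C) gets the most mass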
Hidden Markov Models
Pacman – Sonar

[Demo: Pacman – Sonar – No Beliefs(L14D1)]


Video of Demo Pacman – Sonar (no beliefs)
Video of Demo Pacman – Sonar (with beliefs)
Hidden Markov Models

§ Markov chains not so useful for most agents

X1 X2 X3 X4 X5

§ Need observations to update your beliefs

§ Hidden Markov models (HMMs)
  § Underlying Markov chain over states X
  § You observe outputs (effects) at each time step

X1 X2 X3 X4 X5

E1 E2 E3 E4 E5
Example: Weather HMM

Raint-1 Raint Raint+1                with transitions P(Xt | Xt-1)

Umbrellat-1 Umbrellat Umbrellat+1    with emissions P(Et | Xt)

§ An HMM is defined by:
  § Initial distribution: P(X1)
  § Transitions: P(Xt | Xt-1)
  § Emissions: P(Et | Xt)

  Transitions                     Emissions
  Rt-1   Rt    P(Rt|Rt-1)         Rt    Ut    P(Ut|Rt)
  +r     +r    0.7                +r    +u    0.9
  +r     -r    0.3                +r    -u    0.1
  -r     +r    0.3                -r    +u    0.2
  -r     -r    0.7                -r    -u    0.8
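For concreteness, here is one way to encode this HMM's parameters in Python (an illustrative encoding; the 0.5/0.5 prior is the one used in the filtering example later in the lecture):

  P_init = {"+r": 0.5, "-r": 0.5}              # initial belief over X
  P_trans = {"+r": {"+r": 0.7, "-r": 0.3},     # transitions P(Xt | Xt-1)
             "-r": {"+r": 0.3, "-r": 0.7}}
  P_emit = {"+r": {"+u": 0.9, "-u": 0.1},      # emissions P(Et | Xt)
            "-r": {"+u": 0.2, "-u": 0.8}}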
Example: Ghostbusters HMM

§ P(X1) = uniform: 1/9 in each of the 9 grid cells

§ P(X’|X) = usually move clockwise, but sometimes move in a random direction or stay in place
  (e.g. P(X’|X = <1,2>): 1/2 on the clockwise neighbor, 1/6 on each of three adjacent cells, 0 elsewhere)

§ P(Rij|X) = same sensor model as before: red means close, green means far away.

X1 X2 X3 X4 X5

Ri,j Ri,j Ri,j Ri,j Ri,j
[Demo: Ghostbusters – Circular Dynamics – HMM (L14D2)]
Video of Demo Ghostbusters – Circular Dynamics -- HMM
Conditional Independence

§ HMMs have two important independence properties:
  § Markov hidden process: future depends on past via the present
  § Current observation independent of all else given current state

X1 X2 X3 X4 X5

E1 E2 E3 E4 E5

§ Does this mean that evidence variables are guaranteed to be independent?
  § No, they are correlated by the hidden state
Real HMM Examples

§ Speech recognition HMMs:
  § Observations are acoustic signals (continuous valued)
  § States are specific positions in specific words (so, tens of thousands)

§ Machine translation HMMs:
  § Observations are words (tens of thousands)
  § States are translation options

§ Robot tracking:
  § Observations are range readings (continuous)
  § States are positions on a map (continuous)

X1 X2 X3 X4 …

E1 E2 E3 E4 …
Filtering / Monitoring

X1 X2 X3 X4 … Xt        Find: P(Xt | e1, …, et) = Bt(X)

E1 E2 E3 E4 … Et        (observed)

§ Filtering, or monitoring, is the task of tracking the distribution Bt(X) = P(Xt | e1, …, et) (the belief state) over time
§ We start with B1(X) in an initial setting, usually uniform
§ As time passes, or we get observations, we update B(X)

§ The Kalman filter was invented in the 1960s and first implemented as a method of trajectory estimation for the Apollo program
Example: Robot Localization

Example from Michael Pfeiffer

[Grid map; shading shows probability, from 0 to 1]
t = 0

Sensor model: can read in which directions there is a wall, never more than 1 mistake
Motion model: may not execute action with small prob.
Example: Robot Localization

[Grid map; shading shows probability, from 0 to 1]
t = 1

Lighter grey: was possible to get the reading, but less likely because it required 1 mistake
Example: Robot Localization

[Grid maps at t = 2, 3, 4, 5; shading shows probability, from 0 to 1. With each additional reading, the belief concentrates on the robot’s true position.]
HMM Inference: Find State Given Evidence

§ We are given evidence at each time and want to know

  Bt(X) = P(Xt | e1:t)

§ Idea: start with P(X1) and derive Bt(X) in terms of Bt-1(X)
§ Two steps: Passage of Time & Observation
  § After time passes but before the new evidence arrives, the intermediate belief is B’t(X) = P(Xt | e1:t-1)

X1 X2 X3 X4

E1 E2 E3 E4

  Bt-1(X)  →  B’t(X) = P(Xt | e1:t-1)  →  Bt(X) = P(Xt | e1:t)
Inference: Base Cases

§ Passage of Time: X1 → X2
§ Observation: X1, with evidence E1
Passage of Time: Base Case

X1 X2

Have: P(X1), P(X2|X1)

Want: P(X2) = Σ_{x1} P(x1, X2) = Σ_{x1} P(X2|x1) P(x1)
Passage of Time: General Case

§ Assume we have current belief P(X | evidence to date): B(xt) = P(xt | e1:t)

X1 X2

§ Then, after one time step passes:

  P(Xt+1 | e1:t) = Σ_{xt} P(Xt+1, xt | e1:t)
                 = Σ_{xt} P(Xt+1 | xt, e1:t) P(xt | e1:t)
                 = Σ_{xt} P(Xt+1 | xt) P(xt | e1:t)

§ Or compactly:

  B’(Xt+1) = Σ_{xt} P(Xt+1 | xt) B(xt)

§ Basic idea: beliefs get “pushed” through the transitions

§ With the “B” notation, we have to be careful about what time step t the belief is about, and what evidence it includes
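A minimal sketch of this update in Python (dictionary encoding as in the earlier sketches):

  P_trans = {"+r": {"+r": 0.7, "-r": 0.3},   # weather HMM transitions
             "-r": {"+r": 0.3, "-r": 0.7}}

  def elapse_time(B, P_trans):
      # B'(Xt+1) = sum over xt of P(Xt+1 | xt) B(xt)
      return {x2: sum(P_trans[x1][x2] * B[x1] for x1 in B) for x2 in B}

  print(elapse_time({"+r": 0.5, "-r": 0.5}, P_trans))  # {'+r': 0.5, '-r': 0.5}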
Example: Passage of Time

§ As time passes, uncertainty “accumulates” (Transition model: ghosts usually go clockwise)

[Belief grids shown at T = 1, T = 2, T = 5]
Observation: Base Case

X1

E1

Have: P(X1), P(E1|X1)

Want: P(X1|e1) ∝_{X1} P(X1) P(e1|X1)

Also can write as:

  P(x1|e1) = P(x1) P(e1|x1) / Σ_{x'} P(x') P(e1|x')
Observation: General Case

§ Assume we have current belief P(X | previous evidence):

  B’(Xt+1) = P(Xt+1 | e1:t)

§ Then, after evidence comes in:

  P(Xt+1 | e1:t+1) = P(Xt+1, et+1 | e1:t) / P(et+1 | e1:t)
                   ∝_{Xt+1} P(Xt+1, et+1 | e1:t)
                   = P(et+1 | e1:t, Xt+1) P(Xt+1 | e1:t)
                   = P(et+1 | Xt+1) P(Xt+1 | e1:t)

§ Or, compactly:

  B(Xt+1) ∝_{Xt+1} P(et+1 | Xt+1) B’(Xt+1)

§ Basic idea: beliefs “reweighted” by likelihood of evidence
§ Unlike passage of time, we have to renormalize
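A minimal sketch of this update in Python (same illustrative encoding; note the explicit renormalization at the end):

  P_emit = {"+r": {"+u": 0.9, "-u": 0.1},   # weather HMM emissions
            "-r": {"+u": 0.2, "-u": 0.8}}

  def observe(B_prime, P_emit, e):
      # B(Xt+1) is proportional to P(et+1 | Xt+1) B'(Xt+1), then normalize
      unnorm = {x: P_emit[x][e] * B_prime[x] for x in B_prime}
      total = sum(unnorm.values())
      return {x: v / total for x, v in unnorm.items()}

  print(observe({"+r": 0.5, "-r": 0.5}, P_emit, "+u"))
  # {'+r': 0.818..., '-r': 0.181...}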
Example: Observation

§ As we get observations, beliefs get reweighted, uncertainty “decreases”

[Belief grids shown before and after an observation]
Two Steps: Passage of Time + Observation

Passage of time:   B’(Xt+1) = Σ_{xt} P(Xt+1 | xt) B(xt)

Observation:       B(Xt+1) ∝_{Xt+1} P(et+1 | Xt+1) B’(Xt+1)

X1 X2 X3 X4 X5

E1 E2 E3 E4 E5
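Putting the two steps together gives the full filtering loop. This minimal sketch (same illustrative encoding as the earlier sketches) reproduces the Weather HMM example worked out below:

  P_trans = {"+r": {"+r": 0.7, "-r": 0.3}, "-r": {"+r": 0.3, "-r": 0.7}}
  P_emit = {"+r": {"+u": 0.9, "-u": 0.1}, "-r": {"+u": 0.2, "-u": 0.8}}

  def elapse_time(B):
      return {x2: sum(P_trans[x1][x2] * B[x1] for x1 in B) for x2 in B}

  def observe(B_prime, e):
      unnorm = {x: P_emit[x][e] * B_prime[x] for x in B_prime}
      total = sum(unnorm.values())
      return {x: v / total for x, v in unnorm.items()}

  B = {"+r": 0.5, "-r": 0.5}     # prior belief
  for e in ["+u", "+u"]:         # see an umbrella two days in a row
      B = observe(elapse_time(B), e)
      print(B)
  # day 1: {'+r': 0.818..., '-r': 0.181...}
  # day 2: {'+r': 0.883..., '-r': 0.116...}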
Pacman – Sonar

[Demo: Pacman – Sonar – No Beliefs(L14D1)]


Video of Demo Pacman – Sonar (with beliefs)
Example: Weather HMM

Passage of Time:   B’(Xt+1) = Σ_{xt} P(Xt+1 | xt) B(xt)
Observation:       B(Xt+1) ∝_{Xt+1} P(et+1 | Xt+1) B’(Xt+1), then normalize

Rain0 Rain1 Rain2

Umbrella1 Umbrella2

  Rt   Rt+1  P(Rt+1|Rt)          Rt   Ut   P(Ut|Rt)
  +r   +r    0.7                 +r   +u   0.9
  +r   -r    0.3                 +r   -u   0.1
  -r   +r    0.3                 -r   +u   0.2
  -r   -r    0.7                 -r   -u   0.8

Start: B(+r) = 0.5, B(-r) = 0.5

Step 1, passage of time:
  B’(+r) = 0.5·0.7 + 0.5·0.3 = 0.5
  B’(-r) = 0.5·0.3 + 0.5·0.7 = 0.5

Step 1, observe umbrella (+u), then normalize:
  B(+r) ∝ 0.9·0.5 = 0.45  →  B(+r) = 0.818
  B(-r) ∝ 0.2·0.5 = 0.10  →  B(-r) = 0.182

Step 2, passage of time:
  B’(+r) = 0.818·0.7 + 0.182·0.3 = 0.627
  B’(-r) = 0.818·0.3 + 0.182·0.7 = 0.373

Step 2, observe umbrella (+u), then normalize:
  B(+r) ∝ 0.9·0.627 = 0.564  →  B(+r) = 0.883
  B(-r) ∝ 0.2·0.373 = 0.075  →  B(-r) = 0.117
What we did today

§ Markov Chains & their Stationary Distributions
  § How beliefs about state change with passage of time

§ Hidden Markov Models (HMMs) formulation
  § How beliefs change with passage of time and evidence

§ Filtering with HMMs
  § How to infer beliefs from evidence

Next Time: More Filtering!
