01-intro

The document outlines the course ECE 6254 on Statistical Machine Learning, focusing on learning effective models from data for practical inference and signal processing problems. It covers various learning approaches, including supervised and unsupervised learning, and emphasizes the importance of probabilistic models. Prerequisites include knowledge of probability, linear algebra, and multivariable calculus, with no formal textbook required, and grading based on tests, homework, and projects.

Uploaded by

Mark Davenport


ECE 6254

Statistical Machine Learning

Spring 2024

Mark A. Davenport
Electrical & Computer
Disclaimer!
None of what I just said was written by me…

Me at 11pm last night:

– Lesson learned: If you want comedy (and nonsense), use Bard

Caveats
– I disavow the statement “this course is not all about math and equations”
– Natural language processing and self-driving cars are not going to be a central focus
Statistical machine learning
• How can we
– learn effective models from data?
– apply these models to practical inference and signal processing problems?

• Example problems: classification, regression, prediction, data modeling,
clustering, and data exploration/visualization

• Our approach: statistical inference

• Main subject of this course

– how to reason about and work with probabilistic models to help us make inferences
from data
What is machine learning?
learn: gain or acquire knowledge of or skill in (something) by
study, experience, or being taught

How do we learn that this is a tree?

My daddy told me that a tree is
a perennial plant with an
elongated stem, or trunk,
supporting leaves or branches.

This has a trunk and branches.

Therefore it is a tree.
What is machine learning?
learn: gain or acquire knowledge of or skill in (something) by
study, experience, or being taught

How do we learn that this is a tree?

EXAMPLES!

A good definition of learning for this course:

“using a set of examples to infer
something about an underlying process”
Why learn from data?
Traditional signal processing is “top down”

Given a model for our data, derive the optimal algorithm

A learning approach is more “bottom up”

Given some examples, derive a good algorithm

Sometimes a good model is really hard to derive from first principles

Examples of learning
The Netflix prize (2007)

Predict how a user will rate a movie

10% improvement = $1 million prize

• Some pattern exists

– users do not assign ratings completely at random – if you like Godfather I, you’ll
probably like Godfather II

• It is hard to pin down the pattern mathematically

• We have lots and lots of data

– we know how a user has rated other movies, and we know how other users have
rated this (and other) movies
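
One crude way to use the observation that "some pattern exists" is a baseline that adds a per-user offset and a per-movie offset to the global mean rating. The sketch below is only an illustration under that assumption, not the method of any Netflix Prize entry; the toy ratings and the names `fit_baseline` and `predict` are made up for this example.

```python
from collections import defaultdict

# Hypothetical toy data: (user, movie, rating on a 1-5 scale).
ratings = [
    ("ann", "Godfather I", 5), ("ann", "Godfather II", 5),
    ("bob", "Godfather I", 4), ("bob", "Godfather II", 4),
    ("cat", "Godfather I", 2), ("cat", "Toy Story", 5),
    ("dan", "Godfather I", 5),
]

def fit_baseline(ratings):
    """Estimate the global mean plus average per-user and per-movie offsets."""
    mu = sum(r for _, _, r in ratings) / len(ratings)
    by_user, by_movie = defaultdict(list), defaultdict(list)
    for user, movie, r in ratings:
        by_user[user].append(r - mu)
        by_movie[movie].append(r - mu)
    user_bias = {u: sum(v) / len(v) for u, v in by_user.items()}
    movie_bias = {m: sum(v) / len(v) for m, v in by_movie.items()}
    return mu, user_bias, movie_bias

def predict(model, user, movie):
    """Predict a rating; unseen users or movies contribute a zero offset."""
    mu, user_bias, movie_bias = model
    raw = mu + user_bias.get(user, 0.0) + movie_bias.get(movie, 0.0)
    return min(5.0, max(1.0, raw))  # clip to the rating scale

model = fit_baseline(ratings)
# dan has only rated Godfather I; predict his rating for Godfather II.
print(predict(model, "dan", "Godfather II"))
```

The point of the sketch is that nothing about movies is modeled from first principles: every number in `model` comes from the examples.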
Examples of learning
• Recommendation systems
• Speech recognition
• Image classification
• Object detection
• Language modeling
• Spam filtering
• Machine translation
• Time series forecasting (traffic, weather, markets, etc.)
• Search
• Fraud detection
• Medical diagnosis
• …
Supervised learning
We are given input data $x_1, \ldots, x_n$

Each $x_i$ represents a measurement or observation of some natural or man-made
phenomenon
– may be called input, pattern, signal, feature vector, instance, or independent
variable
– the coordinates of $x_i$ may be called features, attributes, predictors, or covariates

In the supervised case, we are also given output data $y_1, \ldots, y_n$

– may be called output, label, response, or dependent variable

The data $(x_1, y_1), \ldots, (x_n, y_n)$ are called the training data

Supervised learning
We can think of a pair $(x_i, y_i)$ as obeying a (possibly noisy) input-output
relationship

The goal of supervised learning is usually to generalize the input-output
relationship so that we can predict the output associated with a previously
unseen input

The primary supervised learning problems are
– classification: the output is a discrete label
– regression: the output is a real number
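
To make the two problem types concrete, here is a minimal sketch that uses a nearest-neighbor rule as the predictor. The rule is chosen only because it is short; the slides do not single it out. The training points and the name `nearest_neighbor` are hypothetical.

```python
import math

# Hypothetical training inputs (feature vectors) and class labels.
train_x = [(1.0, 1.0), (1.5, 2.0), (5.0, 5.0), (6.0, 5.5)]
train_y = [0, 0, 1, 1]

def nearest_neighbor(x, xs, ys):
    """Return the output paired with the training input closest to x."""
    best = min(range(len(xs)), key=lambda i: math.dist(x, xs[i]))
    return ys[best]

# Classification: the predicted output is a discrete label.
print(nearest_neighbor((1.2, 1.4), train_x, train_y))

# Regression: same rule, but the outputs are real numbers.
reg_y = [1.1, 1.9, 5.2, 5.8]
print(nearest_neighbor((5.4, 5.1), train_x, reg_y))
```

In both calls the query point is not in the training set, which is exactly the "previously unseen input" case.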
Unsupervised learning
The inputs are not accompanied by labels

The goal of unsupervised learning is typically not related to future observations

Instead we want to understand the structure in the data sample itself, or to infer
some characteristic of the underlying probability distribution

Examples of unsupervised learning problems include

– clustering
– density estimation
– dimensionality reduction/feature selection
– visualization
– generative modeling
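
As one example from this list, clustering can be sketched in a few lines. The code below runs a two-cluster k-means on scalar data; it is a toy version under simplifying assumptions (one dimension, two clusters, centers initialized at the smallest and largest points), and the name `kmeans_1d` and the data are hypothetical.

```python
def kmeans_1d(points, iters=20):
    """Two-cluster k-means on scalars, initialized at the min and max."""
    centers = [min(points), max(points)]
    groups = [[], []]
    for _ in range(iters):
        groups = [[], []]
        for p in points:
            # Assign each point to its nearest center.
            idx = 0 if abs(p - centers[0]) <= abs(p - centers[1]) else 1
            groups[idx].append(p)
        # Move each center to the mean of its group (keep it if the group is empty).
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

# No labels are given; the structure is inferred from the inputs alone.
data = [1.0, 1.2, 0.8, 5.0, 5.3, 4.7]
centers, groups = kmeans_1d(data)
print(centers)
print(groups)
```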
Other variants of learning
• semi-supervised learning
• self-supervised learning
• active learning
• online learning
• reinforcement learning
• anomaly detection
• transfer learning
• multi-task learning
• …

In general, most learning problems can be thought of as variants of traditional
signal processing problems, but where we have no idea (a priori) how to model
our signals
Prerequisites
• Probability
– random variables, expectation, joint distributions, independence, conditional
distributions, Bayes' rule, multivariate normal distribution, …

• Linear algebra
– norms, inner products, orthogonality, linear independence, eigenvalues/vectors,
eigenvalue decompositions, …

• Multivariable calculus
– partial derivatives, gradients, the chain rule, …

• Python or similar programming experience (C or MATLAB)

Text
There is no formally required textbook for this course, but I will draw material
heavily from these sources:

A list of other useful books and links to relevant papers will be posted on the
course webpage

Lecture notes and slides will also be posted on the course webpage
Grading
• Pre-test (5%)

• Homework (25%)

• Data challenges (10%)

• Midterm exam (20%)

• Final exam (20%)

• Final project (20%)

Distance learning
Welcome to our online students!

Recorded lectures will be available to all students
(including on-campus students)

I need your help to make this a success

Online resources:
• Course website
• Canvas
• Piazza
A brief interlude

Gradus
Descendo!
Could you learn this trick?
Suppose that
• $x$ denotes the color of the card
– 0 = black
– 1 = red
• $y$ denotes which card is hidden
– E.g., Ace of Spades, Queen of Hearts, …

You observe me doing this trick many times and form a dataset: $(x_1, y_1), \ldots, (x_n, y_n)$

Can you learn a function $f$ such that $f(x)$ is a reliable predictor of $y$?
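
A counting argument suggests the answer: a function of a single color bit can output at most two different cards, so it can be right for at most 2 of the 52 possible hidden cards. The sketch below checks this with a majority-vote learner. It assumes the most favorable reading, namely that the input is the hidden card's own color and that the hidden card is uniformly random; those assumptions, and the names `learn`, `color`, and `DECK`, are not from the slides.

```python
import random
from collections import Counter

RANKS = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
SUITS = {"Spades": 0, "Clubs": 0, "Hearts": 1, "Diamonds": 1}  # 0 = black, 1 = red
DECK = [(rank, suit) for suit in SUITS for rank in RANKS]

def color(card):
    """Return 0 for a black card and 1 for a red card."""
    return SUITS[card[1]]

def learn(dataset):
    """Map each observed input to the output seen most often with it."""
    votes = {}
    for x, y in dataset:
        votes.setdefault(x, Counter())[y] += 1
    return {x: counts.most_common(1)[0][0] for x, counts in votes.items()}

rng = random.Random(0)
# Assumed setup: the hidden card is uniform over the deck, x is its color.
hidden = [rng.choice(DECK) for _ in range(5000)]
dataset = [(color(card), card) for card in hidden]
f = learn(dataset)

# f takes only two values, so it matches exactly one black and one red card.
hits = sum(f[color(card)] == card for card in DECK)
print(hits, "of", len(DECK))
```

No amount of extra training data changes this count; the limitation is in what the input reveals, not in the learner.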

Another approach
You watch me do this trick a couple times and notice I always hand out 5 cards

Suppose you instead consider

Now, can you learn a function $f$ such that $f(x)$ is a reliable predictor of $y$?

Is learning even possible?
or: How I learned to stop worrying and love statistics

Supervised learning
Given training data $(x_1, y_1), \ldots, (x_n, y_n)$, we would like to learn an (unknown)
function $f$ such that $f(x) = y$ for $x$ other than $x_1, \ldots, x_n$

but…

as we have just seen, this is impossible. Without any additional assumptions, we
can conclude nothing about $f$ except (maybe) for its values on $x_1, \ldots, x_n$
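
The claim can be checked by brute force on a tiny problem. The sketch below enumerates every binary-valued function on 3-bit inputs, keeps those that agree with a small training set, and counts how they vote on the inputs that were never observed. The training labels are hypothetical; any other choice gives the same split.

```python
from itertools import product

inputs = list(product([0, 1], repeat=3))  # all 8 possible 3-bit inputs

# Hypothetical training data: labels observed on 5 of the 8 inputs.
train = {(0, 0, 0): 0, (0, 0, 1): 1, (0, 1, 0): 1, (0, 1, 1): 0, (1, 0, 0): 1}

# Every function from 3 bits to 1 bit is a table of 8 output bits.
consistent = []
for outputs in product([0, 1], repeat=len(inputs)):
    f = dict(zip(inputs, outputs))
    if all(f[x] == y for x, y in train.items()):
        consistent.append(f)

unseen = [x for x in inputs if x not in train]
for x in unseen:
    ones = sum(f[x] for f in consistent)
    print(x, ":", ones, "of", len(consistent), "consistent functions output 1")
```

Each unseen input gets an even split: half of the functions that fit the training data say 0 and half say 1, so the data alone cannot favor either answer.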
Probability to the rescue!
Any $f$ agreeing with the training data may be possible,
but that does not mean that any $f$ is equally probable

A short digression
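• Suppose that Javier has a biased coin, which lands on heads with some unknown
probability $p$
– $P[\text{heads}] = p$
– $P[\text{tails}] = 1 - p$
• Javier tosses the coin $n$ times
– $\hat{p} = \dfrac{\text{number of heads}}{n}$

Does $\hat{p}$ tell us anything about $p$?

What can we learn from $\hat{p}$?
Given enough tosses (large $n$), we expect that $\hat{p} \approx p$

Law of large numbers: $\hat{p} \to p$ as $n \to \infty$

Clearly, at least in a very limited sense, we can learn something about $p$ from
observations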

There is always the possibility that we are totally wrong, but given enough data,
the probability of this should be very small
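
A quick simulation shows the effect. The sketch below tosses a simulated biased coin and reports the fraction of heads for increasing numbers of tosses. The heads probability 0.3 is a hypothetical value picked for the demonstration (in the lecture's setup it is unknown), and the names `estimate_p`, `p`, and `n` are choices made for this sketch.

```python
import random

def estimate_p(p, n, rng):
    """Toss a coin with P[heads] = p a total of n times; return the fraction of heads."""
    heads = sum(rng.random() < p for _ in range(n))
    return heads / n

rng = random.Random(0)  # fixed seed so the run is repeatable
p = 0.3                 # hypothetical true heads probability
for n in (10, 100, 1000, 100000):
    p_hat = estimate_p(p, n, rng)
    print(f"n = {n:6d}  estimate = {p_hat:.4f}  error = {abs(p_hat - p):.4f}")
```

The error typically shrinks as the number of tosses grows, though any single run with few tosses can still land far from the true value.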

