0% found this document useful (0 votes)

135 views4 pages

2021 EE769 Tutorial Sheet 1

This document provides an introduction to basic mathematics concepts for machine learning, including vectors, matrices, functions, probability distributions, and gradient descent. It contains examples and exercises related to computing dot products, norms, determinants, eigenvalues, means, standard deviations, conditional probabilities, and applying concepts like continuity, convexity/concavity to functions.

Uploaded by

raktion

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

135 views4 pages

2021 EE769 Tutorial Sheet 1

Uploaded by

raktion

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

EE 769 Introduction to Machine Learning

Sheet 1 — 2020-21-2
Basic Mathematics for ML

1. Vectors:
(a) Compute the dot product of the given two vectors:
(i)
   
1 1
a = 2 b = 0
   

3 1
(ii) " # " #
1 1
a= b=
1 −1
(b) Find the norm of the following vector:
 
1
a= 0 
 

−1

(c) Find the cosine of the angle between two vectors:

   
1 0
a = 1 , b = −1
   

0 0

Hint: Get unit vectors in the direction of each of the two vectors, compute the dot
product of the two unit vectors. Draw it out.
(d) What is the projection of the the first vector on the to the second one?
" # " #
1 1
a= ,b =
0 1

Hint: Get a unit vector in the direction of the second, compute its dot product
with the first vector, multiply the dot product with the unit vector. Draw it out.

Department of Electrical Engineering, Indian Institute of Technology Bombay Page 1 of 4

Amit Sethi asethi@iitb.ac.in
EE 769 Introduction to Machine Learning: Sheet 1— 2020-21-2

2. Matrices:
(a) Compute the following matrix vector product, if they can be computed.
" #
h i 1 2
(i) 1 2
3 4
  
1 2 1
(ii) 4 5 5
  

0 1 6
(b) Compute the determinant of the following matrix:
" #
0 1
−2 −3
(c) Check if a vector [1, −1]T is an eigenvector of the above given matrix, and if so,
what is the corresponding eigenvalue?
(d) Confirm
# that
" the following#eigenvalue
# " decomposition is valid for the given matrix
" √ √ " √ √ #
5 3 1/ 2 1/ 2 2 0 1/ 2 −1/ 2
= √ √ √ √ by checking that the eigen-
3 5 −1/ 2 1/ 2 0 8 1/ 2 1/ 2
vectors are of norm 1, and the last matrix is the inverse of the first matrix, while
the second matrix is a diagonal matrix. Also confirm that Avi = λi vi for both i.
(e) Find the rank of the following matrices.
 
0 −1 5
(i) 2 4 −6
 

1 1 5
" #
−5 −7
(ii)
5 7
(f) Compute the trace of the matrices above (in part e).

Department of Electrical Engineering, Indian Institute of Technology Bombay Page 2 of 4

Amit Sethi asethi@iitb.ac.in
EE 769 Introduction to Machine Learning: Sheet 1— 2020-21-2

3. Functions: For the following functions, determine if the function is continuous, has a
finite derivative everywhere, has a sub-derivative that exists everywhere (limit of the
derivative from both side exist, and left limit ≤ right limit), has a global maxima
(confirm concavity), has a global minima (confirm convexity), has a local maxima (if
so, then for what value of x), or has a local minima (if so, then for what value of x)?
Make rough drawings to clarify the concepts.
(a) x2 − 2x + 4
(b) −x2 − 2x + 4
(c) x3 − 9x
2
(d) −e−x
p
(e) |x|

4. Python: Set up an ipython notebook in Google CoLab (https://colab.research.

google.com/), import pandas library by typing: import pandas as pd and perform
the following operations:
(a) Read the ”california housing train.csv” file from the sample data folder of colab
environment into a pandas dataframe.
hint: Read the csv file using the method : df=pd.read csv("./location/of/file.csv")
(b) Print the dataframe and extract the ’median income’ and ’population’ columns
from the dataframe.
hint: Try df.head() and df[[’column name1’,’column name2’]]
(c) Compute the mean and standard deviation of the ’median income’ and ’population’
columns.
hint: Try df[’column name’].mean() and df[’column name’].std()
(d) Create a new data frame with zero mean and unit standard deviation columns for
these columns.
hint: new df[’new column name(s)’]= operation to be performed
(e) Write this new dataframe into a new csv file.
hint: Try new df.to csv("./location/to/save/file.csv",index=False)

Department of Electrical Engineering, Indian Institute of Technology Bombay Page 3 of 4

Amit Sethi asethi@iitb.ac.in
EE 769 Introduction to Machine Learning: Sheet 1— 2020-21-2

5. Gradient descent or ascend, and Lagrange multiplier:

(a) For f (x) = x3 − 9x at x = −1 a small positive step is taken (i.e., x ← x + , > 0).
Will such a step lead us closer to a local maxima or a local minima?
(b) For the same function f (x) = x3 − 9x at x = −1, a gradient ascend update is
performed using x ← x + ηf 0 (x), η = 1. Is such a value of η desirable?
(c) For the same function f (x) = x3 − 9x at x = −1, compute the optimal update step
that should be added to x as per Newton’s method and determine if it will reach
the local maxima. If not, then why not?
(d) For the following function, find the expression for the gradient: f (x) = 3x21 + 2x2 +
5x33 + 4x1 x2 , where x = [x1 x2 x3 ]T .
(e) Find the minima of the function f (x) = x21 + x22 subject to the condition g(x) =
(x1 − 1)2 + (x2 − 1)2 − 1 = 0.

6. Probability distributions:
(a) What is the probability mass function of a random variable x that represents the
total number of heads if a fair coin is tossed two times?
(b) Between a fair coin and a biased coin, whose number of heads in two tosses has
higher entropy?
(c) If we map the number of heads in three tosses of a fair coin to variable x, and the
maximum number of consecutive heads in those three tosses to variable y, then
what is their joint probability mass function? Write this as a 2-dimensional table.
(d) For the previous question, compute the marginal distributions of x and y using the
joint PMF table.
(e) What is the conditional PMF of x given y = 1 for part (c)?
(f) Two Gaussian random variables (continuous, of course) x and y both have mean
µx = µy = 0, but σx = 1 while σy = 2. Which of them is more likely to have an
absolute value greater than 3? Use the following approach:
(i) Using the formula for a Gaussian PDF, draw two overlapping graphs in python
using matplotlib library, one for PDF of x, and another for y.
(ii) Observe where the two PDFs cross each other to answer part (f) qualitatively.
(g) Assume that slant of the eyes and size of the nose of an animal is captured as a
two dimensional feature vector. For cats, this vector has a Gaussian distribution

with µcat = 10 and Σcat = 10 01 . For dogs, the distribution is also Gaussian,

but with µdog = 01 and Σdog = 10 01 . Given that an individual animal has the

following slant and nose size 0.5
0 , then is it more likely to be cat or a dog by
simply comparing the PDF values of cats and dogs at that point?

Department of Electrical Engineering, Indian Institute of Technology Bombay Page 4 of 4

Amit Sethi asethi@iitb.ac.in

CNN PPT Unit Iv
No ratings yet
CNN PPT Unit Iv
134 pages
00-statistics
No ratings yet
00-statistics
18 pages
Data Science
No ratings yet
Data Science
74 pages
Module2.3 Hyperparameter Optimization
No ratings yet
Module2.3 Hyperparameter Optimization
29 pages
Danka t Mathematics of Machine Learning Master Linear Algebr
No ratings yet
Danka t Mathematics of Machine Learning Master Linear Algebr
729 pages
Eem520l3 2023
No ratings yet
Eem520l3 2023
25 pages
Gradient Descent
No ratings yet
Gradient Descent
15 pages
Lecture Notes - Logistic Regression
100% (1)
Lecture Notes - Logistic Regression
11 pages
2019-20-I MS Key
No ratings yet
2019-20-I MS Key
6 pages
Fuzzy Soft Set Theory and Its Applications
No ratings yet
Fuzzy Soft Set Theory and Its Applications
19 pages
9.deep Feedforward Networks
100% (1)
9.deep Feedforward Networks
13 pages
Deep Neural Network
No ratings yet
Deep Neural Network
12 pages
Lec20 RidgeRegression
No ratings yet
Lec20 RidgeRegression
21 pages
Introduction To Machine Learning: ETH Zurich Janik Schuettler Marcel Graetz FS18
No ratings yet
Introduction To Machine Learning: ETH Zurich Janik Schuettler Marcel Graetz FS18
18 pages
2019-20-I ES Key
No ratings yet
2019-20-I ES Key
4 pages
IE506 Bagging Boosting April5 6
No ratings yet
IE506 Bagging Boosting April5 6
14 pages
Unit 2 Preparing To Model
No ratings yet
Unit 2 Preparing To Model
49 pages
Machine Learning: Neural Networks
No ratings yet
Machine Learning: Neural Networks
22 pages
01-Introduction Machine Learning
100% (1)
01-Introduction Machine Learning
48 pages
RBM, DBN, and DBM
No ratings yet
RBM, DBN, and DBM
79 pages
Machine Learning Guide Line
No ratings yet
Machine Learning Guide Line
10 pages
Distance Based Models
No ratings yet
Distance Based Models
58 pages
A Practical Guide To Graph Neural Networks
No ratings yet
A Practical Guide To Graph Neural Networks
28 pages
Topic 37 - Limits & Derivatives of Trig Functions
No ratings yet
Topic 37 - Limits & Derivatives of Trig Functions
8 pages
03 Diversity PDF
No ratings yet
03 Diversity PDF
30 pages
Intro4 ANN Deep CNN PDF
No ratings yet
Intro4 ANN Deep CNN PDF
20 pages
ST2195 Programming For Data Science
No ratings yet
ST2195 Programming For Data Science
11 pages
Build Your Own C Interpreter
No ratings yet
Build Your Own C Interpreter
18 pages
Week 5 Programming Assignment: (Https://swayam - Gov.in)
No ratings yet
Week 5 Programming Assignment: (Https://swayam - Gov.in)
12 pages
Computer Education For Nepali School Students - QBASIC CLASS IX
No ratings yet
Computer Education For Nepali School Students - QBASIC CLASS IX
10 pages
Machine Learning - Stanford University - Coursera
No ratings yet
Machine Learning - Stanford University - Coursera
16 pages
Bias, Variance, and Tradeoff
No ratings yet
Bias, Variance, and Tradeoff
8 pages
Csps 1
100% (1)
Csps 1
62 pages
Uses and application of matrix in real life
No ratings yet
Uses and application of matrix in real life
14 pages
exercise01
No ratings yet
exercise01
3 pages
Cvdict
No ratings yet
Cvdict
309 pages
Slepc
No ratings yet
Slepc
134 pages
Final 12
No ratings yet
Final 12
11 pages
MATH 115: Lecture IV Notes
No ratings yet
MATH 115: Lecture IV Notes
5 pages
2010 Maria Petrou, Costas Petrou (Auth.) Image Processing - The Fundamentals, Second Edition
100% (2)
2010 Maria Petrou, Costas Petrou (Auth.) Image Processing - The Fundamentals, Second Edition
815 pages
Math 203-1.2
No ratings yet
Math 203-1.2
66 pages
Independent Component Analysis: Bhagesh Bhutani (20) Chayan Sharma (21) Deepak
No ratings yet
Independent Component Analysis: Bhagesh Bhutani (20) Chayan Sharma (21) Deepak
15 pages
Types of Solutions
No ratings yet
Types of Solutions
12 pages
ES 1 Handout 2
No ratings yet
ES 1 Handout 2
55 pages
Lesson 1 Tensor Index Notation
No ratings yet
Lesson 1 Tensor Index Notation
16 pages
Math Notation
No ratings yet
Math Notation
20 pages
1A - Systems of Linear Equations
No ratings yet
1A - Systems of Linear Equations
6 pages
Skewed Coordinates
100% (1)
Skewed Coordinates
22 pages
CLASS XII MATH Pre Board Dec 2024
No ratings yet
CLASS XII MATH Pre Board Dec 2024
5 pages
CS 229, Autumn 2016 Problem Set #1: Supervised Learning: m −y θ x m θ (i) (i)
No ratings yet
CS 229, Autumn 2016 Problem Set #1: Supervised Learning: m −y θ x m θ (i) (i)
8 pages
CMSC 56 Course Outline
No ratings yet
CMSC 56 Course Outline
17 pages
Cuestionarios IA
No ratings yet
Cuestionarios IA
17 pages
Deep Learning Unit-III
No ratings yet
Deep Learning Unit-III
9 pages
Class 12 Maths Preboard 1 Set 2
No ratings yet
Class 12 Maths Preboard 1 Set 2
8 pages
I. The Types of Machine Learning
No ratings yet
I. The Types of Machine Learning
8 pages
Matrix Bhu Class Notes
No ratings yet
Matrix Bhu Class Notes
80 pages
Divide and Conquer 2
No ratings yet
Divide and Conquer 2
6 pages
Grade 10 Mathematics Common Schemes Term 1 To 3
0% (1)
Grade 10 Mathematics Common Schemes Term 1 To 3
4 pages
Model With One-Word Context: 2vec 2vec 2vec 2vec
100% (1)
Model With One-Word Context: 2vec 2vec 2vec 2vec
17 pages
PCA1
No ratings yet
PCA1
45 pages
UNIT-I_Introduction to Computer Vision
No ratings yet
UNIT-I_Introduction to Computer Vision
45 pages
Week 5
No ratings yet
Week 5
8 pages
Hw1 Theory Solution PuHK4fmHvB
No ratings yet
Hw1 Theory Solution PuHK4fmHvB
4 pages
Syllabus Cours Linear Algebra 2023-2024
No ratings yet
Syllabus Cours Linear Algebra 2023-2024
5 pages
Lasso Regularization of Generalized Linear Models - MATLAB & Simulink
No ratings yet
Lasso Regularization of Generalized Linear Models - MATLAB & Simulink
14 pages
Midterm 2010 Solutions
No ratings yet
Midterm 2010 Solutions
8 pages
hw3 Solutions PDF
No ratings yet
hw3 Solutions PDF
11 pages
Max and Min PDF
No ratings yet
Max and Min PDF
19 pages
Least Square Vs Gradient Descent
100% (1)
Least Square Vs Gradient Descent
52 pages
Matlab - Array Manipulation and Tricks - Blink Dagger
No ratings yet
Matlab - Array Manipulation and Tricks - Blink Dagger
45 pages
Machine Learning
100% (1)
Machine Learning
185 pages
Vector Notes For IIT JEE - pdf-62
No ratings yet
Vector Notes For IIT JEE - pdf-62
8 pages
C OMBINATORIAL M ODELS OF C OMPLEX S YSTEMSTesis Doctorado Eng
No ratings yet
C OMBINATORIAL M ODELS OF C OMPLEX S YSTEMSTesis Doctorado Eng
194 pages
Radial Basis Function
No ratings yet
Radial Basis Function
35 pages
Distributions, Frobenious Theorem & Transformations: Harry G. Kwatny
No ratings yet
Distributions, Frobenious Theorem & Transformations: Harry G. Kwatny
22 pages
Syllabus - LA For DS
No ratings yet
Syllabus - LA For DS
2 pages
20) Area of A Triangle (Using Sine)
No ratings yet
20) Area of A Triangle (Using Sine)
13 pages
Ps 1
No ratings yet
Ps 1
5 pages
Demir - IJCTA - 2000 Floquet Theory and Non-Linear Perturbation Analysis For Oscillators With Differential-Algebraic Equations
No ratings yet
Demir - IJCTA - 2000 Floquet Theory and Non-Linear Perturbation Analysis For Oscillators With Differential-Algebraic Equations
23 pages
10.2 Introduction To Vectors
No ratings yet
10.2 Introduction To Vectors
7 pages
Car Make and Model Recognition Using Ima
No ratings yet
Car Make and Model Recognition Using Ima
8 pages
Rank and Nullity
No ratings yet
Rank and Nullity
6 pages
WWW - Ssmrmh.ro:) Then
No ratings yet
WWW - Ssmrmh.ro:) Then
4 pages
ML Lab
No ratings yet
ML Lab
21 pages
EE 769 Introduction To Machine Learning: Sheet 4 - 2020-21-2 Linear Classification
No ratings yet
EE 769 Introduction To Machine Learning: Sheet 4 - 2020-21-2 Linear Classification
4 pages
New Microsoft Word Document
No ratings yet
New Microsoft Word Document
2 pages
DPP 39-M
No ratings yet
DPP 39-M
2 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Problem Set 1 Foundations: PHYC30016 Electrodynamics
No ratings yet
Problem Set 1 Foundations: PHYC30016 Electrodynamics
2 pages
Hayashi CH 1 Answers
No ratings yet
Hayashi CH 1 Answers
4 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

2021 EE769 Tutorial Sheet 1

Uploaded by

2021 EE769 Tutorial Sheet 1

Uploaded by

EE 769 Introduction to Machine Learning

(c) Find the cosine of the angle between two vectors:

Department of Electrical Engineering, Indian Institute of Technology Bombay Page 1 of 4

Department of Electrical Engineering, Indian Institute of Technology Bombay Page 2 of 4

4. Python: Set up an ipython notebook in Google CoLab (https://colab.research.

Department of Electrical Engineering, Indian Institute of Technology Bombay Page 3 of 4

5. Gradient descent or ascend, and Lagrange multiplier:

Department of Electrical Engineering, Indian Institute of Technology Bombay Page 4 of 4

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.