ML MANUAL
1. Implement and demonstrate the FIND-S algorithm for finding the most specific
hypothesis based on a given set of training data samples. Read the training data from a .CSV
file.
FIND-S Algorithm
1. Initialize h to the most specific hypothesis in H
2. For each positive training instance x
       For each attribute constraint ai in h
           If the constraint ai is satisfied by x
           Then do nothing
           Else replace ai in h by the next more general constraint that is satisfied by x
3. Output hypothesis h
Training Examples:
import csv
a = []
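The listing breaks off after these two lines; a sketch of the usual completion, assuming
the training data sits in 'enjoysport.csv' (the same file read in Program 2) with the target
value in the last column:
with open('enjoysport.csv', 'r') as csvfile:
    for row in csv.reader(csvfile):
        a.append(row)
print("The total number of training instances:", len(a))

num_attribute = len(a[0]) - 1
hypothesis = ['0'] * num_attribute  # start from the most specific hypothesis

for i in range(len(a)):
    if a[i][num_attribute].lower() == 'yes':  # consider positive instances only
        for j in range(num_attribute):
            if hypothesis[j] == '0' or hypothesis[j] == a[i][j]:
                hypothesis[j] = a[i][j]
            else:
                hypothesis[j] = '?'  # generalize the conflicting attribute
print("The maximally specific hypothesis:", hypothesis)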
Output:
2. For a given set of training data examples stored in a .CSV file, implement and
demonstrate the Candidate-Elimination algorithm to output a description of the set of all
hypotheses consistent with the training examples.
Candidate-Elimination Algorithm
For each training example d, do:
• If d is a positive example
• Remove from G any hypothesis inconsistent with d
• For each hypothesis s in S that is not consistent with d
• Remove s from S
• Add to S all minimal generalizations h of s such that
• h is consistent with d, and some member of G is more general than h
• Remove from S any hypothesis that is more general than another hypothesis in S
• If d is a negative example
• Remove from S any hypothesis inconsistent with d
• For each hypothesis g in G that is not consistent with d
• Remove g from G
• Add to G all minimal specializations h of g such that
• h is consistent with d, and some member of S is more specific than h
• Remove from G any hypothesis that is less general than another hypothesis in G
Training Examples:
import numpy as np
import pandas as pd
data = pd.read_csv('enjoysport.csv')
concepts = np.array(data.iloc[:,0:-1])
print(concepts)
target = np.array(data.iloc[:,-1])
print(target)
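The extracted listing stops here; a sketch of the usual learn() routine that produces the
Final Specific_h / Final General_h output shown below (the 'yes' target value matches the
enjoysport data):
def learn(concepts, target):
    specific_h = concepts[0].copy()
    general_h = [["?" for _ in range(len(specific_h))]
                 for _ in range(len(specific_h))]
    for i, h in enumerate(concepts):
        if target[i] == "yes":  # positive example: generalize S
            for x in range(len(specific_h)):
                if h[x] != specific_h[x]:
                    specific_h[x] = '?'
                    general_h[x][x] = '?'
        else:  # negative example: specialize G
            for x in range(len(specific_h)):
                if h[x] != specific_h[x]:
                    general_h[x][x] = specific_h[x]
                else:
                    general_h[x][x] = '?'
    # drop the rows of G that remained fully general
    general_h = [h for h in general_h if h != ['?'] * len(specific_h)]
    return specific_h, general_h

s_final, g_final = learn(concepts, target)
print("Final Specific_h:")
print(s_final)
print("Final General_h:")
print(g_final)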
Output:
Final Specific_h:
['sunny' 'warm' '?' 'strong' '?' '?']
Final General_h:
[['sunny', '?', '?', '?', '?', '?'],
['?', 'warm', '?', '?', '?', '?']]
3. Write a program to demonstrate the working of the decision tree based ID3 algorithm. Use
an appropriate data set for building the decision tree and apply this knowledge to classify a
new sample.
ID3 Algorithm
Examples are the training examples. Target_attribute is the attribute whose value is to
be predicted by the tree. Attributes is a list of other attributes that may be tested by the
learned decision tree. Returns a decision tree that correctly classifies the given
Examples.
ID3(Examples, Target_attribute, Attributes)
Create a Root node for the tree
If all Examples are positive, Return the single-node tree Root with label = +
If all Examples are negative, Return the single-node tree Root with label = -
If Attributes is empty, Return the single-node tree Root with label = most common
value of Target_attribute in Examples
Otherwise Begin
A ← the attribute from Attributes that best* classifies Examples
(* the best attribute is the one with the highest information gain, defined below)
The decision attribute for Root ← A
For each possible value vi of A,
    Add a new tree branch below Root, corresponding to the test A = vi
    Let Examples_vi be the subset of Examples that have value vi for A
    If Examples_vi is empty
        Then below this new branch add a leaf node with label = most common
        value of Target_attribute in Examples
    Else below this new branch add the subtree
        ID3(Examples_vi, Target_attribute, Attributes – {A})
End
Return Root
INFORMATION GAIN:
Entropy(S) = Σi −pi log2 pi, where pi is the proportion of S belonging to class i
Gain(S, A) = Entropy(S) − Σv∈Values(A) (|Sv| / |S|) · Entropy(Sv)
Training Dataset:
Test Dataset:
import math
import csv
def load_csv(filename):
    lines = csv.reader(open(filename, "r"))
    dataset = list(lines)
    headers = dataset.pop(0)
    return dataset, headers

class Node:
    def __init__(self, attribute):
        self.attribute = attribute
        self.children = []
        self.answer = ""

def subtables(data, col, delete):
    dic = {}
    coldata = [row[col] for row in data]
    attr = list(set(coldata))
    counts = [0] * len(attr)
    r = len(data)
    c = len(data[0])
    for x in range(len(attr)):
        for y in range(r):
            if data[y][col] == attr[x]:
                counts[x] += 1
    for x in range(len(attr)):
        dic[attr[x]] = [[0 for i in range(c)] for j in range(counts[x])]
        pos = 0
        for y in range(r):
            if data[y][col] == attr[x]:
                if delete:
                    del data[y][col]
                dic[attr[x]][pos] = data[y]
                pos += 1
    return attr, dic

def entropy(S):
    attr = list(set(S))
    if len(attr) == 1:
        return 0
    counts = [0, 0]
    for i in range(2):
        counts[i] = sum([1 for x in S if attr[i] == x]) / (len(S) * 1.0)
    sums = 0
    for cnt in counts:
        sums += -1 * cnt * math.log(cnt, 2)
    return sums
def compute_gain(data, col):
    attr, dic = subtables(data, col, delete=False)
    total_size = len(data)
    entropies = [0] * len(attr)
    ratio = [0] * len(attr)
    total_entropy = entropy([row[-1] for row in data])
    for x in range(len(attr)):
        ratio[x] = len(dic[attr[x]]) / (total_size * 1.0)
        entropies[x] = entropy([row[-1] for row in dic[attr[x]]])
        total_entropy -= ratio[x] * entropies[x]
    return total_entropy
def build_tree(data, features):
    lastcol = [row[-1] for row in data]
    if (len(set(lastcol))) == 1:
        node = Node("")
        node.answer = lastcol[0]
        return node
    n = len(data[0]) - 1
    gains = [0] * n
    for col in range(n):
        gains[col] = compute_gain(data, col)
    split = gains.index(max(gains))
    node = Node(features[split])
    fea = features[:split] + features[split+1:]
    attr, dic = subtables(data, split, delete=True)
    for x in range(len(attr)):
        child = build_tree(dic[attr[x]], fea)
        node.children.append((attr[x], child))
    return node

def print_tree(node, level):
    if node.answer != "":
        print(" " * level, node.answer)
        return
    print(" " * level, node.attribute)
    for value, n in node.children:
        print(" " * (level + 1), value)
        print_tree(n, level + 2)

def classify(node, x_test, features):
    if node.answer != "":
        print(node.answer)
        return
    pos = features.index(node.attribute)
    for value, n in node.children:
        if x_test[pos] == value:
            classify(n, x_test, features)
'''Main program'''
dataset, features = load_csv("data3.csv")
node1 = build_tree(dataset, features)
print_tree(node1, 0)
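The problem statement also asks to classify a new sample; a sketch of the usual test loop
(the test-file name 'data3_test.csv' is an assumption):
testdata, features_test = load_csv("data3_test.csv")
for xtest in testdata:
    print("The test instance:", xtest)
    print("The label for test instance:", end=" ")
    classify(node1, xtest, features)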
Output:
Outlook
 rain
  Wind
   strong
    no
   weak
    yes
 overcast
  yes
 sunny
  Humidity
   normal
    yes
   high
    no
4. Build an Artificial Neural Network by implementing the Backpropagation algorithm
and test the same using appropriate data sets.
BACKPROPAGATION Algorithm
BACKPROPAGATION (training_examples, η, nin, nhidden, nout)
Each training example is a pair of the form (x⃗, t⃗ ), where x⃗ is the vector of network
input values, and t⃗ is the vector of target network output values.
η is the learning rate (e.g., .05). nin is the number of network inputs, nhidden the number
of units in the hidden layer, and nout the number of output units.
The input from unit i into unit j is denoted xji, and the weight from unit i to unit j is
denoted wji.
Create a feed-forward network with nin inputs, nhidden hidden units, and nout output
units.
Initialize all network weights to small random numbers.
Until the termination condition is met, Do
    For each (x⃗, t⃗ ) in training_examples, Do
        Propagate the input forward through the network:
        1. Input the instance x⃗ to the network and compute the output ou of every unit u
        Propagate the errors backward through the network:
        2. For each network output unit k, calculate its error term δk ← ok(1 − ok)(tk − ok)
        3. For each hidden unit h, calculate its error term δh ← oh(1 − oh) Σk∈outputs wkh δk
        4. Update each network weight wji ← wji + Δwji, where Δwji = η δj xji
Training Examples:
Example   Sleep   Study   Expected % in Exams
1         2       9       92
2         1       5       86
3         3       6       89
Program:
import numpy as np
X = np.array(([2, 9], [1, 5], [3, 6]), dtype=float)
y = np.array(([92], [86], [89]), dtype=float)
X = X/np.amax(X,axis=0) # maximum of X array longitudinally
y = y/100
#Sigmoid Function
def sigmoid(x):
    return 1/(1 + np.exp(-x))

#Derivative of sigmoid (in terms of the sigmoid output), used in backpropagation
def derivatives_sigmoid(x):
    return x * (1 - x)
#Variable initialization
epoch=5000 #Setting training iterations
lr=0.1 #Setting learning rate
inputlayer_neurons = 2 #number of features in data set
hiddenlayer_neurons = 3 #number of hidden layers neurons
output_neurons = 1 #number of neurons at output layer
#weight and bias initialization
wh = np.random.uniform(size=(inputlayer_neurons, hiddenlayer_neurons))
bh = np.random.uniform(size=(1, hiddenlayer_neurons))
wout = np.random.uniform(size=(hiddenlayer_neurons, output_neurons))
bout = np.random.uniform(size=(1, output_neurons))
for i in range(epoch):
    #Forward Propagation
    hinp1 = np.dot(X, wh)
    hinp = hinp1 + bh
    hlayer_act = sigmoid(hinp)
    outinp1 = np.dot(hlayer_act, wout)
    outinp = outinp1 + bout
    output = sigmoid(outinp)
    #Backpropagation
    EO = y - output
    outgrad = derivatives_sigmoid(output)
    d_output = EO * outgrad
    EH = d_output.dot(wout.T)
    hiddengrad = derivatives_sigmoid(hlayer_act)  # how much the hidden layer contributed to the error
    d_hiddenlayer = EH * hiddengrad
    wout += hlayer_act.T.dot(d_output) * lr  # update the weights layer by layer
    wh += X.T.dot(d_hiddenlayer) * lr

print("Input: \n" + str(X))
print("Actual Output: \n" + str(y))
print("Predicted Output: \n", output)
Output:
Input:
[[0.66666667 1. ]
[0.33333333 0.55555556]
[1. 0.66666667]]
Actual Output:
[[0.92]
[0.86]
[0.89]]
Predicted Output:
[[0.89726759]
[0.87196896]
[0.9000671]]
5. Write a program to implement the naïve Bayesian classifier for a sample training data set
stored as a .CSV file. Compute the accuracy of the classifier, considering a few test data sets.
Bayes’ Theorem is stated as:
P(h|D) = ( P(D|h) · P(h) ) / P(D)
Where,
P(h|D) is the probability of hypothesis h given the data D. This is called the posterior
probability.
P(D|h) is the probability of data D given that the hypothesis h was true.
P(h) is the probability of hypothesis h being true. This is called the prior probability of h.
P(D) is the probability of the data. This is called the prior probability of D.
In many learning scenarios, the learner is interested in finding the most probable
hypothesis h ∈ H given the observed data D. Any such maximally probable hypothesis is
called a maximum a posteriori (MAP) hypothesis. Using Bayes theorem to calculate the
posterior probability of each candidate hypothesis, hMAP is a MAP hypothesis provided
hMAP = argmax h∈H P(h|D) = argmax h∈H P(D|h) · P(h)
A Gaussian Naive Bayes algorithm is a special type of Naïve Bayes algorithm. It is used
when the features have continuous values, and it assumes that every feature follows a
Gaussian (normal) distribution:
P(x | class) = (1 / √(2πσ²)) · exp(−(x − μ)² / (2σ²))
This means that in addition to the probabilities for each class, we must also store the mean
μ and standard deviation σ of each input variable for each class.
Sample Examples:
Example  Pregnancies  Glucose  BloodPressure  SkinThickness  Insulin  BMI   DiabetesPedigreeFunction  Age  Outcome
1        6            148      72             35             0        33.6  0.627                     50   1
2        1            85       66             29             0        26.6  0.351                     31   0
3        8            183      64             0              0        23.3  0.672                     32   1
4        1            89       66             23             94       28.1  0.167                     21   0
5        0            137      40             35             168      43.1  2.288                     33   1
6        5            116      74             0              0        25.6  0.201                     30   0
7        3            78       50             32             88       31    0.248                     26   1
8        10           115      0              0              0        35.3  0.134                     29   0
9        2            197      70             45             543      30.5  0.158                     53   1
10       8            125      96             0              0        0     0.232                     54   1
Program:
import csv
import random
import math
def loadcsv(filename):
    lines = csv.reader(open(filename, "r"))
    dataset = list(lines)
    for i in range(len(dataset)):
        # converting strings into numbers for processing
        dataset[i] = [float(x) for x in dataset[i]]
    return dataset

def separatebyclass(dataset):
    separated = {}  # dictionary of classes 1 and 0
    # creates a dictionary of classes 1 and 0 where the values are
    # the instances belonging to each class
    for i in range(len(dataset)):
        vector = dataset[i]
        if (vector[-1] not in separated):
            separated[vector[-1]] = []
        separated[vector[-1]].append(vector)
    return separated

def mean(numbers):
    return sum(numbers)/float(len(numbers))

def stdev(numbers):
    avg = mean(numbers)
    variance = sum([pow(x-avg, 2) for x in numbers])/float(len(numbers)-1)
    return math.sqrt(variance)

def summarize(dataset):  # creates a list of (mean, stdev) tuples per attribute
    summaries = [(mean(attribute), stdev(attribute)) for attribute in zip(*dataset)]
    del summaries[-1]  # excluding labels +ve or -ve
    return summaries

def summarizebyclass(dataset):
    separated = separatebyclass(dataset)
    # summaries is a dict mapping each class value to its (mean, std) tuples
    summaries = {}
    for classvalue, instances in separated.items():
        summaries[classvalue] = summarize(instances)  # mean and std per attribute
    return summaries
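# The extraction jumps from summarizebyclass straight to getaccuracy; the
# helper functions below are a sketch of the standard versions that main()
# relies on.
def splitdataset(dataset, splitratio):
    # split the data set randomly into train and test parts
    trainsize = int(len(dataset) * splitratio)
    trainset = []
    copy = list(dataset)
    while len(trainset) < trainsize:
        index = random.randrange(len(copy))
        trainset.append(copy.pop(index))
    return [trainset, copy]

def calculateprobability(x, mean, stdev):
    # Gaussian probability density of attribute value x
    exponent = math.exp(-(math.pow(x - mean, 2) / (2 * math.pow(stdev, 2))))
    return (1 / (math.sqrt(2 * math.pi) * stdev)) * exponent

def calculateclassprobabilities(summaries, inputvector):
    # multiply the per-attribute densities for each class
    probabilities = {}
    for classvalue, classsummaries in summaries.items():
        probabilities[classvalue] = 1
        for i in range(len(classsummaries)):
            mean, stdev = classsummaries[i]
            probabilities[classvalue] *= calculateprobability(inputvector[i], mean, stdev)
    return probabilities

def predict(summaries, inputvector):
    # pick the class with the largest probability
    probabilities = calculateclassprobabilities(summaries, inputvector)
    bestlabel, bestprob = None, -1
    for classvalue, probability in probabilities.items():
        if bestlabel is None or probability > bestprob:
            bestprob = probability
            bestlabel = classvalue
    return bestlabel

def getpredictions(summaries, testset):
    return [predict(summaries, testset[i]) for i in range(len(testset))]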
def getaccuracy(testset, predictions):
    correct = 0
    for i in range(len(testset)):
        if testset[i][-1] == predictions[i]:
            correct += 1
    return (correct/float(len(testset))) * 100.0
def main():
    filename = 'naivedata.csv'
    splitratio = 0.67
    dataset = loadcsv(filename)
    trainingset, testset = splitdataset(dataset, splitratio)
    print('Split {0} rows into train={1} and test={2} rows'.format(
        len(dataset), len(trainingset), len(testset)))
    # prepare model
    summaries = summarizebyclass(trainingset)
    # test model
    predictions = getpredictions(summaries, testset)
    accuracy = getaccuracy(testset, predictions)
    print('Accuracy of the classifier is: {0}%'.format(accuracy))

main()
Output:
6. Assuming a set of documents that need to be classified, use the naïve Bayesian Classifier
model to perform this task. Built-in Java classes/API can be used to write the program.
Calculate the accuracy, precision, and recall for your data set.
LEARN_NAIVE_BAYES_TEXT (Examples, V)
Examples is a set of text documents along with their target values. V is the set of all
possible target values. This function learns the probability terms P(wk|vj), describing the
probability that a randomly drawn word from a document in class vj will be the English
word wk. It also learns the class prior probabilities P(vj).
1. Collect all words, punctuation, and other tokens that occur in Examples
   Vocabulary ← the set of all distinct words and other tokens occurring in any text
   document from Examples
2. Calculate the required P(vj) and P(wk|vj) probability terms
   For each target value vj in V do
       docsj ← the subset of documents from Examples for which the target value is vj
       P(vj) ← |docsj| / |Examples|
       Textj ← a single document created by concatenating all members of docsj
       n ← total number of distinct word positions in Textj
       For each word wk in Vocabulary
           nk ← number of times word wk occurs in Textj
           P(wk|vj) ← (nk + 1) / (n + |Vocabulary|)
CLASSIFY_NAIVE_BAYES_TEXT (Doc)
positions ← all word positions in Doc that contain tokens found in Vocabulary
Return vNB, where
vNB = argmax vj∈V  P(vj) · ∏ i∈positions P(ai|vj)
Program:
import pandas as pd
msg=pd.read_csv('naivetext.csv',names=['message','label'])
msg['labelnum']=msg.label.map({'pos':1,'neg':0})
X=msg.message
y=msg.labelnum
print(X)
print(y)
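# The extraction jumps from the label printout to the document-term
# DataFrame; a minimal bridge, assuming sklearn's train_test_split and
# CountVectorizer as in the standard version of this program.
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import CountVectorizer

# split the messages into train and test sets
xtrain, xtest, ytrain, ytest = train_test_split(X, y)

# turn each message into a vector of word counts
count_vect = CountVectorizer()
xtrain_dtm = count_vect.fit_transform(xtrain)
xtest_dtm = count_vect.transform(xtest)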
df = pd.DataFrame(xtrain_dtm.toarray(), columns=count_vect.get_feature_names())
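# The training and evaluation steps are also missing from the extraction;
# a sketch using sklearn's MultinomialNB, whose printed metrics line up
# with the output below.
from sklearn.naive_bayes import MultinomialNB
from sklearn import metrics

clf = MultinomialNB().fit(xtrain_dtm, ytrain)
predicted = clf.predict(xtest_dtm)

print('Accuracy of the classifier is', metrics.accuracy_score(ytest, predicted))
print('Confusion matrix')
print(metrics.confusion_matrix(ytest, predicted))
print('The value of Precision', metrics.precision_score(ytest, predicted))
print('The value of Recall', metrics.recall_score(ytest, predicted))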
Output:
Accuracy of the classifier is 0.8
Confusion matrix
[[2 1]
 [0 2]]
The value of Precision 0.6666666666666666
The value of Recall 1.0
Basic knowledge
Unique words (the vocabulary, |Vocabulary| = 10):
< I, loved, the, movie, hated, a, great, good, poor, acting >

Doc   I   loved  the  movie  hated  a   great  good  poor  acting  Class
1     1   1      1    1      -      -   -      -     -     -       +
2     1   -      1    1      1      -   -      -     -     -       -
3     -   -      -    2      -      1   1      1     -     -       +
4     -   -      -    -      -      -   -      -     1     1       -
5     -   -      -    1      -      1   1      1     -     1       +

Documents with class +:
Doc   I   loved  the  movie  hated  a   great  good  poor  acting  Class
1     1   1      1    1      -      -   -      -     -     -       +
3     -   -      -    2      -      1   1      1     -     -       +
5     -   -      -    1      -      1   1      1     -     1       +

P(+) = 3/5 = 0.6

Documents with class -:
Doc   I   loved  the  movie  hated  a   great  good  poor  acting  Class
2     1   -      1    1      1      -   -      -     -     -       -
4     -   -      -    -      -      -   -      -     1     1       -

P(-) = 2/5 = 0.4

Each word probability is estimated with Laplace smoothing, P(wk|vj) = (nk + 1)/(n + |Vocabulary|).
The negative documents contain n = 6 word positions in total, so, for example:

P(movie|-) = (1 + 1)/(6 + 10) = 0.125        P(poor|-) = (1 + 1)/(6 + 10) = 0.125

For a new document containing the words I, hated, the, poor, acting, then,

P(-) P(I|-) P(hated|-) P(the|-) P(poor|-) P(acting|-)
    = 0.4 * 0.125 * 0.125 * 0.125 * 0.125 * 0.125
    = 1.22 × 10^-5
7. Write a program to construct a Bayesian network considering medical data. Use this model
to demonstrate the diagnosis of heart patients using standard Heart Disease Data Set. You can
use Java/Python ML library classes/API
Theory
A Bayesian network is a directed acyclic graph in which each
edge corresponds to a conditional dependency, and each node
corresponds to a unique random variable.
Program:
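The program body is missing from the extraction; a sketch of the usual pgmpy-based
version, assuming a 'heart.csv' copy of the UCI Heart Disease data (the column names and
the network structure below are assumptions based on common versions of this program):
import pandas as pd
from pgmpy.models import BayesianModel  # BayesianNetwork in newer pgmpy releases
from pgmpy.estimators import MaximumLikelihoodEstimator
from pgmpy.inference import VariableElimination

heartdisease = pd.read_csv('heart.csv')

# each directed edge encodes a conditional dependency
model = BayesianModel([('age', 'heartdisease'), ('sex', 'heartdisease'),
                       ('exang', 'heartdisease'), ('cp', 'heartdisease'),
                       ('heartdisease', 'restecg'), ('heartdisease', 'chol')])
model.fit(heartdisease, estimator=MaximumLikelihoodEstimator)

# query the network: probability of heart disease given observed evidence
infer = VariableElimination(model)
q = infer.query(variables=['heartdisease'], evidence={'restecg': 1})
print(q)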
8. Apply EM algorithm to cluster a set of data stored in a .CSV file. Use the same data set
for clustering using the k-Means algorithm. Compare the results of these two algorithms
and comment on the quality of clustering. You can use Java/Python ML library classes/API.
Theory
Expectation-Maximization (EM) fits a Gaussian mixture model by alternating between
assigning each point a soft responsibility for every cluster (E-step) and re-estimating the
Gaussian parameters from those responsibilities (M-step). k-Means is its hard-assignment
counterpart: each point is assigned to its nearest centroid, and the centroids are recomputed
as cluster means.
Program:
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
from sklearn import datasets, preprocessing
from sklearn.cluster import KMeans
from sklearn.mixture import GaussianMixture

iris = datasets.load_iris()
X = pd.DataFrame(iris.data)
X.columns = ['Sepal_Length', 'Sepal_Width', 'Petal_Length', 'Petal_Width']
y = pd.DataFrame(iris.target)
y.columns = ['Targets']

model = KMeans(n_clusters=3)
model.fit(X)
plt.figure(figsize=(14,7))
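# The extraction skips from plt.figure to the GMM prediction; a sketch of
# the usual middle section (the colormap and the standardization step are
# taken from common versions of this program).
colormap = np.array(['red', 'lime', 'black'])

# Plot 1: the true iris classes
plt.subplot(2, 2, 1)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[y.Targets], s=40)
plt.title('Real Classification')

# Plot 2: the k-Means clustering
plt.subplot(2, 2, 2)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[model.labels_], s=40)
plt.title('K Mean Classification')

# standardize the features, then fit a Gaussian mixture with EM
scaler = preprocessing.StandardScaler()
scaler.fit(X)
xs = pd.DataFrame(scaler.transform(X), columns=X.columns)
gmm = GaussianMixture(n_components=3)
gmm.fit(xs)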
y_gmm = gmm.predict(xs)
#y_cluster_gmm
plt.subplot(2, 2, 3)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[y_gmm], s=40)
plt.title('GMM Classification')
plt.xlabel('Petal Length')
plt.ylabel('Petal Width')
plt.show()
9. Write a program to implement k-Nearest Neighbour algorithm to classify the iris data set.
Print both correct and wrong predictions. Java/Python ML library classes can be used for this
problem.
Training algorithm:
For each training example (x, f(x)), add the example to the list training_examples
Classification algorithm:
Given a query instance xq to be classified,
Let x1 . . . xk denote the k instances from training_examples that are nearest to xq
Return
f̂(xq) ← argmax v∈V Σ(i=1..k) δ(v, f(xi)),  where δ(a, b) = 1 if a = b and 0 otherwise,
i.e., the most common class value among the k nearest training examples. (For a
real-valued target function, the mean value of the k nearest training examples is
returned instead.)
Data Set:
""" Iris Plants Dataset, dataset contains 150 (50 in each of three
classes)Number of Attributes: 4 numeric, predictive attributes and
the Class
"""
from sklearn import datasets
from sklearn.model_selection import train_test_split

iris = datasets.load_iris()
""" The x variable contains the first four columns of the dataset
(i.e. attributes) while y contains the labels.
"""
x = iris.data
y = iris.target
""" Splits the dataset into 70% train data and 30% test data. This
means that out of total 150 records, the training set will contain
105 records and the test set contains 45 of those records
"""
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.3)
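The fitting and evaluation steps are missing from the extraction; a sketch of the usual
remainder of this program (k = 5 is an assumption):
from sklearn.neighbors import KNeighborsClassifier
from sklearn import metrics

classifier = KNeighborsClassifier(n_neighbors=5)
classifier.fit(x_train, y_train)
y_pred = classifier.predict(x_test)

# print both correct and wrong predictions, as the problem statement asks
for i in range(len(x_test)):
    result = 'Correct' if y_pred[i] == y_test[i] else 'Wrong'
    print(x_test[i], 'predicted:', y_pred[i], 'actual:', y_test[i], '->', result)

print('Confusion Matrix')
print(metrics.confusion_matrix(y_test, y_pred))
print('Accuracy Metrics')
print(metrics.classification_report(y_test, y_pred))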
Confusion Matrix
[[20 0 0]
[ 0 10 0]
[ 0 1 14]]
Accuracy Metrics
True positives: data points labelled as positive that are actually positive
False positives: data points labelled as positive that are actually negative
True negatives: data points labelled as negative that are actually negative
False negatives: data points labelled as negative that are actually positive
Precision = TP / (TP + FP)        Recall = TP / (TP + FN)
F1-Score: F1 = 2 × (Precision × Recall) / (Precision + Recall)
10. Implement the non-parametric Locally Weighted Regression algorithm in order to fit data
points. Select appropriate data set for your experiment and draw graphs.
Regression:
Regression is a technique from statistics that is used to predict values of a desired
target quantity when the target quantity is continuous.
In regression, we seek to identify (or estimate) a continuous variable y associated with
a given input vector x.
y is called the dependent variable.
x is called the independent variable.
Loess/Lowess Regression:
Loess regression is a nonparametric technique that uses
local weighted regression to fit a smooth curve through
points in a scatter plot.
Lowess Algorithm:
Locally weighted regression is a very powerful nonparametric model used in
statistical learning.
Given a dataset X, y, we attempt to find a model parameter β(x) that minimizes the
residual sum of weighted squared errors.
The weights are given by a kernel function (k or w), which can be chosen arbitrarily.
Algorithm
1. Read the given data sample into X and the curve values (linear or non-linear) into Y
2. Set the value of the smoothing parameter (free parameter) τ
3. Set the point of interest x0, which is a subset of X
4. Determine the weight matrix W using:
   w(x, x0) = exp(−(x − x0)² / (2τ²))
5. Determine the value of the model parameter β using:
   β = (XᵀWX)⁻¹ XᵀWy
6. Prediction = x0 · β
Program:
import numpy as np
from bokeh.plotting import figure, show, output_notebook
from bokeh.layouts import gridplot
from bokeh.io import push_notebook
n = 1000
# generate dataset
X = np.linspace(-3, 3, num=n)
print("The Data Set ( 10 Samples) X :\n",X[1:10])
Y = np.log(np.abs(X ** 2 - 1) + .5)
print("The Fitting Curve Data Set (10 Samples) Y
:\n",Y[1:10])
# jitter X
X += np.random.normal(scale=.1, size=n)
print("Normalised (10 Samples) X :\n",X[1:10])
show(gridplot([
[plot_lwr(10.), plot_lwr(1.)],
[plot_lwr(0.1), plot_lwr(0.01)]]))
Output
Program (an alternative implementation, using the tips data set):
# -*- coding: utf-8 -*-
"""
Spyder Editor
This is a temporary script file.
"""
from numpy import *
from os import listdir
import matplotlib
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np1
import numpy.linalg as np
from scipy.stats.stats import pearsonr

def kernel(point, xmat, k):
    m, n = np1.shape(xmat)
    weights = np1.mat(np1.eye((m)))
    for j in range(m):
        diff = point - xmat[j]
        weights[j, j] = np1.exp(diff * diff.T / (-2.0 * k**2))
    return weights

def localWeight(point, xmat, ymat, k):
    wei = kernel(point, xmat, k)
    W = (xmat.T * (wei * xmat)).I * (xmat.T * (wei * ymat.T))
    return W

def localWeightRegression(xmat, ymat, k):
    m, n = np1.shape(xmat)
    ypred = np1.zeros(m)
    for i in range(m):
        ypred[i] = xmat[i] * localWeight(xmat[i], xmat, ymat, k)
    return ypred

# load the data set and read the bill and tip columns
# (the file name 'tips.csv' is the usual choice for this program)
data = pd.read_csv('tips.csv')
bill = np1.array(data.total_bill)
tip = np1.array(data.tip)

# preparing X: add a column of ones in front of bill
mbill = np1.mat(bill)
mtip = np1.mat(tip)  # mat converts the 1-D arrays to 2-D matrix form
m = np1.shape(mbill)[1]  # print(m): all 244 records are counted in m
one = np1.mat(np1.ones(m))
X = np1.hstack((one.T, mbill.T))  # create a stack of ones and bill
# print(X)

# set k (the bandwidth) here
ypred = localWeightRegression(X, mtip, 0.3)
SortIndex = X[:, 1].argsort(0)
xsort = X[SortIndex][:, 0]

fig = plt.figure()
ax = fig.add_subplot(1, 1, 1)
ax.scatter(bill, tip, color='green')
ax.plot(xsort[:, 1], ypred[SortIndex], color='red', linewidth=5)
plt.xlabel('Total bill')
plt.ylabel('Tip')
plt.show()