0% found this document useful (0 votes)

3 views

DMDW Lab8

The document outlines the K-means clustering algorithm, explaining its purpose, methodology, and challenges in determining the optimal number of clusters. It provides a Python implementation of the K-means algorithm using the Iris dataset, including data scaling, cluster assignment, and centroid calculation. Additionally, it visualizes the clustering results and displays the centroids of the clusters.

Uploaded by

jagnoorsm.cs.22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

DMDW Lab8

Uploaded by

jagnoorsm.cs.22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

LAB - 8

1. Demonstrate the K-means clustering using the WEKA tool.

K-Means clustering is an unsupervised learning algorithm used to partition data into k distinct
clusters based on similarity. The algorithm works by first randomly selecting k initial centroids,
then iteratively assigning each data point to the nearest centroid, and recalculating the centroids
as the mean of all points in each cluster. This process is repeated until the centroids no longer
change significantly, signaling convergence. The goal is to minimize the sum of squared
distances between data points and their assigned centroids, which is known as the objective
function or inertia.

One of the challenges in K-Means is determining the optimal number of clusters, k. Methods like
the Elbow Method, Silhouette Score, and Gap Statistic help identify a good value for k by
assessing how well the data is grouped. While K-Means is computationally efficient and widely
used in various applications, it has some limitations, such as sensitivity to the initialization of
centroids, difficulty handling non-spherical clusters, and its sensitivity to outliers. Despite these
drawbacks, K-Means remains a powerful tool for clustering in fields like customer segmentation,
document clustering, and image compression.
2. Implement K-means clustering algorithm using python.

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler

iris = load_iris()
X = iris.data
y = iris.target

scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

class KMeans:
def __init__(self, n_clusters=3, max_iters=100):
self.n_clusters = n_clusters
self.max_iters = max_iters

def fit(self, X):

random_idx = np.random.permutation(len(X))[:self.n_clusters]
self.centroids = X[random_idx]

for _ in range(self.max_iters):
self.labels = self._assign_clusters(X)
new_centroids = self._calculate_centroids(X)
if np.all(new_centroids == self.centroids):
break
self.centroids = new_centroids

def _assign_clusters(self, X):

distances = np.linalg.norm(X[:, np.newaxis] - self.centroids, axis=2)
return np.argmin(distances, axis=1)

def _calculate_centroids(self, X):

centroids = np.zeros((self.n_clusters, X.shape[1]))
for i in range(self.n_clusters):
if np.any(self.labels == i):
centroids[i] = X[self.labels == i].mean(axis=0)
return centroids

def predict(self, X):

return self._assign_clusters(X)

kmeans = KMeans(n_clusters=3)
kmeans.fit(X_scaled)

plt.scatter(X_scaled[:, 0], X_scaled[:, 1], c=kmeans.labels, cmap='viridis',

marker='o')
plt.scatter(kmeans.centroids[:, 0], kmeans.centroids[:, 1], c='red', marker='x',
s=100)
plt.title('K-Means Clustering on Iris Dataset (First Two Features)')
plt.xlabel('Feature 1')
plt.ylabel('Feature 2')
plt.show()

centroids_df = pd.DataFrame(kmeans.centroids, columns=['Feature 1', 'Feature 2',

'Feature 3', 'Feature 4'])
centroids_df.index.name = 'Cluster'
print("\nCentroids:")
print(centroids_df)

E-Motors Business Plan
No ratings yet
E-Motors Business Plan
28 pages
BS en 12542-2020
100% (2)
BS en 12542-2020
72 pages
Function Generator Project Using IC741
100% (3)
Function Generator Project Using IC741
5 pages
Fidelis Endpoint: SIEM Integrations Guide
No ratings yet
Fidelis Endpoint: SIEM Integrations Guide
53 pages
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
ML Minors Exp7
No ratings yet
ML Minors Exp7
6 pages
2.3 Aiml Rishit
No ratings yet
2.3 Aiml Rishit
7 pages
K Means
No ratings yet
K Means
3 pages
DS - ML - 7 - 60019210046 1
No ratings yet
DS - ML - 7 - 60019210046 1
6 pages
K++
No ratings yet
K++
5 pages
ML 7
No ratings yet
ML 7
2 pages
ML Exp5 C36
No ratings yet
ML Exp5 C36
18 pages
Clustering
No ratings yet
Clustering
1 page
Lab-7_Clustering
No ratings yet
Lab-7_Clustering
4 pages
Assignment # 1: Performance Timeline of Flynn Taxonomy
No ratings yet
Assignment # 1: Performance Timeline of Flynn Taxonomy
21 pages
SE_KMeansClustering
No ratings yet
SE_KMeansClustering
21 pages
09.unsupervised Learning
No ratings yet
09.unsupervised Learning
50 pages
Lab6 instruction (1)
No ratings yet
Lab6 instruction (1)
3 pages
k means
No ratings yet
k means
4 pages
Data Science Analysis Final Project
No ratings yet
Data Science Analysis Final Project
10 pages
Detecting Patterns with Unsupervised Learning
No ratings yet
Detecting Patterns with Unsupervised Learning
21 pages
DOC-20250407-WA0033.
No ratings yet
DOC-20250407-WA0033.
38 pages
K Means Clustering
No ratings yet
K Means Clustering
11 pages
01 K Means - Merged
No ratings yet
01 K Means - Merged
26 pages
K-Means in Python - Solution
No ratings yet
K-Means in Python - Solution
6 pages
Experiment 4 1
No ratings yet
Experiment 4 1
4 pages
Unsupervisd Learning Algorithm
No ratings yet
Unsupervisd Learning Algorithm
6 pages
ML2 Practical List
No ratings yet
ML2 Practical List
80 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
K.means Clustering
No ratings yet
K.means Clustering
8 pages
DWM_EXP4
No ratings yet
DWM_EXP4
9 pages
AML - LAB (1-6)
No ratings yet
AML - LAB (1-6)
15 pages
Report 1
No ratings yet
Report 1
3 pages
MSC575 - Sabih - Uddin - WEEK 8 - LAB PDF
No ratings yet
MSC575 - Sabih - Uddin - WEEK 8 - LAB PDF
400 pages
Assignment 6 ML
No ratings yet
Assignment 6 ML
4 pages
KMEANS
No ratings yet
KMEANS
9 pages
Lecture 4
No ratings yet
Lecture 4
64 pages
02.1 K-Means Example
No ratings yet
02.1 K-Means Example
12 pages
K Means
100% (2)
K Means
329 pages
AAM 7th prac
No ratings yet
AAM 7th prac
4 pages
AdityaGaur BDA Exp8
No ratings yet
AdityaGaur BDA Exp8
4 pages
Building K-Means Clustering Algorithm From Scratch
No ratings yet
Building K-Means Clustering Algorithm From Scratch
10 pages
machine learning lab
No ratings yet
machine learning lab
20 pages
ML DSBA Lab7
No ratings yet
ML DSBA Lab7
6 pages
INTRO TO ML ASS
No ratings yet
INTRO TO ML ASS
3 pages
JAVIER KMeans Clustering Jupyter Notebook
No ratings yet
JAVIER KMeans Clustering Jupyter Notebook
7 pages
AML Clustering
No ratings yet
AML Clustering
7 pages
DA_EXP_10 (1)
No ratings yet
DA_EXP_10 (1)
6 pages
Lab Report6 - B21CI014
No ratings yet
Lab Report6 - B21CI014
8 pages
Ds Paper
No ratings yet
Ds Paper
35 pages
DA_EXP_10_66
No ratings yet
DA_EXP_10_66
6 pages
EXPERIMENT 9
No ratings yet
EXPERIMENT 9
10 pages
MLT Unit 3 Notes
No ratings yet
MLT Unit 3 Notes
19 pages
3.1 K - Means
No ratings yet
3.1 K - Means
16 pages
Experiment No 7
No ratings yet
Experiment No 7
4 pages
DA_EXP_10
No ratings yet
DA_EXP_10
6 pages
ADL LAB Manual
No ratings yet
ADL LAB Manual
27 pages
Assignment 4 A
No ratings yet
Assignment 4 A
15 pages
K Mean Clustering
No ratings yet
K Mean Clustering
27 pages
Unit-Iv Material
No ratings yet
Unit-Iv Material
24 pages
CV UNIT 4
No ratings yet
CV UNIT 4
60 pages
Unit-4
No ratings yet
Unit-4
46 pages
k Mean Clustering
No ratings yet
k Mean Clustering
32 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Grade 6 Class Program
No ratings yet
Grade 6 Class Program
2 pages
Hadi 2019 1
No ratings yet
Hadi 2019 1
11 pages
ACTION OF WIND AND WATER IN ARID AREAS
No ratings yet
ACTION OF WIND AND WATER IN ARID AREAS
155 pages
Lect 8 Simplex Method - 1
No ratings yet
Lect 8 Simplex Method - 1
32 pages
dlp-2019 5
No ratings yet
dlp-2019 5
1 page
Get (Ebook PDF) Service Management: Operations, Strategy, Information Technology 9th Edition Free All Chapters
75% (4)
Get (Ebook PDF) Service Management: Operations, Strategy, Information Technology 9th Edition Free All Chapters
51 pages
West Bengal State University: B.Sc./Part-I/Hons./CEMA-II/2017
No ratings yet
West Bengal State University: B.Sc./Part-I/Hons./CEMA-II/2017
4 pages
PDF (Ebook PDF) Accounting Information Systems 10th Edition Download
100% (4)
PDF (Ebook PDF) Accounting Information Systems 10th Edition Download
34 pages
A levels Zimsec HBC project titles by Sugar Boy
100% (1)
A levels Zimsec HBC project titles by Sugar Boy
14 pages
RBC Storage Lesions:: What They Are, and How We Can Minimize Them
No ratings yet
RBC Storage Lesions:: What They Are, and How We Can Minimize Them
53 pages
Results and Discussions
No ratings yet
Results and Discussions
4 pages
Corporate Governance Mechanisms and Firm Performance A Survey of Literature
No ratings yet
Corporate Governance Mechanisms and Firm Performance A Survey of Literature
16 pages
Differences in Uk and Us Systems
No ratings yet
Differences in Uk and Us Systems
1 page
Pge Sample Utility Bill - 2
No ratings yet
Pge Sample Utility Bill - 2
8 pages
Ecology and Systematic Zoology Advanced Animal Ecology
No ratings yet
Ecology and Systematic Zoology Advanced Animal Ecology
27 pages
Time Value of Money &capital Budgeting Decisions
No ratings yet
Time Value of Money &capital Budgeting Decisions
13 pages
Download Complete Museum Bodies The Politics and Practices of Visiting and Viewing 1st Edition Helen Rees Leahy PDF for All Chapters
100% (2)
Download Complete Museum Bodies The Politics and Practices of Visiting and Viewing 1st Edition Helen Rees Leahy PDF for All Chapters
76 pages
Slickline Care 3
100% (3)
Slickline Care 3
22 pages
Pulse Crop Book
No ratings yet
Pulse Crop Book
24 pages
Use of A Supplemental Feeding Tube Device And.7
No ratings yet
Use of A Supplemental Feeding Tube Device And.7
7 pages
Reading
No ratings yet
Reading
5 pages
People v. BArtolome - Convicted-Buy Bust Operation
No ratings yet
People v. BArtolome - Convicted-Buy Bust Operation
11 pages
[RM 07] KGK Subba Rao (2003) - Indian Statistical System at Crossroads
0% (1)
[RM 07] KGK Subba Rao (2003) - Indian Statistical System at Crossroads
5 pages
Effect of Flooding On Property Value A Case Study of Isheri North, Isheri, Lagos State
No ratings yet
Effect of Flooding On Property Value A Case Study of Isheri North, Isheri, Lagos State
7 pages
WESTPAK Laboratory Package Drop Testing v2.1-1
No ratings yet
WESTPAK Laboratory Package Drop Testing v2.1-1
11 pages
10 Simple C++ Programs
No ratings yet
10 Simple C++ Programs
9 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

DMDW Lab8

Uploaded by

DMDW Lab8

Uploaded by

LAB - 8

1. Demonstrate the K-means clustering using the WEKA tool.

def fit(self, X):

def _assign_clusters(self, X):

def _calculate_centroids(self, X):

def predict(self, X):

plt.scatter(X_scaled[:, 0], X_scaled[:, 1], c=kmeans.labels, cmap='viridis',

centroids_df = pd.DataFrame(kmeans.centroids, columns=['Feature 1', 'Feature 2',

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.