0% found this document useful (0 votes)

14 views10 pages

Compute2

The document discusses performing different clustering algorithms on a dataset including K-means clustering for various values of K, fuzzy C-means clustering, bottom-up clustering using agglomerative clustering and generating dendrograms, and density-based clustering using DBSCAN. Clusters from each method are captured and compared in a summary.

Uploaded by

lil Aady

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views10 pages

Compute2

Uploaded by

lil Aady

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 10

1st task: perform data cleaning, if any, in the dataset.

As the data is already cleaned so there is no requirement of data cleaning

import numpy as np
import pandas as pd
import os

x = pd.read_csv('data.csv')

x.dropna()
print(x)

2nd task: perform K-means Clustering for K=3,5,7 and also Fuzzy C means. Capture the Clusters
generated with Both K Means & C means.

import pandas as pd
from sklearn.cluster import KMeans

from sklearn.preprocessing import StandardScaler

from sklearn.decomposition import PCA
from fcmeans import FCM

import matplotlib.pyplot as plt

z = pd.read_csv('data.csv')

X = z.iloc[:, 1:59].values

scale = StandardScaler()
X = scale.fit_transform(X)

pca = PCA(n_components=2)
X_pca = pca.fit_transform(X)

k_values = [3, 5, 7]
fuzzy_cmeans_c = [3, 5, 7]

for k in k_values:
kmeans = KMeans(n_clusters=k, random_state=42)
y_kmeans = kmeans.fit_predict(X_pca)

plt.figure(figsize=(6, 4))
plt.scatter(X_pca[:, 0], X_pca[:, 1], c=y_kmeans, cmap='viridis')
plt.title('K-means clustering (K = ' + str(k) + ')')
plt.xlabel('Principal Component 1')
plt.ylabel('Principal Component 2')
plt.show()
for c in fuzzy_cmeans_c:
fcm = FCM(n_clusters=10, m=c)
fcm.fit(X_pca)
y_fcm = fcm.predict(X_pca)

plt.figure(figsize=(6, 4))
plt.scatter(X_pca[:, 0], X_pca[:, 1], c=y_fcm, cmap='viridis')
plt.title('Fuzzy C means clustering (c = ' + str(c) + ')')
plt.xlabel('Principal Component 1')
plt.ylabel('Principal Component 2')
plt.show()

OUTPUT
3rd task: perform Bottom-up Clustering (Agglomerative clustering). Capture the Clusters generated
at a different level, and also prepare dendrograms.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import AgglomerativeClustering
from scipy.cluster.hierarchy import dendrogram, linkage

Data = pd.read_csv('data.csv')

mfs = np.array(Data.iloc[:, 1:59])

agg_clustering = AgglomerativeClustering(n_clusters=None, linkage='ward',

distance_threshold=0)
cluster = agg_clustering.fit_predict(mfs)

linked = linkage(mfs, method='ward')

dendrogram(linked, truncate_mode='lastp', p=30, orientation='top')
plt.show()
for i in range(2, 12):
clustering = AgglomerativeClustering(n_clusters=i, linkage='ward')
clustering.fit(mfs)
print(f'Clusters at level {i}: {clustering.labels_}')

# Observations:

# The dendogram shows hierarchy of clusters formed by the agglomerative

clustering algorithm
# The clusters start merging from the bottom level and go up to the top level.
# At the top level, we can see that all the data points belong to a single
cluster.
# By looking at the dendrogram, we can choose the appropriate level to get the
desired number of clusters.
# We can also see that at each level, the clustering algorithm forms a
different set of clusters based on the distance threshold and linkage
criterion used.
4rth task: perform density-based (DBSCAN) Clustering,
5th Task: prepare a brief Comparative summary of clusters generated using the above clustering
techniques.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import DBSCAN
from sklearn.decomposition import PCA

data = pd.read_csv('data.csv')

mfs = np.array(data.iloc[:, 1:-1])

dbscn_clustering = DBSCAN(eps=20, min_samples=5)

clusters = dbscn_clustering.fit_predict(mfs)

pc = PCA(n_components=2)
reduced_features = pc.fit_transform(mfs)

plt.scatter(reduced_features[:, 0], reduced_features[:, 1], c=clusters,

cmap='viridis')
plt.show()

# Observations:
# DBSCAN clustering algorithm forms clusters based on the density of the data
points.
# The resulting clusters are not well

Business Report Data Mining
91% (11)
Business Report Data Mining
18 pages
Design Technology and Innovation
50% (2)
Design Technology and Innovation
52 pages
ManageEngine OS Deployer
No ratings yet
ManageEngine OS Deployer
150 pages
Bredal B2 B8
100% (1)
Bredal B2 B8
42 pages
Unsupervisd Learning Algorithm
No ratings yet
Unsupervisd Learning Algorithm
6 pages
Clustering Algorithms CheatSheet 1710438661
No ratings yet
Clustering Algorithms CheatSheet 1710438661
6 pages
From Import Import As Import As From Import From Import From Import From Import
No ratings yet
From Import Import As Import As From Import From Import From Import From Import
9 pages
Clustering
No ratings yet
Clustering
1 page
23CC554
No ratings yet
23CC554
10 pages
AAM 7th prac
No ratings yet
AAM 7th prac
4 pages
IDM Assignment
No ratings yet
IDM Assignment
15 pages
FullMarks - Clustering StudentSolution 2
No ratings yet
FullMarks - Clustering StudentSolution 2
13 pages
DWM_EXP4
No ratings yet
DWM_EXP4
9 pages
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
Practical 5
No ratings yet
Practical 5
6 pages
Week 8 DS Practical (1)
No ratings yet
Week 8 DS Practical (1)
13 pages
Experiment 4 1
No ratings yet
Experiment 4 1
4 pages
Project Data Mining (AMAN YADAV)
No ratings yet
Project Data Mining (AMAN YADAV)
12 pages
assg 3
No ratings yet
assg 3
31 pages
Program 7
No ratings yet
Program 7
3 pages
Ass6(DMDS)
No ratings yet
Ass6(DMDS)
7 pages
vertopal.com_najir shaikh practical 5 ml 2 (1)
No ratings yet
vertopal.com_najir shaikh practical 5 ml 2 (1)
4 pages
Lab-7_Clustering
No ratings yet
Lab-7_Clustering
4 pages
ML Exp5 C36
No ratings yet
ML Exp5 C36
18 pages
Vid 4
No ratings yet
Vid 4
6 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
Partition
No ratings yet
Partition
52 pages
4.cluster Analysis
No ratings yet
4.cluster Analysis
7 pages
K-Means in Python - Solution
No ratings yet
K-Means in Python - Solution
6 pages
Assignment4_CH5650_CH21B112
No ratings yet
Assignment4_CH5650_CH21B112
3 pages
AML Clustering
No ratings yet
AML Clustering
7 pages
Banknote Authentication
100% (1)
Banknote Authentication
3 pages
Artificial Intelligence Report
No ratings yet
Artificial Intelligence Report
23 pages
exp_6
No ratings yet
exp_6
10 pages
D3 docs
No ratings yet
D3 docs
6 pages
Experiment 3.1 K-Mean
No ratings yet
Experiment 3.1 K-Mean
8 pages
EXP-6 K Mean Clustring
No ratings yet
EXP-6 K Mean Clustring
6 pages
New K Means - Jupyter Notebook
No ratings yet
New K Means - Jupyter Notebook
4 pages
Experiment 11ml
No ratings yet
Experiment 11ml
1 page
sales-data-clustering
No ratings yet
sales-data-clustering
15 pages
Agglomerative - Jupyter Notebook
No ratings yet
Agglomerative - Jupyter Notebook
2 pages
Practical 03
No ratings yet
Practical 03
3 pages
Tutorial 8
No ratings yet
Tutorial 8
12 pages
01 K Means - Merged
No ratings yet
01 K Means - Merged
26 pages
STAT452 Project1
No ratings yet
STAT452 Project1
13 pages
S6 - Data Mining Lab Experiments (Except 1)
No ratings yet
S6 - Data Mining Lab Experiments (Except 1)
6 pages
ML2 Practical List
No ratings yet
ML2 Practical List
80 pages
Kmeansclustering Sales Dataset
No ratings yet
Kmeansclustering Sales Dataset
6 pages
LAB7_Kmeans[1]
No ratings yet
LAB7_Kmeans[1]
11 pages
Lab Report6 - B21CI014
No ratings yet
Lab Report6 - B21CI014
8 pages
Data Mining Project - Clustering - State Wise Health Income
No ratings yet
Data Mining Project - Clustering - State Wise Health Income
9 pages
Untitled document-2-1-13-7-11.4
No ratings yet
Untitled document-2-1-13-7-11.4
5 pages
Data Mining
No ratings yet
Data Mining
27 pages
Mids Practical 5
No ratings yet
Mids Practical 5
2 pages
Slip Clustering
No ratings yet
Slip Clustering
2 pages
MS6711 Data Mining Homework 1: 1.1 Implement K-Means Manually (8 PTS)
No ratings yet
MS6711 Data Mining Homework 1: 1.1 Implement K-Means Manually (8 PTS)
6 pages
clustering R codes
No ratings yet
clustering R codes
2 pages
TOO
No ratings yet
TOO
7 pages
Agglomerative Clustering
No ratings yet
Agglomerative Clustering
2 pages
Lesson 6 - Unsupervised Learning
No ratings yet
Lesson 6 - Unsupervised Learning
63 pages
K Means Clustering - Experiment 12
No ratings yet
K Means Clustering - Experiment 12
3 pages
K++
No ratings yet
K++
5 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Mobile Commerce and Ubiquitous Computing: E-Business
No ratings yet
Mobile Commerce and Ubiquitous Computing: E-Business
43 pages
Permission Letter
No ratings yet
Permission Letter
1 page
CIM Report 2015
No ratings yet
CIM Report 2015
188 pages
Vagner, Alex.: Executive Summary
No ratings yet
Vagner, Alex.: Executive Summary
4 pages
Physical Verification Flow On Multiple Foundries
No ratings yet
Physical Verification Flow On Multiple Foundries
4 pages
Ms Ramya Swetha Paper For OU
No ratings yet
Ms Ramya Swetha Paper For OU
7 pages
Maxim Is Ing The Reuse and Recycling of Clothes and Textiles
No ratings yet
Maxim Is Ing The Reuse and Recycling of Clothes and Textiles
128 pages
Fiscal Metering Package (Pk-750-01) : Process Data Sheet
No ratings yet
Fiscal Metering Package (Pk-750-01) : Process Data Sheet
4 pages
Self-Audit of Process Performance
No ratings yet
Self-Audit of Process Performance
22 pages
P&G
100% (1)
P&G
7 pages
Installation Guide G210 InviCell
No ratings yet
Installation Guide G210 InviCell
16 pages
BASF Elastospray Booklet Eng
100% (1)
BASF Elastospray Booklet Eng
28 pages
Tcs Verbal Ability
No ratings yet
Tcs Verbal Ability
3 pages
OHS Workplace Inspection Workshop
No ratings yet
OHS Workplace Inspection Workshop
5 pages
Resume For Instructional Technology Coach
No ratings yet
Resume For Instructional Technology Coach
1 page
Design Thinking and Transformation Leadership
100% (2)
Design Thinking and Transformation Leadership
22 pages
Chapter2 Air Refrigeration Cycle
No ratings yet
Chapter2 Air Refrigeration Cycle
39 pages
Unit Conversion Factors
No ratings yet
Unit Conversion Factors
3 pages
Construction of Closet, Fence, Ceiling of Cabana and Floor Tiles of Toilet
No ratings yet
Construction of Closet, Fence, Ceiling of Cabana and Floor Tiles of Toilet
6 pages
Sustanibility Syllabus
No ratings yet
Sustanibility Syllabus
2 pages
Fluid Statistics
100% (1)
Fluid Statistics
66 pages
Filtro
No ratings yet
Filtro
4 pages
Materials Science in Semiconductor Processing: Sciencedirect
No ratings yet
Materials Science in Semiconductor Processing: Sciencedirect
7 pages
CV Rodrigo Moreno
No ratings yet
CV Rodrigo Moreno
4 pages
The Levels of Autonomous Driving: Dedicated Short-Range Communications (DSRC)
No ratings yet
The Levels of Autonomous Driving: Dedicated Short-Range Communications (DSRC)
4 pages
Term End Model Examination Question Paper - Fall - 2011-12: Use of The Statistical Tables Is Permitted
No ratings yet
Term End Model Examination Question Paper - Fall - 2011-12: Use of The Statistical Tables Is Permitted
4 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Compute2

Uploaded by

Compute2

Uploaded by

1st task: perform data cleaning, if any, in the dataset.

As the data is already cleaned so there is no requirement of data cleaning

from sklearn.preprocessing import StandardScaler

import matplotlib.pyplot as plt

mfs = np.array(Data.iloc[:, 1:59])

agg_clustering = AgglomerativeClustering(n_clusters=None, linkage='ward',

linked = linkage(mfs, method='ward')

# The dendogram shows hierarchy of clusters formed by the agglomerative

mfs = np.array(data.iloc[:, 1:-1])

dbscn_clustering = DBSCAN(eps=20, min_samples=5)

plt.scatter(reduced_features[:, 0], reduced_features[:, 1], c=clusters,

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.