What Is Unsupervised Learning
In unsupervised learning, the machine uses unlabeled data and learns on its own,
without any supervision. The machine tries to find patterns in the unlabeled data
and gives a response.
Let's take a similar example as before, but this time we do not tell the machine
whether each object is a spoon or a knife. The machine identifies patterns in the given set
and groups the objects based on their similarities.
Unsupervised learning problems fall into two categories:
1. Clustering
2. Association
Clustering is the method of dividing objects into clusters whose members are similar
to one another and dissimilar to the objects belonging to other clusters. For
example, finding out which customers made similar product purchases.
Suppose a telecom company wants to reduce its customer churn rate by
providing personalized call and data plans. The behavior of the customers is
studied, and the model segments the customers with similar traits. Several
strategies are then adopted to minimize the churn rate and maximize profit through
suitable promotions and campaigns.
In the graph that accompanies this example, you can see the customers grouped.
Group A customers use more data and also have high call durations.
Group B customers are heavy Internet users, while Group C customers have
high call durations. So, Group B will be given more data benefit plans,
Group C will be given cheaper call rate plans, and Group A will be given the
benefit of both.
Types of Clustering
- Hierarchical clustering
  - Divisive clustering
- Partitioning clustering
  - K-Means clustering
K-Means clustering needs advance knowledge of K, i.e., the number of clusters
into which you want to divide your data. In hierarchical clustering, you can stop
at any number of clusters and find the appropriate one by interpreting the dendrogram.
Hierarchical Clustering
Partitioning Clustering
Partitioning clustering is split into two subtypes - K-Means clustering and Fuzzy
C-Means.
In k-means clustering, the objects are divided into the number of clusters specified by
the number ‘K.’ So if we say K = 2, the objects are divided into two clusters, c1
and c2, as shown:
Here, the features or characteristics are compared, and all objects having similar characteristics
are clustered together.
Fuzzy c-means is very similar to k-means in the sense that it clusters objects that have similar
characteristics together. In k-means clustering, a single object cannot belong to two different
clusters. But in c-means, objects can belong to more than one cluster, as shown.
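The difference can be made concrete with a small sketch of the fuzzy c-means soft-membership computation. The centroids are held fixed and the points are hypothetical; the fuzzifier value m = 2 is a common default, not something prescribed by the text.

```python
# Fuzzy c-means soft membership: each point gets a degree of membership
# in every cluster (the degrees sum to 1), unlike k-means, where a point
# belongs to exactly one cluster. 1-D sketch with fixed centroids.

def memberships(point, centroids, m=2):
    dists = [abs(point - c) for c in centroids]
    # A point sitting exactly on a centroid belongs fully to that cluster.
    if 0 in dists:
        return [1.0 if d == 0 else 0.0 for d in dists]
    power = 2 / (m - 1)
    return [1 / sum((d_i / d_j) ** power for d_j in dists) for d_i in dists]

# A point midway between centroids 0 and 10 belongs equally to both.
print(memberships(5, [0, 10]))   # [0.5, 0.5]
# A point near centroid 0 belongs mostly, but not exclusively, to cluster 0.
print(memberships(1, [0, 10]))
```

The closer a point is to a centroid, the higher its membership degree for that cluster, but every cluster still gets a nonzero share.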
K-Means clustering is an unsupervised learning algorithm. There is no labeled data for this
clustering, unlike in supervised learning. K-Means performs the division of objects into clusters
that share similarities and are dissimilar to the objects belonging to another cluster.
The term ‘K’ is a number: you need to tell the system how many clusters you want to create. For
example, K = 2 refers to two clusters. There are also methods, such as the elbow method, for
finding the best or optimal value of K for a given data set.
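One common heuristic is the elbow method: run k-means for several values of K and watch the within-cluster sum of squares (WCSS). A minimal sketch, using a tiny hand-rolled 1-D k-means on hypothetical data:

```python
# Elbow-method sketch for choosing K: run k-means for several values of K
# and compare the within-cluster sum of squares (WCSS). The K where the
# curve stops dropping sharply (the "elbow") is a reasonable choice.

def kmeans_1d(data, k, iters=20):
    centroids = list(data[:k])                 # naive initialisation
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for x in data:
            nearest = min(range(k), key=lambda j: (x - centroids[j]) ** 2)
            clusters[nearest].append(x)
        centroids = [sum(cl) / len(cl) if cl else centroids[i]
                     for i, cl in enumerate(clusters)]
    wcss = sum(min((x - c) ** 2 for c in centroids) for x in data)
    return centroids, wcss

data = [1, 2, 3, 20, 21, 22]                   # two obvious groups
for k in (1, 2, 3):
    print(k, round(kmeans_1d(data, k)[1], 2))
# WCSS falls sharply from K=1 to K=2, then barely changes:
# the elbow at K=2 matches the two visible groups.
```
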
For a better understanding of k-means, let's take an example from cricket. Imagine you received
data on many cricket players from all over the world, giving the runs scored by each player and
the wickets taken by them in the last ten matches. Based on this
information, we need to group the data into two clusters, namely batsmen and bowlers.
Solution:
Here, we have our data set plotted on ‘x’ and ‘y’ coordinates. The information on the y-axis is
about the runs scored, and on the x-axis about the wickets taken by the players.
The first step in k-means clustering is the allocation of two centroids randomly (as K=2). Two
points are assigned as centroids. Note that the points can be anywhere, as they are random points.
They are called centroids, but initially, they are not the central point of a given data set.
The next step is to determine the distance between each data point and the two randomly
assigned centroids. For every point, the distance is measured from both centroids, and the
point is assigned to the centroid it is closer to. You can see the data points attached to the
centroids, represented here in blue and yellow.
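This assignment step can be sketched in a few lines; the (wickets, runs) player points and the two starting centroids below are hypothetical:

```python
import math

# Assignment step of k-means: each (wickets, runs) point goes to
# whichever of the two randomly chosen centroids is nearer.
points = [(1, 80), (2, 95), (9, 10), (8, 5)]    # hypothetical players
centroids = [(2, 90), (8, 8)]                   # "random" starting centroids

def nearest(p, cs):
    # Index of the centroid closest to p (Euclidean distance).
    return min(range(len(cs)), key=lambda i: math.dist(p, cs[i]))

labels = [nearest(p, centroids) for p in points]
print(labels)   # [0, 0, 1, 1] -> high-run points vs high-wicket points
```
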
The next step is to determine the actual centroids for these two clusters. The
originally allocated random centroids are repositioned to the actual centroids of
the clusters.
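The repositioning itself is just a mean: the new centroid of a cluster is the average of the points currently assigned to it. A minimal sketch with hypothetical points:

```python
# Repositioning step: the new centroid of a cluster is the mean of the
# points assigned to it, computed coordinate by coordinate.
cluster = [(1, 80), (2, 95), (3, 110)]   # hypothetical batsmen points

centroid = tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
print(centroid)   # (2.0, 95.0)
```
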
This process of calculating the distance and repositioning the centroid continues
until we obtain our final cluster. Then the centroid repositioning stops.
As seen above, the centroids don't need any more repositioning, which means
the algorithm has converged, and we have our two final clusters, each with its centroid.
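The whole procedure can be sketched end to end. This is a minimal pure-Python version of the loop described above, run on hypothetical (wickets, runs) data with K = 2; it is an illustration of the steps, not a production implementation:

```python
import math

# Full k-means loop on the cricket example: repeat the assignment and
# repositioning steps until the centroids stop moving (convergence).
points = [(1, 80), (2, 95), (3, 110), (9, 10), (8, 5), (10, 12)]
centroids = [(0, 0), (5, 50)]            # arbitrary starting centroids

while True:
    # Assignment: attach each point to its nearest centroid.
    clusters = [[] for _ in centroids]
    for p in points:
        i = min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
        clusters[i].append(p)
    # Reposition: move each centroid to the mean of its cluster.
    new = [tuple(sum(c) / len(cl) for c in zip(*cl)) if cl else centroids[i]
           for i, cl in enumerate(clusters)]
    if new == centroids:                 # no movement -> converged
        break
    centroids = new

print(centroids)   # final centroids: one for bowlers, one for batsmen
print(clusters)    # the two clusters of (wickets, runs) points
```

On this toy data the loop converges in a couple of iterations, splitting the points into a high-runs cluster (batsmen) and a high-wickets cluster (bowlers).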