Content-Length: 3113410 | pFad | https://www.scribd.com/document/753809418/4-Unsupervised-Learning-Model-Clustering
54.unsupervised Learning Model-Clustering
4.unsupervised Learning Model-Clustering
4.unsupervised Learning Model-Clustering
Unsupervised
Learning
Model-Clustering
Name : Sneha S. Pawar
Unsupervised Learning
Unsupervised learning models are used when we only have the input
variables (X) and no corresponding output variables.
They use unlabelled training data to model the underlying structure
of the data. Input data is given and the model is run on it. The image
or the input given are mixed together and insights on the inputs can
be found .
The model learns through observation and finds structures in the
data. Once the model is given a dataset, it automatically finds
patterns and relationships in the dataset by creating clusters in it.
What it cannot do is add labels to the cluster, like it cannot say this a
group of a group of apples or mangoes, but it will separate all the
apples from mangoes.
Step-5: Once all the clusters are combined into one big cluster,
develop the dendrogram to divide the clusters as per the
problem.
❑ Measure for the distance between two clusters
As we have seen, the closest distance between the two clusters is
crucial for the hierarchical clustering. There are various ways to
calculate the distance between two clusters, and these ways decide
the rule for clustering. These measures are called Linkage
methods. Some of the popular linkage methods are given below:
• Density connected:
A point i refers to density
connected to a point j with respect
to Eps, MinPts if there is a point o
such that both i and j are considered
as density reachable from o with
respect to Eps and MinPts.
❑ Working of Density-Based Clustering
• Suppose a set of objects is denoted by D', we can say that an object
I is directly density reachable form the object j only if it is located
within the ε neighborhood of j, and j is a core object.
• An object i is density reachable form the object j with respect to ε
and MinPts in a given set of objects, D' only if there is a sequence
of object chains point i1,…., in, i1 = j, pn = i such that ii + 1 is
directly density reachable from ii with respect to ε and MinPts.
• An object i is density connected object j with respect to ε and
MinPts in a given set of objects, D' only if there is an object o
belongs to D such that both point i and j are density reachable from
o with respect to ε and MinPts.
❑ Major Features of Density-Based Clustering
The primary features of Density-based clustering are given below.
• It is a scan method.
• It requires density parameters as a termination condition.
• It is used to manage noise in data clusters.
• Density-based clustering is used to identify clusters of arbitrary
size.
❑ Density-Based Clustering Methods DBSCAN
• DBSCAN stands for Density-Based Spatial Clustering of
Applications with Noise. It depends on a density-based notion of
cluster. It also identifies clusters of arbitrary size in the spatial
database with outliers.
❑ OPTICS
• OPTICS stands for Ordering Points To Identify the Clustering
Structure. It gives a significant order of database with respect to its
density-based clustering structure. The order of the cluster
comprises information equivalent to the density-based clustering
related to a long range of parameter settings. OPTICS methods are
beneficial for both automatic and interactive cluster analysis,
including determining an intrinsic clustering structure.
❑ DENCLUE
• Density-based clustering by Hinnebirg and Kiem. It enables a
compact mathematical description of arbitrarily shaped clusters in
high dimension state of data, and it is good for data sets with a huge
amount of noise.
Centroid-based methods:
This is basically one of the iterative clustering algorithms in which the
clusters are formed by the closeness of data points to the centroid of
clusters. Here, the cluster center i.e. centroid is formed such that the
distance of data points is minimum with the center. This problem is
basically one of the NP-Hard problems and thus solutions are
commonly approximated over a number of trials.
Fetched URL: https://www.scribd.com/document/753809418/4-Unsupervised-Learning-Model-Clustering
Alternative Proxies: