NIVEDITHA. GH
VI Semester
MACHINE LEARNING
Unsupervised Learning
UNSUPERVISED LEARNING
Unsupervised learning is a type of machine learning where the algorithm learns patterns and
structures from input data without explicit supervision or labeled responses.
In unsupervised learning, the algorithm is given a dataset consisting of features but not the
corresponding target labels. The goal is to explore the inherent structure of the data, identify
patterns, and extract meaningful insights.
Unsupervised learning trains a machine on information that is neither classified nor labeled, which
allows the algorithm to act on that information without guidance.
TYPES OF UNSUPERVISED LEARNING
•Clustering
•Association
APPLICATIONS OF UNSUPERVISED LEARNING
IMAGE AND TEXT CLUSTERING: It can automatically group similar images and texts, aiding in tasks
like image organization, document clustering, or content recommendation.
CUSTOMER SEGMENTATION: Customers can be grouped based on their purchase behavior, allowing
organizations to customize marketing efforts.
SEMANTIC CLUSTERING: It groups responses with the same meaning into clusters so that customers
can find the information they need quickly and easily.
ANOMALY DETECTION: Unsupervised learning is used to identify data points, events, and/or
observations that deviate from a dataset's normal behavior.
TYPES OF CLUSTERING
PARTITIONING-BASED CLUSTERING: These methods divide the data set into distinct groups. They aim to create partitions
such that objects within each partition are as similar as possible.
•K-means Clustering: Partitions data into k clusters by minimizing the distance between each point and the centroid of its cluster.
•K-medoids Clustering: Similar to k-means but uses medoids (the most centrally located data points) instead of centroids.
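The difference between a centroid and a medoid can be illustrated with a small sketch. The function names `centroid` and `medoid` and the sample points are assumptions made for this illustration, and Euclidean distance is assumed as the similarity measure.

```python
import math

def centroid(points):
    """Coordinate-wise mean of the points; it need not be one of the data points."""
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))

def medoid(points):
    """The data point whose total distance to all other points is smallest."""
    return min(points, key=lambda c: sum(math.dist(c, p) for p in points))

# Hypothetical cluster: four nearby points and one far-away outlier.
cluster = [(0.0, 0.0), (2.0, 0.0), (1.0, 1.0), (1.0, 0.0), (11.0, 10.0)]

print("centroid:", centroid(cluster))  # pulled toward the outlier
print("medoid:  ", medoid(cluster))    # always an actual data point
```

Because the medoid must be a real data point, a single outlier shifts it less than it shifts the centroid, which is one common reason for choosing k-medoids over k-means.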
HIERARCHICAL CLUSTERING: These methods create a hierarchy of clusters that can be represented in a tree structure
(dendrogram).
•Agglomerative (Bottom-Up): Starts with each object in its own cluster and merges the closest pairs of clusters iteratively.
•Divisive (Top-Down): Starts with all objects in one cluster and splits the cluster iteratively.
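The agglomerative (bottom-up) procedure can be sketched as follows. This is a minimal illustration, assuming Euclidean distance and single linkage (the distance between two clusters is the distance between their closest members); the function name `agglomerative` and the stopping rule based on `num_clusters` are choices made for this example, not part of the definition above.

```python
import math

def agglomerative(points, num_clusters):
    """Bottom-up clustering with single linkage; returns lists of point indices."""
    clusters = [[i] for i in range(len(points))]  # every point starts in its own cluster

    def linkage(a, b):
        # Single linkage: distance between the two closest members of a and b.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > num_clusters:
        # Find the closest pair of clusters and merge them.
        _, i, j = min(
            (linkage(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters

# Hypothetical data: two tight pairs and one isolated point.
points = [(0, 0), (0, 1), (5, 5), (5, 6), (10, 0)]
print(agglomerative(points, 3))
```

Recording the order of the merges, instead of stopping at a fixed number of clusters, would give the information needed to draw a dendrogram.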
DENSITY-BASED CLUSTERING: These methods find clusters based on the density of points in the data space.
•DBSCAN (Density-Based Spatial Clustering of Applications with Noise): Finds clusters based on the density of data points,
capable of finding arbitrary-shaped clusters and dealing with noise.
•OPTICS (Ordering Points to Identify the Clustering Structure): An extension of DBSCAN that works better with varied
densities.
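The density-based idea behind DBSCAN can be sketched in plain Python. This is a simplified illustration that uses a brute-force neighbor search; the parameter names `eps` (neighborhood radius) and `min_pts` (minimum number of neighbors for a core point, counting the point itself) follow common usage but are assumptions here, as is the sample data.

```python
import math

def dbscan(points, eps, min_pts):
    """Return one label per point: 0, 1, ... for clusters and -1 for noise."""
    NOISE, UNSEEN = -1, None
    labels = [UNSEEN] * len(points)

    def neighbors(i):
        # All points within eps of point i (including i itself).
        return [j for j in range(len(points)) if math.dist(points[i], points[j]) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not UNSEEN:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster += 1  # point i is a core point: start a new cluster
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point reached from a core point
            if labels[j] is not UNSEEN:
                continue
            labels[j] = cluster
            nbrs = neighbors(j)
            if len(nbrs) >= min_pts:
                queue.extend(nbrs)  # j is also a core point, so keep expanding
    return labels

# Hypothetical data: two dense squares and one isolated point.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (8, 8), (8, 9), (9, 8), (9, 9), (20, 20)]
print(dbscan(points, eps=1.5, min_pts=3))
```

Note that the number of clusters is not given in advance; it follows from the density parameters, and points in sparse regions are labeled as noise.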
GRID-BASED CLUSTERING: These methods involve dividing the data space into a finite number of cells that form a grid
structure.
•STING (Statistical Information Grid): Uses a multi-resolution grid data structure for efficient clustering.
•CLIQUE (Clustering In QUEst): Identifies dense regions in subspaces of high-dimensional data and merges them to form
clusters.
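The general grid-based idea can be sketched as follows. This is a deliberately simplified illustration and is neither STING nor CLIQUE: it assumes 2-D points, a single fixed cell size, and a hypothetical rule that a cell is "dense" when it holds at least `min_count` points, with neighboring dense cells merged into one cluster.

```python
from collections import defaultdict

def grid_clusters(points, cell_size, min_count):
    """Group 2-D points by dense grid cells; adjacent dense cells are merged."""
    # Step 1: assign every point to the grid cell that contains it.
    cells = defaultdict(list)
    for idx, (x, y) in enumerate(points):
        cells[(int(x // cell_size), int(y // cell_size))].append(idx)

    # Step 2: keep only cells with enough points.
    dense = {c for c, members in cells.items() if len(members) >= min_count}

    # Step 3: merge dense cells that touch (including diagonally).
    clusters, seen = [], set()
    for start in sorted(dense):
        if start in seen:
            continue
        seen.add(start)
        stack, members = [start], []
        while stack:
            cx, cy = stack.pop()
            members.extend(cells[(cx, cy)])
            for dx in (-1, 0, 1):
                for dy in (-1, 0, 1):
                    nb = (cx + dx, cy + dy)
                    if nb in dense and nb not in seen:
                        seen.add(nb)
                        stack.append(nb)
        clusters.append(sorted(members))
    return clusters

# Hypothetical data: points in sparse cells are left unclustered.
points = [(0.1, 0.1), (0.5, 0.5), (1.2, 0.3), (1.8, 0.9), (7.1, 7.2), (7.5, 7.9), (4.0, 4.0)]
print(grid_clusters(points, cell_size=1.0, min_count=2))
```

After the points are binned, the remaining work operates on cells rather than on individual points, which is where the efficiency of grid-based methods comes from.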
APPLICATIONS OF CLUSTERING
MARKETING: Clustering can be used in marketing to discover and characterize customer
segments.
BIOLOGY: It can be used for classification among different species of plants and animals.
IMAGE PROCESSING: It can be used to group similar images together, classify images based
on content and identify patterns in image data.
CLIMATE ANALYSIS: It is used for grouping comparable patterns of climate data, such as
temperature, precipitation, and wind, in order to better understand climate change and its
influence on the environment.
MEDICAL DIAGNOSIS: It is used to group patients with similar symptoms or diseases, which helps in making accurate diagnoses and
identifying effective treatments.
FINANCE: Clustering is a technique used to identify market groupings based on customer behavior, find patterns in stock market data,
and assess risk in investment portfolios.
K-MEANS CLUSTERING
K-means clustering is a very popular clustering algorithm that is applied when the labels of a dataset are
unknown.
The goal is to find groups in the data based on some measure of similarity, with the number of groups
represented by K.
The algorithm is commonly used in areas such as market segmentation and customer segmentation, but it can
also be used to segment objects in an image on the basis of pixel values.
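A minimal sketch of k-means is shown below, assuming the standard iterative procedure (often called Lloyd's algorithm) with Euclidean distance and initial centroids chosen at random from the data. The function name `kmeans`, the parameters `iterations` and `seed`, and the sample points are assumptions made for this illustration.

```python
import math
import random

def kmeans(points, k, iterations=100, seed=0):
    """Return (labels, centroids) for points given as tuples of numbers."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    labels = None
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its assigned points.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return labels, centroids

# Hypothetical data: two well-separated groups of three points each.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 9.0), (8.5, 9.5)]
labels, centroids = kmeans(points, k=2)
print(labels)
print(centroids)
```

For image segmentation, the same function could be applied with each pixel's color values, for example an (R, G, B) tuple, as one point, so that pixels with similar colors end up in the same cluster. The result depends on the initial centroids, so in practice the algorithm is often run several times with different seeds.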