Unit 5
Density-based clusters
• Conceptual clusters: we can define a cluster as a set of
objects that share some property.
Conceptual clusters
K-means
• K-means defines a prototype in terms of a centroid, which is
usually the mean of a group of points, and is typically applied
to objects in a continuous n-dimensional space.
The Basic K-means Algorithm
• K-means is simple and can be used for a wide variety of data types.
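The centroid-based idea above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names (`kmeans`, `assign`, `mean`, `sq_dist`), the random choice of initial centroids, the squared Euclidean distance, and the `max_iter` and `seed` parameters are all assumptions made for the example.

```python
import random


def mean(points):
    """Component-wise mean (the centroid) of a non-empty list of tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def assign(points, centroids):
    """For each point, the index of its nearest centroid."""
    return [
        min(range(len(centroids)), key=lambda i: sq_dist(p, centroids[i]))
        for p in points
    ]


def kmeans(points, k, max_iter=100, seed=0):
    """Alternate assignment and centroid update until centroids stop moving."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # assumption: start from k input points
    for _ in range(max_iter):
        labels = assign(points, centroids)
        new_centroids = []
        for i in range(k):
            members = [p for p, lab in zip(points, labels) if lab == i]
            # Assumption: an empty cluster keeps its previous centroid.
            new_centroids.append(mean(members) if members else centroids[i])
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, assign(points, centroids)


pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, labels = kmeans(pts, k=2)
print(centroids, labels)
```

On this toy data the first three points end up in one cluster and the last three in the other; which cluster receives index 0 depends on the initial centroids.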
The DBSCAN Algorithm
• Any two core points that are close enough (within a distance Eps
of one another) are put in the same cluster.
• Likewise, any border point that is close enough to a core point is
put in the same cluster as the core point.
• Noise points are discarded.
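The three rules above can be sketched as follows. This is a hedged illustration: it assumes the usual definition of a core point (at least `min_pts` points, itself included, within distance `eps`), uses a brute-force neighborhood search, and marks noise with the label -1 instead of removing it. The names `dbscan`, `region_query`, `min_pts`, `NOISE`, and `UNVISITED` are choices made for this example.

```python
import math
from collections import deque

NOISE = -1
UNVISITED = -2


def region_query(points, i, eps):
    """Indices of all points within distance eps of points[i], i included."""
    return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Return one label per point: a cluster id (0, 1, ...) or NOISE."""
    labels = [UNVISITED] * len(points)
    cluster_id = 0
    for i in range(len(points)):
        if labels[i] != UNVISITED:
            continue
        seeds = region_query(points, i, eps)
        if len(seeds) < min_pts:
            labels[i] = NOISE  # not a core point; may become a border point
            continue
        labels[i] = cluster_id  # i is a core point: start a new cluster
        queue = deque(seeds)
        while queue:
            j = queue.popleft()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point joins the cluster
            elif labels[j] == UNVISITED:
                labels[j] = cluster_id
                neighbors = region_query(points, j, eps)
                if len(neighbors) >= min_pts:  # j is core: keep expanding
                    queue.extend(neighbors)
        cluster_id += 1
    return labels


pts = [(0, 0), (0, 1), (1, 0), (1, 1), (2, 1),
       (10, 10), (10, 11), (11, 10), (11, 11), (50, 50)]
print(dbscan(pts, eps=1.5, min_pts=4))
# → [0, 0, 0, 0, 0, 1, 1, 1, 1, -1]
```

Here (2, 1) has only three points in its Eps-neighborhood, so it is a border point attached to the first cluster, and (50, 50) is noise. A border point within Eps of core points from two clusters goes to whichever cluster reaches it first in this sketch.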
Time and Space Complexity
• The basic time complexity of the DBSCAN algorithm is O(m ×
time to find points in the Eps-neighborhood), where m is the
number of points.
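• The space requirement of DBSCAN, even for high-dimensional
data, is O(m), because only a small amount of information (the
cluster label and the core, border, or noise status) needs to be
kept for each point.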
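To make the time bound concrete: with no spatial index, each Eps-neighborhood query scans all m points, so the total cost is O(m²) in the worst case. In low-dimensional spaces, an index such as a kd-tree can reduce the per-query cost to roughly O(log m), for about O(m log m) overall. The small sketch below only counts the distance evaluations of the brute-force approach; the function name `brute_force_evaluations` is hypothetical.

```python
import math


def brute_force_evaluations(points, eps):
    """Count distance evaluations when every point queries every other point."""
    evaluations = 0
    for p in points:          # one Eps-neighborhood query per point: m queries
        for q in points:      # each query scans all m points
            evaluations += 1
            _in_neighborhood = math.dist(p, q) <= eps
    return evaluations


line = [(float(i), 0.0) for i in range(20)]
print(brute_force_evaluations(line, eps=1.5))  # m = 20, so 20 * 20 evaluations
```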
Strengths and Weaknesses