Unit 5 Notes DWM
Unit 5 Notes DWM
Solution:
Calculate min_supp=0.5*4=2 (support count is 2)
(0.5: given minimum support in problem, 4: total transactions in database D)
Step 1: Generate candidate list C1 from D
C1=
Step 2: Scan D for count of each candidate and find the support.
C1=
Step 8: Scan D for count of each candidate and find the support.
C3=
We start with a frequent itemset {a,b,c,d} and start forming rules with just one
consequent. Remove the rules failing to satisfy the minconf condition. Now,
start forming rules using a combination of consequents from the remaining
ones. Keep repeating until only one item is left on antecedent. This process has
to be done for all frequent itemsets.
Q.] Describe association rules in mining frequent patterns and cluster analysis.
Q.] Explain Apriori algorithms for frequent itemset using candidate generation.
Q.] Describe Cluster Analysis
Cluster Analysis:
Clustering is a data mining technique used to place data elements into related
groups without advance knowledge.
Clustering is the process of grouping a set of data objects into multiple groups
or clusters so that objects within a cluster have high similarity, but are very
dissimilar to objects in other clusters.
Dissimilarities and similarities are assessed based on the attribute values
describing the objects and often involve distance measures.
Cluster analysis or simply clustering is the process of partitioning a set of data
objects (or observations) into subsets.
Each subset is a cluster, such that objects in a cluster are similar to one another,
yet dissimilar to objects in other clusters. The set of clusters resulting from a
cluster analysis can be referred to as a clustering.
Answer:
Set of values: 2,3,6,8,9,12,15,18,22
1. Break given set of values randomly in to 3 clusters and calculate the mean
value.
K1: 2,8,15 mean=8.3
K2: 3,9,18 mean=10
K3: 6,12,22 mean=13.3
2. Reassign the values to clusters as per the mean calculated and calculate the
mean again.
K1: 2,3,6,8,9 mean=5.6
K2: mean=0
K3: 12,15,18,22 mean=16.75
3. Reassign the values to clusters as per the mean calculated and calculate the
mean again.
K1: 3,6,8,9 mean=6.5
K2: 2 mean=2
K3: 12,15,18,22 mean=16.75
4. Reassign the values to clusters as per the mean calculated and calculate the
mean again.
K1: 6,8,9 mean=7.6
K2: 2,3 mean=2.5
K3: 12,15,18,22 mean=16.75
5. Reassign the values to clusters as per the mean calculated and calculate the
mean again.
K1: 6,8,9 mean=7.6
K2: 2,3 mean=2.5
K3: 12,15,18,22 mean=16.75
Applications of Clustering:
1. Marketing
2. Biology
3. Libraries
4. Insurance
5. City-planning
6. WWW