DM Endsem 2023-1
Q.1.a) Give an explanation for data reduction strategies. CO-1
b) In a certain distribution of 1000 data points, it is found that Q1 = 20, Q2 = 300, Q3 = 400 and Maximum CO-1
Object Id  Test 1 (nominal)  Test 2 (ordinal)  Test 3 (numeric)
1          A                 Excellent         45
2          ?                 Fair              22
3          A                 Good              54
4          ?                 Excellent         28
(cells marked ? are not legible in the source)
Q.2.a) Given the following database, show all rules that one can generate from the set ABE. Also give the support and confidence of all the generated rules. CO-2
Tid  Itemset
T1   ACD
T2   BCE
T3   ABCE
T4   BDE
T5   ABDE
T6   ABCD
b) [Figure: an itemset lattice with support counts, of which A(6), B(5), C(4), D(3), AB(5), AC(4), BC(3), BD(2), CD(2) and ABCD(1) are legible; the question text and full lattice are not recoverable from the source.]
c) Given the following DNA sequence, answer the following questions using minsup = 3. CO-4
[DNA sequence and sub-questions not legible in the source.]
Q.3.a) Discuss the different methods for model evaluation and selection. What is the .632 bootstrap method, and where does the 0.632 come from? CO-4
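The 0.632 figure comes from the probability that a given example appears at least once in a bootstrap sample of size n drawn with replacement: 1 − (1 − 1/n)^n, which tends to 1 − e⁻¹ ≈ 0.632 as n grows. A quick numerical check:

```python
import math

# In a bootstrap sample of size n drawn with replacement from n examples,
# the chance a particular example is never picked is (1 - 1/n)^n, which
# tends to e^-1 ≈ 0.368. Hence about 63.2% of the original examples
# appear in the sample -- the 0.632 in ".632 bootstrap".
for n in (10, 100, 1000, 100000):
    p_in = 1 - (1 - 1 / n) ** n
    print(f"n={n:>6}: P(example in bootstrap sample) = {p_in:.4f}")

print(f"limit: 1 - 1/e = {1 - math.exp(-1):.4f}")
```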
b) Suppose we have data for a few individuals who have been surveyed. The response to the promotional offers in the areas is listed below. Using the Bayes Classification Algorithm, classify the sex (output attribute) of a new tuple whose data is Investment = No, Travel = Yes, Reading = Yes and Health = No. CO-3
Investment  Travel     Reading    Health     Sex
promotion   promotion  promotion  promotion
Yes         No         Yes        No         Male
Yes         Yes        No         No         Male
No          Yes        Yes        Yes        Female
No          Yes        No         Yes        Male
Yes         Yes        Yes        Yes        Female
No          No         Yes        No         Female
Yes         No         No         No         Male
Yes         Yes        No         No         Male
No          No         Yes        Yes        Female
No          No         No         No         Male
(rows 9-10 are partly garbled in the source and reconstructed here)
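A minimal naive Bayes sketch for this question. Note that rows 9–10 of the printed table are partly garbled in the source, so the values below follow the reconstruction above and should be checked against the original paper before trusting the printed result:

```python
# Naive Bayes classification of the new tuple from Q.3.b).
data = [
    # (Investment, Travel, Reading, Health, Sex)
    ("Yes", "No",  "Yes", "No",  "Male"),
    ("Yes", "Yes", "No",  "No",  "Male"),
    ("No",  "Yes", "Yes", "Yes", "Female"),
    ("No",  "Yes", "No",  "Yes", "Male"),
    ("Yes", "Yes", "Yes", "Yes", "Female"),
    ("No",  "No",  "Yes", "No",  "Female"),
    ("Yes", "No",  "No",  "No",  "Male"),
    ("Yes", "Yes", "No",  "No",  "Male"),
    ("No",  "No",  "Yes", "Yes", "Female"),  # reconstructed row
    ("No",  "No",  "No",  "No",  "Male"),    # reconstructed row
]

def naive_bayes(query):
    """Return the class maximising P(class) * prod_i P(attr_i = v_i | class)."""
    classes = {row[-1] for row in data}
    scores = {}
    for c in classes:
        rows = [r for r in data if r[-1] == c]
        score = len(rows) / len(data)          # prior P(c)
        for i, v in enumerate(query):          # likelihoods P(v | c)
            score *= sum(1 for r in rows if r[i] == v) / len(rows)
        scores[c] = score
    return max(scores, key=scores.get), scores

label, scores = naive_bayes(("No", "Yes", "Yes", "No"))
print(label, scores)
```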
Explain the concept of Bayesian Classification with emphasis on Bayes Theorem. CO-3
If the entropy function has a value of 0, what does this mean? Why do decision tree learning algorithms prefer choosing tests which lead to low entropy?
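As a sketch of the quantity the question refers to, entropy can be computed directly from the class label counts; a pure node has entropy 0 (no remaining class uncertainty), which is why tree learners favour tests that drive entropy down:

```python
import math

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    probs = [labels.count(c) / n for c in set(labels)]
    return -sum(p * math.log2(p) for p in probs)

print(entropy([1, 1, 1, 1]))  # pure leaf -> 0.0
print(entropy([1, 0, 1, 0]))  # maximally mixed -> 1.0
print(entropy([1, 1, 1, 0]))  # mostly pure -> ~0.811
```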
Assume you apply the decision tree learning algorithm to a data set which does not contain any inconsistent examples, and you continue growing the decision tree until you have leaves which are pure. What can be said about the decision tree which you obtain following this procedure?
Q.4 a) Explain the working of the Support Vector Machine with emphasis on the case when the data is linearly separable. CO-4
b) How can we effectively construct an Ensemble classifier? Suppose we have a dataset of individuals, and the task is to predict whether a person will buy a product (class 1) or not (class 0) based on two features: Age and Income. Show how the AdaBoost algorithm tries to solve this problem. CO-4
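A compact AdaBoost sketch with decision stumps as weak learners. The (Age, Income) rows below are hypothetical illustration data, not from the question; the point is the mechanics: fit a stump on weighted data, weight it by its error, and re-weight the examples it misclassifies:

```python
import math

# Hypothetical (Age, Income) examples; labels +1 = buys, -1 = does not.
X = [(25, 30), (30, 60), (35, 40), (40, 80), (45, 20), (50, 90), (55, 50), (60, 85)]
y = [-1, -1, -1, 1, -1, 1, -1, 1]

def best_stump(w):
    """Pick (error, feature, threshold, polarity) minimising weighted error."""
    best = None
    for f in (0, 1):
        for thr in sorted({x[f] for x in X}):
            for pol in (1, -1):
                pred = [pol if x[f] >= thr else -pol for x in X]
                err = sum(wi for wi, p, yi in zip(w, pred, y) if p != yi)
                if best is None or err < best[0]:
                    best = (err, f, thr, pol)
    return best

def adaboost(rounds=5):
    w = [1 / len(X)] * len(X)
    ensemble = []
    for _ in range(rounds):
        err, f, thr, pol = best_stump(w)
        err = max(err, 1e-10)                    # avoid log(0) on a perfect stump
        alpha = 0.5 * math.log((1 - err) / err)  # stump weight
        ensemble.append((alpha, f, thr, pol))
        # Re-weight: misclassified examples get heavier for the next round.
        w = [wi * math.exp(-alpha * yi * (pol if x[f] >= thr else -pol))
             for wi, x, yi in zip(w, X, y)]
        s = sum(w)
        w = [wi / s for wi in w]
    return ensemble

def predict(ensemble, x):
    score = sum(a * (pol if x[f] >= thr else -pol) for a, f, thr, pol in ensemble)
    return 1 if score >= 0 else -1

ens = adaboost()
print([predict(ens, x) for x in X])
```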
c) Consider a dataset of students with two features: hours of study per day (Study Hours) and the number of hours of sleep per day (Sleep Hours). The dataset contains binary class labels indicating whether a student passed (Class 1) or failed (Class 0) an exam. CO-4
i) Using the given dataset and the Euclidean distance metric, apply the KNN algorithm to classify a new student with the following features: Study Hours = 4 and Sleep Hours = 6. Assume K = 3.
ii) What will happen if you change K = 4?
Student  Study Hours  Sleep Hours  Class
S1       ?            ?            ?
S2       3            6            1
S3       2            9            ?
S4       5            1            ?
S5       7            8            0
S6       5            ?            ?
(cells marked ? are not legible in the source)
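A minimal KNN sketch for part (i). Since several cells of the printed table are illegible, the training rows below are hypothetical stand-ins rather than the exam's actual data; the procedure is what matters:

```python
import math
from collections import Counter

# Hypothetical training set: (Study Hours, Sleep Hours) -> Class (1 pass, 0 fail).
train = [((3, 6), 1), ((2, 8), 1), ((5, 1), 0), ((7, 8), 0), ((6, 5), 0)]

def knn(query, k):
    """Majority vote over the k nearest neighbours under Euclidean distance."""
    by_dist = sorted(train, key=lambda item: math.dist(query, item[0]))
    votes = Counter(label for _, label in by_dist[:k])
    return votes.most_common(1)[0][0]

print(knn((4, 6), k=3))  # part (i)
# Part (ii): with k=4 the vote can tie 2-2, so an even k needs a
# tie-breaking rule (e.g. prefer the nearer neighbours, or use odd k).
print(knn((4, 6), k=4))
```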
Q.5.a) Use single-link and complete-link agglomerative clustering to cluster the following 8 examples: A1=(2,10), A2=(2,5), A3=(8,4), A4=(5,8), A5=(7,5), A6=(6,4), A7=(1,2), A8=(4,9). Also show the dendrograms obtained with both of the above methods. Use Euclidean distance. CO-5
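A naive agglomerative sketch over the given points, to check the merge sequence by hand: repeatedly merge the two closest clusters, with single-link using the minimum pairwise distance and complete-link the maximum:

```python
import math

# Points from Q.5.a)
pts = {"A1": (2, 10), "A2": (2, 5), "A3": (8, 4), "A4": (5, 8),
       "A5": (7, 5), "A6": (6, 4), "A7": (1, 2), "A8": (4, 9)}

def cluster_dist(c1, c2, linkage):
    d = [math.dist(pts[a], pts[b]) for a in c1 for b in c2]
    return min(d) if linkage == "single" else max(d)

def agglomerate(linkage):
    """Merge the two closest clusters until one remains; return the merge order."""
    clusters = [frozenset([name]) for name in pts]
    merges = []
    while len(clusters) > 1:
        pair = min(((a, b) for i, a in enumerate(clusters) for b in clusters[i + 1:]),
                   key=lambda p: cluster_dist(*p, linkage))
        clusters = [c for c in clusters if c not in pair] + [pair[0] | pair[1]]
        merges.append((sorted(pair[0] | pair[1]), cluster_dist(*pair, linkage)))
    return merges

for step in agglomerate("single"):
    print(step)
```

The first merges happen at distance √2 (e.g. A4 with A8); ties at that distance can be broken in any order, which is worth noting when drawing the dendrograms.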
b) Both k-means and k-medoids algorithms can perform effective clustering. Illustrate the strengths and weaknesses of k-means in comparison with the k-medoids algorithm. Also, illustrate the strengths and weaknesses of these schemes in comparison with hierarchical clustering. CO-5
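A toy numeric illustration of one contrast the question asks about, using made-up points: the k-means centroid (a mean) is dragged toward an outlier, while a k-medoids representative must be an actual data point and stays inside the bulk of the cluster:

```python
# Made-up cluster with one extreme outlier.
cluster = [(1, 1), (1, 2), (2, 1), (2, 2), (50, 50)]

# k-means representative: the mean of the points.
mean = tuple(sum(c[i] for c in cluster) / len(cluster) for i in (0, 1))

def total_dist(candidate):
    """Total Manhattan distance from `candidate` to all points (PAM-style cost)."""
    return sum(abs(candidate[0] - x) + abs(candidate[1] - y) for x, y in cluster)

# k-medoids representative: the data point minimising total distance to the rest.
medoid = min(cluster, key=total_dist)

print("centroid (k-means):", mean)    # pulled far from the four core points
print("medoid (k-medoids):", medoid)  # one of the core points
```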
c) i) Prove that density-connected and density-reachable are reflexive and symmetric in the DBSCAN algorithm. CO-5
ii) Explain how DBSCAN and OPTICS find clusters of arbitrary shape, whereas partitioning and hierarchical algorithms fail to find such clusters. CO-5
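A minimal DBSCAN sketch on two made-up line-shaped point groups, illustrating how density-based expansion recovers non-spherical clusters that a centroid-based partitioning would struggle with:

```python
import math

def dbscan(points, eps, min_pts):
    """Minimal DBSCAN: label each point with a cluster id, or -1 for noise."""
    labels = {}
    cid = 0

    def neighbours(p):
        return [q for q in points if math.dist(p, q) <= eps]

    for p in points:
        if p in labels:
            continue
        nbrs = neighbours(p)
        if len(nbrs) < min_pts:          # not core (may become border later)
            labels[p] = -1
            continue
        cid += 1                         # start a new cluster from this core point
        labels[p] = cid
        seeds = list(nbrs)
        while seeds:                     # density-reachable expansion
            q = seeds.pop()
            if labels.get(q, -1) == -1:  # unvisited, or previously marked noise
                labels[q] = cid
                q_nbrs = neighbours(q)
                if len(q_nbrs) >= min_pts:
                    seeds.extend(q_nbrs)
    return labels

# Two dense line-shaped groups, far apart: DBSCAN separates them by density.
line1 = [(x / 2, 0.0) for x in range(10)]
line2 = [(x / 2, 5.0) for x in range(10)]
labels = dbscan(line1 + line2, eps=0.6, min_pts=3)
print(sorted(set(labels.values())))
```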