DM&DW MCQs

Download as pdf or txt
Download as pdf or txt
You are on page 1of 16

1.

Which OLAP operation involves moving down in the concept hierarchy


to get more detailed data?
○ A. Roll-up
○ B. Drill-down
○ C. Slice
○ D. Dice
○ Correct Answer: B. Drill-down
2. Which operation in OLAP is the reverse of the drill-down operation?
○ A. Slice
○ B. Pivot
○ C. Roll-up
○ D. Dice
○ Correct Answer: C. Roll-up
3. In OLAP, what is the operation called that involves selecting a single
dimension to create a new sub-cube?
○ A. Slice
○ B. Dice
○ C. Roll-up
○ D. Pivot
○ Correct Answer: A. Slice
4. Which OLAP operation rotates the current view to provide a new
perspective?
○ A. Roll-up
○ B. Drill-down
○ C. Dice
○ D. Pivot
○ Correct Answer: D. Pivot
5. Which OLAP operation selects two or more dimensions to create a
sub-cube?
○ A. Slice
○ B. Dice
○ C. Drill-down
○ D. Pivot
○ Correct Answer: B. Dice

6. Which of the following is NOT a characteristic of a data warehouse?


○ A. Subject-oriented
○ B. Integrated
○ C. Volatile
○ D. Time-variant
○ Correct Answer: C. Volatile
7. Data warehouses typically store data for what duration?
○ A. 1-2 years
○ B. 5-10 years
○ C. Only current data
○ D. Less than 1 year
○ Correct Answer: B. 5-10 years
8. What is the main purpose of a data warehouse?
○ A. Transaction processing
○ B. Real-time analytics
○ C. Data consolidation and historical analysis
○ D. Data deletion
○ Correct Answer: C. Data consolidation and historical analysis
9. Which schema in a data warehouse is arranged with a central fact table
surrounded by dimension tables?
○ A. Star schema
○ B. Snowflake schema
○ C. Fact constellation schema
○ D. Hybrid schema
○ Correct Answer: A. Star schema
10. In a data warehouse, what term is used to describe the central table that
contains quantitative data for analysis?
○ A. Dimension table
○ B. Fact table
○ C. Lookup table
○ D. Reference table
○ Correct Answer: B. Fact table

MCQs on Data Mining

11. What is the main goal of data mining?


○ A. Storing large volumes of data
○ B. Extracting hidden patterns from data
○ C. Data entry and retrieval
○ D. Database management
○ Correct Answer: B. Extracting hidden patterns from data
12. Which of the following is a data mining technique used for predictive
modeling?
○ A. Clustering
○ B. Classification
○ C. Association rule mining
○ D. Data cleaning
○ Correct Answer: B. Classification
13. In data mining, what technique is used to find relationships between
variables in large datasets?
○ A. Clustering
○ B. Regression analysis
○ C. Association rule mining
○ D. Anomaly detection
○ Correct Answer: C. Association rule mining
14. Which data mining task is focused on grouping similar records
together?
○ A. Classification
○ B. Clustering
○ C. Regression
○ D. Association
○ Correct Answer: B. Clustering
15. What is the process of cleaning and preparing data for analysis called in
data mining?
○ A. Data warehousing
○ B. Data preprocessing
○ C. Data mining
○ D. Data visualization
○ Correct Answer: B. Data preprocessing

16. Which system is designed for transaction processing?


○ A. OLAP
○ B. OLTP
○ C. Data warehouse
○ D. Data mart
○ Correct Answer: B. OLTP
17. Which of the following is a characteristic of OLAP systems?
○ A. High number of users
○ B. Low query complexity
○ C. Real-time data updates
○ D. Multidimensional views
○ Correct Answer: D. Multidimensional views
18. In OLTP systems, the typical database size is:
○ A. Small to medium
○ B. Very large
○ C. Unlimited
○ D. Non-existent
○ Correct Answer: A. Small to medium
19. Which type of system typically has user-defined views for data
analysis?
○ A. OLAP
○ B. OLTP
○ C. Data warehouse
○ D. ETL
○ Correct Answer: A. OLAP
20. Data in OLTP systems is generally:
○ A. Aggregated
○ B. Historical
○ C. Detailed and current
○ D. Optimized for queries
○ Correct Answer: C. Detailed and current

21. MOLAP stands for:


○ A. Multidimensional OLAP
○ B. Managed OLAP
○ C. Modular OLAP
○ D. Mobile OLAP
○ Correct Answer: A. Multidimensional OLAP
22. ROLAP stands for:
○ A. Relational OLAP
○ B. Resourceful OLAP
○ C. Real-time OLAP
○ D. Regional OLAP
○ Correct Answer: A. Relational OLAP
23. Which OLAP implementation uses a relational database?
○ A. MOLAP
○ B. ROLAP
○ C. HOLAP
○ D. DOLAP
○ Correct Answer: B. ROLAP
24. Which OLAP implementation is known for high query performance using
pre-aggregated data?
○ A. MOLAP
○ B. ROLAP
○ C. HOLAP
○ D. DOLAP
○ Correct Answer: A. MOLAP
25. MOLAP systems store data in:
○ A. Flat files
○ B. Multidimensional cubes
○ C. Relational tables
○ D. XML format
○ Correct Answer: B. Multidimensional cubes

MCQs on Dimensional Modeling and Schema

26. Dimensional modeling is used primarily in:


○ A. OLTP systems
○ B. Data warehousing
○ C. File processing systems
○ D. Network databases
○ Correct Answer: B. Data warehousing
27. Which schema has a central fact table connected to multiple dimension
tables, but dimension tables are normalized?
○ A. Star schema
○ B. Snowflake schema
○ C. Fact constellation schema
○ D. Galaxy schema
○ Correct Answer: B. Snowflake schema
28. In a star schema, the fact table is:
○ A. Central and connected to dimension tables
○ B. A child of dimension tables
○ C. The same as dimension tables
○ D. Not used
○ Correct Answer: A. Central and connected to dimension tables
29. Which schema is a complex version of the star schema with multiple
fact tables?
○ A. Star schema
○ B. Snowflake schema
○ C. Fact constellation schema
○ D. Hybrid schema
○ Correct Answer: C. Fact constellation schema
30. Dimensional modeling involves:
○ A. Normalization
○ B. Denormalization
○ C. Both normalization and denormalization
○ D. Neither normalization nor denormalization
○ Correct Answer: B. Denormalization

MCQs on Data Mining Techniques

31. Which of the following is used for market basket analysis?


○ A. Classification
○ B. Clustering
○ C. Association rule mining
○ D. Regression
○ Correct Answer: C. Association rule mining
32. What is the process of dividing data into subsets that have similar
characteristics?
○ A. Classification
○ B. Clustering
○ C. Regression
○ D. Association
○ Correct Answer: B. Clustering
33. Which data mining technique predicts a categorical outcome?
○ A. Regression
○ B. Clustering
○ C. Classification
○ D. Association
○ Correct Answer: C. Classification
34. Which method is used to identify unusual patterns that do not conform
to expected behavior?
○ A. Clustering
○ B. Anomaly detection
○ C. Classification
○ D. Regression
○ Correct Answer: B. Anomaly detection
35. Which technique involves estimating the relationships among
variables?
○ A. Clustering
○ B. Classification
○ C. Association rule mining
○ D. Regression
○ Correct Answer: D. Regression

36. The process of transforming data into information for business


decision-making is called:
○ A. Data cleaning
○ B. Data transformation
○ C. Data warehousing
○ D. Data mining
○ Correct Answer: C. Data warehousing
37. Which component of data warehousing handles data extraction,
transformation, and loading?
○ A. OLAP
○ B. ETL
○ C. OLTP
○ D. Data marts
○ Correct Answer: B. ETL
38. What is a subset of a data warehouse that is focused on a particular
area or department?
○ A. Data mart
○ B. Data lake
○ C. Data cube
○ D. Data pipeline
○ Correct Answer: A. Data mart
39. Which process in ETL involves correcting data inconsistencies?
○ A. Extraction
○ B. Transformation
○ C. Loading
○ D. Indexing
○ Correct Answer: B. Transformation
40. In data warehousing, what is the purpose of a data cube?
○ A. Storing unstructured data
○ B. Visualizing multidimensional data
○ C. Real-time data processing
○ D. Transaction management
○ Correct Answer: B. Visualizing multidimensional data

41. Which step in the data mining process involves selecting the relevant
data to be analyzed?
○ A. Data cleaning
○ B. Data selection
○ C. Data transformation
○ D. Data mining
○ Correct Answer: B. Data selection
42. During which step are algorithms applied to extract patterns from data?
○ A. Data preparation
○ B. Data integration
○ C. Data mining
○ D. Data interpretation
○ Correct Answer: C. Data mining
43. Which phase involves understanding the business objectives and
requirements?
○ A. Data preparation
○ B. Business understanding
○ C. Data mining
○ D. Model evaluation
○ Correct Answer: B. Business understanding
44. What is the final step in the data mining process where results are
interpreted and used?
○ A. Data preparation
○ B. Model evaluation
○ C. Deployment
○ D. Data cleaning
○ Correct Answer: C. Deployment
45. Which step involves merging data from different sources into a coherent
data store?
○ A. Data cleaning
○ B. Data selection
○ C. Data integration
○ D. Data mining
○ Correct Answer: C. Data integration

MCQs on Data Warehousing and Business Intelligence

46. Business intelligence systems primarily aim to:


○ A. Handle transactions
○ B. Support business decision-making
○ C. Perform batch processing
○ D. Store operational data
○ Correct Answer: B. Support business decision-making
47. In a data warehouse, what term describes data that can be traced over
time to understand changes?
○ A. Volatile data
○ B. Time-variant data
○ C. Static data
○ D. Current data
○ Correct Answer: B. Time-variant data
48. What type of analysis is used to understand historical trends and
patterns?
○ A. Predictive analysis
○ B. Descriptive analysis
○ C. Prescriptive analysis
○ D. Real-time analysis
○ Correct Answer: B. Descriptive analysis
49. Which business intelligence tool provides a visual representation of
data trends and patterns?
○ A. Data cube
○ B. OLAP cube
○ C. Dashboard
○ D. ETL tool
○ Correct Answer: C. Dashboard
50. The term 'drill-through' in business intelligence refers to:
○ A. Aggregating data
○ B. Viewing detailed data
○ C. Rolling up data
○ D. Slicing data
○ Correct Answer: B. Viewing detailed data

1. Which step involves selecting the granularity of data in the dimensional


modeling process?
○ a) Choose the Business Process
○ b) Choose the Grain
○ c) Choose the Facts
○ d) Choose the Dimensions
○ Answer: b) Choose the Grain
2. What is the first step in the Four-Step Method from ER to Dimensional
Modeling?
○ a) Choose the Facts
○ b) Choose the Grain
○ c) Choose the Business Process
○ d) Choose the Dimensions
○ Answer: c) Choose the Business Process
3. Which of the following is NOT typically considered a dimension in
dimensional modeling?
○ a) Time
○ b) Product
○ c) Geography
○ d) Sales
○ Answer: d) Sales
4. In dimensional modeling, what is a common example of a fact?
○ a) Time
○ b) Quantity Sold
○ c) Product Name
○ d) Region
○ Answer: b) Quantity Sold
5. What does the grain define in a dimensional model?
○ a) The dimensions to be used
○ b) The level of detail for each fact table record
○ c) The facts to be used
○ d) The business process to be analyzed
○ Answer: b) The level of detail for each fact table record
6. ETL stands for:
○ a) Extract, Transform, Load
○ b) Extract, Transfer, Load
○ c) Evaluate, Transform, Load
○ d) Extract, Transform, Link
○ Answer: a) Extract, Transform, Load
7. Which phase of the ETL process involves cleaning and preparing data?
○ a) Extract
○ b) Transform
○ c) Load
○ d) Transfer
○ Answer: b) Transform
8. In the ETL process, data loading refers to:
○ a) Extracting data from source systems
○ b) Transforming data into a suitable format
○ c) Loading data into the target database
○ d) Cleaning and removing duplicates
○ Answer: c) Loading data into the target database
9. Which of the following is NOT typically a step in the ETL process?
○ a) Data Extraction
○ b) Data Cleansing
○ c) Data Analysis
○ d) Data Loading
○ Answer: c) Data Analysis
10. What is a common tool used in the ETL process?
○ a) Microsoft Excel
○ b) Apache Hadoop
○ c) Apache Spark
○ d) Talend
○ Answer: d) Talend

11. What is the main goal of Business Intelligence (BI)?


○ a) To increase data storage capacity
○ b) To analyze and present business data
○ c) To create marketing strategies
○ d) To manage customer relationships
○ Answer: b) To analyze and present business data
12. Which of the following is NOT a typical component of a BI system?
○ a) Data Warehouse
○ b) ETL Tools
○ c) Data Mining
○ d) Social Media Marketing
○ Answer: d) Social Media Marketing
13. Which BI tool is known for its data visualization capabilities?
○ a) Microsoft SQL Server
○ b) Apache Hive
○ c) Tableau
○ d) Talend
○ Answer: c) Tableau
14. What does OLAP stand for?
○ a) Online Analytical Processing
○ b) Online Linear Analytical Processing
○ c) Offline Analytical Processing
○ d) Offline Linear Analytical Processing
○ Answer: a) Online Analytical Processing
15. Which OLAP operation involves summarizing data?
○ a) Drill Down
○ b) Roll Up
○ c) Slice
○ d) Dice
○ Answer: b) Roll Up

16. What is the main purpose of data mining?


○ a) To store large amounts of data
○ b) To extract patterns from large data sets
○ c) To design data warehouses
○ d) To clean and prepare data for analysis
○ Answer: b) To extract patterns from large data sets
17. Which data mining technique is used to predict a categorical label?
○ a) Classification
○ b) Clustering
○ c) Regression
○ d) Association
○ Answer: a) Classification
18. Which data mining technique groups similar records into clusters?
○ a) Classification
○ b) Clustering
○ c) Regression
○ d) Association
○ Answer: b) Clustering
19. What does the Apriori algorithm help identify?
○ a) Predictive models
○ b) Association rules
○ c) Cluster centers
○ d) Decision trees
○ Answer: b) Association rules
20. Which of the following is NOT a common data mining algorithm?
○ a) K-Means
○ b) Decision Trees
○ c) Linear Regression
○ d) TCP/IP
○ Answer: d) TCP/IP

21. A data cube in OLAP is used for:


○ a) Storing transactional data
○ b) Performing multidimensional analysis
○ c) Normalizing databases
○ d) Designing relational schemas
○ Answer: b) Performing multidimensional analysis
22. Which OLAP operation involves breaking down data into finer levels of
detail?
○ a) Roll Up
○ b) Drill Down
○ c) Slice
○ d) Dice
○ Answer: b) Drill Down
23. What is the primary advantage of a data cube?
○ a) Easy data entry
○ b) Efficient data retrieval
○ c) Simplified data storage
○ d) Real-time data processing
○ Answer: b) Efficient data retrieval
24. In a data cube, what is a "measure"?
○ a) A dimension
○ b) An attribute
○ c) A fact
○ d) A hierarchy
○ Answer: c) A fact
25. Which OLAP operation selects a subset of the data cube by specific
criteria?
○ a) Drill Down
○ b) Roll Up
○ c) Slice
○ d) Dice
○ Answer: d) Dice
26. A data lake is designed to store:
○ a) Structured data only
○ b) Unstructured data only
○ c) Both structured and unstructured data
○ d) Metadata only
○ Answer: c) Both structured and unstructured data
27. Which of the following is a key characteristic of a data lake?
○ a) Pre-defined schema
○ b) Schema-on-read
○ c) Limited data storage
○ d) High data redundancy
○ Answer: b) Schema-on-read
28. Data lakes are typically used for:
○ a) Transaction processing
○ b) Archival storage
○ c) Big data analytics
○ d) Data normalization
○ Answer: c) Big data analytics
29. Which of the following is NOT a common challenge with data lakes?
○ a) Data governance
○ b) Data security
○ c) Data duplication
○ d) Data warehousing
○ Answer: d) Data warehousing
30. A data lake is best suited for storing:
○ a) Small transactional datasets
○ b) Large volumes of raw data
○ c) Highly processed data
○ d) Aggregated data only
○ Answer: b) Large volumes of raw data
31. A data mart is typically:
○ a) A large, centralized data repository
○ b) A subset of a data warehouse
○ c) A type of OLAP cube
○ d) Used for transaction processing
○ Answer: b) A subset of a data warehouse
32. Which type of data mart is created for a specific department or business
unit?
○ a) Centralized
○ b) Dependent
○ c) Independent
○ d) Hybrid
○ Answer: c) Independent
33. The primary advantage of a data mart is:
○ a) Reduced data storage costs
○ b) Faster query performance
○ c) Simplified data normalization
○ d) Increased data redundancy
○ Answer: b) Faster query performance
34. Which of the following best describes a dependent data mart?
○ a) Operates independently from the data warehouse
○ b) Sourced directly from the data warehouse
○ c) Stores real-time transactional data
○ d) Used for operational processing
○ Answer: b) Sourced directly from the data warehouse
35. Data marts are often used for:
○ a) Real-time data processing
○ b) Department-specific analysis
○ c) Data warehousing
○ d) Metadata management
○ Answer: b) Department-specific analysis

36. Metadata is best described as:


○ a) Data about data
○ b) Data storage technology
○ c) Data transformation rules
○ d) Data warehouse schema
○ Answer: a) Data about data
37. Which type of metadata describes the structure of data?
○ a) Descriptive Metadata
○ b) Structural Metadata
○ c) Administrative Metadata
○ d) Process Metadata
○ Answer: b) Structural Metadata
38. In a data warehouse, metadata is used to:
○ a) Store large volumes of data
○ b) Manage data quality
○ c) Describe data content and structure
○ d) Perform data transformations
○ Answer: c) Describe data content and structure
39. Which of the following is NOT a category of metadata?
○ a) Descriptive Metadata
○ b) Analytical Metadata
○ c) Structural Metadata
○ d) Administrative Metadata
○ Answer: b) Analytical Metadata
40. Metadata management is important for:
○ a) Data redundancy
○ b) Data quality and governance
○ c) Data storage optimization
○ d) Data encryption
○ Answer: b) Data quality and governance

41. Data governance refers to:


○ a) Storing data in databases
○ b) Managing data quality, privacy, and security
○ c) Designing data warehouses
○ d) Analyzing business data
○ Answer: b) Managing data quality, privacy, and security
42. A key component of data governance is:
○ a) Data warehousing
○ b) Data visualization
○ c) Data stewardship
○ d) Data extraction
○ Answer: c) Data stewardship
43. Data governance helps ensure:
○ a) Data redundancy
○ b) Data accuracy and consistency
○ c) Data compression
○ d) Data visualization
○ Answer: b) Data accuracy and consistency
44. Which role is typically responsible for overseeing data governance
policies?
○ a) Data Analyst
○ b) Data Engineer
○ c) Chief Data Officer (CDO)
○ d) Business Analyst
○ Answer: c) Chief Data Officer (CDO)
45. Data governance frameworks often include:
○ a) Data modeling tools
○ b) Data encryption standards
○ c) Policies and procedures for data management
○ d) ETL processes
○ Answer: c) Policies and procedures for data management

Data Security and Privacy

46. Data encryption is used to:


○ a) Improve query performance
○ b) Protect data privacy and security
○ c) Normalize data
○ d) Analyze data
○ Answer: b) Protect data privacy and security
47. Which of the following is a common data security threat?
○ a) Data warehousing
○ b) Data breaches
○ c) Data normalization
○ d) Data redundancy
○ Answer: b) Data breaches
48. Data anonymization is a technique used to:
○ a) Increase data storage capacity
○ b) Protect individual privacy
○ c) Improve data quality
○ d) Enhance data visualization
○ Answer: b) Protect individual privacy
49. In the context of data privacy, what does GDPR stand for?
○ a) General Data Processing Regulation
○ b) General Data Protection Regulation
○ c) Global Data Privacy Regulation
○ d) General Digital Privacy Regulation
○ Answer: b) General Data Protection Regulation
50. Data masking is primarily used to:
○ a) Improve data analysis
○ b) Hide sensitive information
○ c) Compress data
○ d) Store data efficiently
○ Answer: b) Hide sensitive information

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy