0% found this document useful (0 votes)
10 views7 pages

bda quiz QA

Download as pdf or txt
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 7

BIGDATA ANALYTICS [21CS71]

QUIZ BASED ON MODULE 1 & 3

1. What is a defining characteristic of Big Data?

A. Fixed schema

B. Small data sets

C. High volume, velocity, and variety

D. Centralized storage

Answer: C. High volume, velocity, and variety

2. Which of the following best describes scalability in Big Data systems?

A. The ability to process data using a single machine

B. The capacity to handle increasing amounts of work efficiently

C. Reduction in hardware requirements

D. Limiting system performance to a specific level

Answer: B. The capacity to handle increasing amounts of work efficiently

3. What is parallel processing in Big Data analytics?

A. Executing tasks sequentially on a single processor

B. Using multiple processors to perform tasks simultaneously

C. Storing data in parallel databases

D. Running one task after another in batches

Answer: B. Using multiple processors to perform tasks simultaneously

4. In Big Data architecture design, what does the term "data lake" refer to?

A. A highly structured database

B. A central repository for raw, unstructured, and semi-structured data

C. A real-time analytics platform

D. A system for archiving historical data

Answer: B. A central repository for raw, unstructured, and semi-structured data


5. Why is data preprocessing important in Big Data analytics?

A. To directly apply machine learning algorithms

B. To improve data quality and prepare it for analysis

C. To eliminate all data redundancies

D. To generate insights without any human intervention

Answer: B. To improve data quality and prepare it for analysis

6. Which of the following is an example of structured data?

A. Sensor readings stored in a relational database

B. Customer reviews on an e-commerce platform

C. Images stored in a cloud system

D. Tweets and social media posts

Answer: A. Sensor readings stored in a relational database

7. What is a common method of storing Big Data in distributed systems?

A. Relational databases

B. Spreadsheets

C. Hadoop Distributed File System (HDFS)

D. Local drives

Answer: C. Hadoop Distributed File System (HDFS)

8. Which of the following technologies is commonly used for Big Data analysis?

A. Microsoft Word

B. Apache Spark

C. JavaScript

D. Adobe Photoshop

Answer: B. Apache Spark

9. What is the main advantage of using case studies in Big Data Analytics?

A. Eliminating data redundancy

B. Understanding real-world applications and practical challenges

C. Reducing data storage requirements

D. Limiting the scope of analysis


Answer: B. Understanding real-world applications and practical challenges

10. Which of the following is NOT an example of a Big Data application?

A. Real-time fraud detection in banking

B. Personalized recommendations in e-commerce

C. Image editing in design software

D. Predictive maintenance in manufacturing

Answer: C. Image editing in design software

11. What is the primary goal of Big Data analytics?

A. To store all data indefinitely

B. To derive meaningful insights from large and complex datasets

C. To reduce storage requirements

D. To ensure data is structured

Answer: B. To derive meaningful insights from large and complex datasets

12. Which of the following is NOT a common challenge in Big Data analytics?

A. Handling data velocity

B. Ensuring data quality

C. Performing data analysis on small datasets

D. Managing data variety

Answer: C. Performing data analysis on small datasets

13. What is the role of data preprocessing in Big Data analytics?

A. To discard irrelevant data

B. To ensure data is in a usable format for analysis

C. To create visualizations directly

D. To archive data for long-term storage

Answer: B. To ensure data is in a usable format for analysis

14. Which of the following describes a distributed data storage system?

A. All data is stored on a single server

B. Data is spread across multiple servers for scalability and redundancy

C. Data is only accessible to a single application

D. Data is stored in physical spreadsheets

Answer: B. Data is spread across multiple servers for scalability and redundancy
15. What does "data velocity" refer to in Big Data?

A. The variety of data types in a dataset


B. The speed at which data is generated and processed
C. The accuracy of the data collected
D. The method used to store data

Answer: B. The speed at which data is generated and processed

16. What is a key characteristic of NoSQL databases?

A. Fixed schema

B. Tabular structure

C. Scalability and flexibility for unstructured data

D. Dependency on a single server

Answer: C. Scalability and flexibility for unstructured data

17. Which of the following is NOT a type of NoSQL database?

A. Key-Value Store

B. Column-Family Store

C. Relational Database

D. Document Store

Answer: C. Relational Database

18. What is the purpose of a "shared-nothing" architecture in Big Data management?

A. To centralize data processing

B. To distribute tasks across nodes without shared memory or storage

C. To reduce system redundancy

D. To use a single machine for all tasks

Answer: B. To distribute tasks across nodes without shared memory or storage

19. Which data model does MongoDB primarily use?

A. Tabular

B. Key-value pairs

C. Document-oriented

D. Graph-based

Answer: C. Document-oriented

20. In Cassandra, what is a "column family"?


A. A collection of similar rows stored together

B. A single field in a table

C. A type of relational database table

D. A file format for NoSQL databases

Answer: A. A collection of similar rows stored together

21. Which of the following is true about MongoDB?

A. It requires a predefined schema for all collections.

B. It stores data in tables with rows and columns.

C. It stores data as JSON-like documents.

D. It uses SQL for querying data.

Answer: C. It stores data as JSON-like documents.

22. What is a key advantage of Cassandra over traditional relational databases?

A. Schema enforcement

B. Support for complex joins

C. High availability and fault tolerance

D. Single-node operation

Answer: C. High availability and fault tolerance

23. Which query language is used to interact with Cassandra?

A. SQL

B. CQL (Cassandra Query Language)

C. JSON

D. XML

Answer: B. CQL (Cassandra Query Language)

24. What is a primary use case for MongoDB?

A. Complex transactional systems

B. Managing highly relational data

C. Real-time analytics for unstructured data

D. Storing images and videos directly

Answer: C. Real-time analytics for unstructured data

25. What is the CAP theorem in the context of NoSQL databases?

A. A theory about consistency, accuracy, and persistence


B. A trade-off between consistency, availability, and partition tolerance

C. A principle for designing relational databases

D. A formula for calculating data throughput

Answer: B. A trade-off between consistency, availability, and partition tolerance

26.What is a primary advantage of NoSQL databases over traditional relational databases?

A. Enforcing strict schema rules

B. Handling structured data only

C. Scalability and ability to handle unstructured or semi-structured data

D. Support for complex SQL joins

Answer: C. Scalability and ability to handle unstructured or semi-structured data

27. Which of the following is a NoSQL data architecture pattern?

A. Master-Slave replication

B. Normalized relational model

C. Distributed key-value store

D. Entity-Relationship modelling

Answer: C. Distributed key-value store

28. How does a "shared-nothing" architecture benefit Big Data systems?

A. It minimizes communication overhead by centralizing processing

B. It ensures that each node operates independently without shared memory or storage

C. It eliminates the need for replication and backups

D. It allows only single-threaded processing

Answer: B. It ensures that each node operates independently without shared memory or storage

29. In MongoDB, what is a "collection"?

A. A single row in a table

B. A grouping of documents in a database, similar to a table

C. A method to store binary files

D. A query used to retrieve data

Answer: B. A grouping of documents in a database, similar to a table

30. What feature of Cassandra makes it particularly suitable for Big Data applications?

A. Strong consistency guarantees

B. High availability through decentralized architecture


C. Fixed schema for all data

D. Built-in support for SQL

Answer: B. High availability through decentralized architecture

31. What is the primary difference between MongoDB and Cassandra?

A. MongoDB uses column-family storage; Cassandra uses document storage

B. Cassandra is schema-less, while MongoDB enforces strict schemas

C. MongoDB is document-oriented; Cassandra uses a column-family structure

D. MongoDB supports SQL queries, while Cassandra does not

Answer: C. MongoDB is document-oriented; Cassandra uses a column-family structure

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy