Combined Quizzes

NAME: Adhien Kenya Anims Estetikha

STUDENT ID (NIM): 20.52.1330

INSTRUCTIONS: Choose the best option(s).

1. Which of the following is an example of big data utilized in action today?
A. Individual, Unconnected Hospital Databases
B. Social Media
C. The Internet
D. Wi-Fi Networks

2. What reasoning was given for why the "data storage to price ratio" is
relevant to big data?
A. Companies can't afford to own, maintain, and spend the energy to support large data
storage unless the cost is sufficiently low.
B. It isn't, it was just an arbitrary example of big data usage.
C. Lower prices mean larger storage becomes easier to access for everyone,
creating bigger amounts of data for client-facing services to work with.
D. Larger storage means easier accessibility to big data for every user because it allows
users to download in bulk.

3. What is the best description of personalized marketing enabled by big data?
A. Marketing to each customer on an individual level and suiting to their needs.
B. Being able to use personalized data from every single customer for personalized
marketing needs.
C. Being able to obtain and use customer information for groups of consumers and
utilize them for marketing needs.

4. Of the following, which is an example of personalized marketing related to big data?
A. A survey that asks your age and then markets a specific brand to you.
B. News outlets gathering information from the internet in order to report them to the
public.
C. Google ordering ads to show items based on recent and past search results.

7. What is the workflow for working with big data?

A. Theory -> Models -> Precise Advice
B. Extrapolation -> Understanding -> Reproducing
C. Big Data -> Better Models -> Higher Precision

8. Which is the most compelling reason why mobile advertising is related to big data?
A. Mobile advertising in and of itself is always associated with big data.
B. Mobile advertising allows massive cellular/mobile texting to a wide audience, thus
providing large amounts of data.
C. Mobile advertising benefits from data integration with location which requires big
data.
D. Since almost everyone owns a cell/mobile phone, the mobile advertising market is large
and thus requires big data to contain all the information.

9. What are the three types of diverse data sources?

A. Information Networks, Map Data, and People
B. Machine Data, Organizational Data, and People
C. Sensor Data, Organizational Data, and Social Media
D. Machine Data, Map Data, and Social Media

10. What is an example of machine data?

A. Weather station sensor output.
B. Sorted data from Amazon regarding customer info.
C. Social Media

11. What is an example of organizational data?

A. Disease data from the Centers for Disease Control and Prevention (CDC).
B. Social Media
C. Satellite Data

12. Of the three data sources, which is the hardest to implement and streamline into a model?

A. Machine Data
B. Organizational Data
C. People

13. Which of the following summarizes the process of using data streams?

A. Integration -> Personalization -> Precision
B. Big Data -> Better Models -> Higher Precision
C. Theory -> Models -> Precise Advice
D. Extrapolation -> Understanding -> Reproducing

14. Where does the real value of big data often come from?
A. Size of the data.
B. Having data-enabled decisions and actions from the insights of new data.
C. Using the three major data sources: Machines, People, and Organizations.
D. Combining streams of data and analyzing them for new insights.

15. What does it mean for a device to be "smart"?

A. Must have a way to interact with the user.
B. Having a specific processing speed in order to keep up with the demands of data
processing.
C. Connect with other devices and have knowledge of the environment.

16. What does the term "in situ" mean in the context of big data?

A. Accelerometers.
B. In the situation
C. The sensors used in airplanes to measure altitude.
D. Bringing the computation to the location of the data.

17. Which of the following are reasons mentioned for why data generated by people are hard to
process? Choose all that apply.

A. The velocity of the data is very high.
B. They cannot be modeled and stored.
C. Very unstructured data.
D. Skilled people to analyze the data are hard to come by.

18. What is the purpose of retrieval and storage; pre-processing; and analysis in order to convert
multiple data sources into valuable data?

A. To enable ETL methods.
B. To allow scalable analytical solutions to big data.
C. Designed to work like the ETL process.
D. Since the multi-layered process is built into the Neo4j database connection.

19. Which of the following are benefits of organization-generated data? Choose all that apply.

A. Higher Sales
B. Customer Satisfaction
C. Better Profit Margins
D. Improved Safety
E. High Velocity

20. What are data silos and why are they bad?

A. A giant centralized database to house all the data production within an organization. Bad
because it hinders opportunity for data generation.
B. Highly unstructured data. Bad because it does not provide meaningful results for
organizations.
C. Data produced from an organization that is spread out. Bad because it creates
unsynchronized and invisible data.
D. A giant centralized database to house all the data produced within an organization. Bad
because it is hard to maintain as highly structured data.

21. Which of the following are benefits of data integration? Choose all that apply.

A. Adds value to big data.
B. Unify your data system.
C. Monitoring of data.
D. Reduce data complexity.
E. Increase data availability.
F. Increase data collaboration.

22. Which of the following are parts of the 5 P's of data science and what is the additional P
introduced in the slides?

A. Process
B. People
C. Purpose
D. Product
E. Perception
F. Platforms
G. Programmability

23. Which of the following are part of the four main categories to acquire, access, and retrieve
data?

A. Text Files
B. Remote Data
C. NoSQL Storage
D. Traditional Databases
E. Web Services

24. What are the steps required for data analysis?

A. Classification, Regression, Analysis
B. Regression, Evaluate, Classification
C. Investigate, Build Model, Evaluate
D. Select Technique, Build Model, Evaluate

25. Of the following, which is a technique mentioned in the videos for building a model?

A. Evaluation
B. Analysis
C. Investigation
D. Validation

26. What is the first step in finding a right problem to tackle in data science?

A. Define the Problem
B. Ask the Right Questions
C. Assess the Situation
D. Define Goals

27. What is the first step in determining a big data strategy?

A. Build In-House Expertise
B. Business Objectives
C. Organizational Buy-In
D. Collect Data

28. According to Ilkay, why is exploring data crucial to better modeling? Data exploration...

A. leads to data understanding which allows an informed analysis of the data.
B. enables understanding of general trends, correlations, and outliers.
C. enables histograms and other graphs as data visualization.
D. enables a description of data which allows visualization.

29. Why is data science mainly about teamwork?

A. Engineering solutions are preferred.
B. Data science requires a variety of expertise in different fields.
C. Exhibition of curiosity is required.
D. Analytic solutions are required.

30. What are the ways to address data quality issues?

A. Merge duplicate records.
B. Remove data with missing values.
C. Generate best estimates for invalid values.
D. Remove outliers.
E. Data Wrangling

31. What is done to the data in the preparation stage?

A. Build Models
B. Retrieve Data
C. Cleaning, Integrating, and Packaging
D. Select Analytical Techniques
E. Identify Data Sets and Query Data

32. Amazon has been collecting review data for a particular product. They have found that
almost 90% of the reviews gave a 5/5 rating. However, 50% of those reviews came from
customers who did not have proof of purchase or who did not post serious reviews about the
product. Of the following, which is true about the review data collected in this situation?
A. Low Valence
B. Low Volume
C. High Valence
D. High Volume
E. High Veracity
F. Low Veracity

33. As mentioned in the slides, what are the challenges of data with high valence?
A. Complex Data Exploration Algorithms
B. Difficult to Integrate
C. Reliability of Data

34. Which of the following are the 6 V's in big data?
A. Velocity
B. Veracity
C. Value
D. Vision
E. Volume
F. Valence
G. Variety

35. What is the veracity of big data?
A. The connectedness of data.
B. The size of the data.
C. The speed at which data is produced.
D. The abnormality or uncertainties of data.

36. What are the challenges of data with high variety?
A. Hard to perform emergent behavior analysis.
B. Hard to integrate.
C. Hard in utilizing group event detection.
D. The quality of data is low.

37. Which of the following is the best way to describe why it is crucial to process data in
real-time?
A. Prevents missed opportunities.
B. More expensive to batch process.
C. More accurate.
D. Batch processing is an older method that is not as accurate as real-time processing.

38. What are the challenges with big data that has high volume?
A. Effectiveness and Cost
B. Storage and Accessibility
C. Cost, Scalability, and Performance
D. Speed Increase in Processing

39. What is the benefit of a commodity cluster?
A. Enables fault tolerance
B. Much faster than a traditional supercomputer
C. Prevents network connection failure
D. Prevents individual component failures

40. What is a way to enable fault tolerance?
A. Data-Parallel Job Restart
B. Distributed Computing
C. Better LAN Connection
D. System Wide Restart

41. What are the specific benefit(s) to a distributed file system?
A. Data Scalability
B. High Fault Tolerance
C. Large Storage
D. High Concurrency

42. Which of the following are general requirements for a programming language in order to
support big data models?
A. Enable Adding of More Racks
B. Optimization of Specific Data Types
C. Handle Fault Tolerance
D. Support Big Data Operations
E. Utilize Map Reduction Methods
