Combined Quizes
Combined Quizes
Combined Quizes
NIM : 20.52.1330
2. What reasoning was given for the following: why is the "data storage to price ratio"
relevant to big data?
A. Companies can't afford to own, maintain, and spend the energy to support large data
storage unless the cost is sufficiently low.
B. It isn't, it was just an arbitrary example of big data usage.
C. Lower prices mean larger storage becomes easier to access for everyone,
creating bigger amounts of data for client-facing services to work with.
D. Larger storage means easier accessibility to big data for every user because it allows
users to download in bulk.
4. Of the following, which is an example of personalized marketing related with big data?
A. A survey that asks your age and markets to you a specific brand
B. News outlets gathering information from the internet in order to report them to the
public.
C. Google ordering ads to show items based on recent and past search results.
8. Which is the most compelling reason why mobile advertising is related to big data?
A. Mobile advertising in and of itself is always associated with big data.
B. Mobile advertising allows massive cellular/mobile texting to a wide audience, thus
providing large amounts of data.
C. Mobile advertising benefits from data integration with location which requires big
data.
D. Since almost everyone owns a cell/mobile phone, the mobile advertising market is large
and thus requires big data to contain all the information.
12. Of the three data sources, which is the hardest to implement and streamline into a model?
A. Machine Data
B. Organizational Data
C. People
13. Which of the following summarizes the process of using data streams?
14. Where does the real value of big data often come from?
A. Size of the data.
B. Having data-enabled decisions and actions from the insights of new data.
C. Using the three major data sources: Machines, People, and Organizations.
D. Combining streams of data and analyzing them for new insights.
16. What does the term "in situ" mean in the context of big data?
A. Accelerometers.
B. In the situation
C. The sensors used in airplanes to measure altitude.
D. Bringing the computation to the location of the data.
17. Which of the following are reasons mentioned for why data generated by people are hard to
process? Choose all that apply.
18. What is the purpose of retrieval and storage; pre-processing; and analysis in order to convert
multiple data sources into valuable data?
19. Which of the following are benefits of organization-generated data? Choose all that apply.
A. Higher Sales
B. Customer Satisfaction
C. Better Profit Margins
D. Improved Safety
E. High Velocity
20. What are data silos and why are they bad?
A. A giant centralized database to house all the data production within an organization. Bad
because it hinders opportunity for data generation.
B. Highly unstructured data. Bad because it does not provide meaningful results for
organizations.
C. Data produced from an organization that is spread out. Bad because it creates
unsynchronized and invisible data.
D. A giant centralized database to house all the data produces within an organization. Bad
because it is hard to maintain as highly structured data.
21. Which of the following are benefits of data integration? Choose all that apply.
22. Which of the following are parts of the 5 P's of data science and what is the additional P
introduced in the slides?
A. Process
B. People
C. Purpose
D. Product
E. Perception
F. Platforms
G. Programmability
23. Which of the following are part of the four main categories to acquire, access, and retrieve
data?
A. Text Files
B. Remote Data
C. NoSQL Storage
D. Traditional Databases
E. Web Services
24. What are the steps required for data analysis?
25. Of the following, which is a technique mentioned in the videos for building a model?
A. Evaluation
B. Analysis
C. Investigation
D. Validation
26. What is the first step in finding a right problem to tackle in data science?
28. According to Ilkay, why is exploring data crucial to better modeling? Data exploration...
A. Build Models
B. Retrieve Data
C. Cleaning, Integrating, and Packaging
D. Select Analytical Techniques
E. Identify Data Sets and Query Data
32. Amazon has been collecting review data for a particular product. They have realized that
almost 90% of the reviews were mostly a 5/5 rating. However, of the 90%, they realized that
50% of them were customers who did not have proof of purchase or customers who did not
post serious reviews about the product. Of the following, which is true about the review data
collected in this situation?
A. Low Valence
B. Low Volume
C. High Valence
D. High Volume
E. High Veracity
F. Low Veracity
33. As mentioned in the slides, what are the challenges to data with a high valence?
A. Complex Data Exploration Algorithms
B. Difficult to Integrate
C. Reliability of Data
37. Which of the following is the best way to describe why it is crucial to process data in real-
time?
A. Prevents missed opportunities.
B. More expensive to batch process.
C. More accurate.
D. Batch processing is an older method that is not as accurate as real-time processing.
38. What are the challenges with big data that has high volume?
A. Effectiveness and Cost
B. Storage and Accessibility
C. Cost, Scalability, and Performance
D. Speed Increase in Processing
42. Which of the following are general requirements for a programming language in order to
support big data models?
A. Enable Adding of More Racks
B. Optimization of Specific Data Types
C. Handle Fault Tolerance
D. Support Big Data Operations
E. Utilize Map Reduction Methods