Unit - 1


Types of Business Analytics

Part B: Big data


a. DIKW Model, Data and Big Data
b. 4Vs of Big Data
c. Sources of Big Data
d. Data Quality
e. Role of a Data Analyst

What is Big Data?


According to Wikipedia, Big Data is the term for a
collection of data sets so large and complex that it becomes
difficult to process using on-hand database management tools
or traditional data processing applications.
According to IBM, Big Data is a term applied to data sets
whose size or type is beyond the ability of traditional data
processing to capture, manage and process the data.

Global Data Generated Every Day
294 billion emails
230 million tweets
1+ billion Google searches

Storage Capacity on a Single Drive
1956: Digital photos
2020: 12 million digital photos

Digital Data Storage Worldwide
2018: 4.4 zettabytes
2020: 44 zettabytes

1GB = 1000 Megabytes
1TB = 1000 Gigabytes
1PB = 1000 Terabytes
1EB = 1000 Petabytes
1ZB = 1000 Exabytes
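
The unit ladder above can be sketched as a small conversion helper. This is only an illustration: the function name `convert` and the `UNITS` list are assumptions introduced here, and it uses the decimal (factor of 1000) convention shown in these notes.

```python
# Decimal storage units from the notes, smallest to largest.
# Each step up the list is a factor of 1000.
UNITS = ["MB", "GB", "TB", "PB", "EB", "ZB"]

def convert(value, from_unit, to_unit):
    """Convert a quantity between two of the units in UNITS (hypothetical helper)."""
    steps = UNITS.index(from_unit) - UNITS.index(to_unit)
    if steps >= 0:
        # Going to a smaller unit: multiply by 1000 per step.
        return value * (1000 ** steps)
    # Going to a larger unit: divide by 1000 per step.
    return value / (1000 ** (-steps))

print(convert(1, "ZB", "EB"))    # one zettabyte expressed in exabytes
print(convert(44, "ZB", "TB"))   # the 44 ZB figure expressed in terabytes
print(convert(500, "GB", "TB"))  # 500 GB expressed in terabytes
```

Expressing the worldwide storage figures in terabytes makes it easier to compare them with the capacity of a single drive.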

Volume (Data at Rest) – Terabytes to exabytes of existing data to process.
Velocity (Data in Motion) – Streaming data, requiring a response within milliseconds to seconds.
Variety (Data in Many Forms) – Structured, unstructured, text and multimedia data.
Veracity (Data in Doubt) – Uncertainty due to data inconsistency and incompleteness,
ambiguities, latency, deception and model approximations.
Value (Data into Money) – A business model can be associated with the data.
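
Veracity is the easiest of these characteristics to check in practice. The sketch below measures incompleteness and spots inconsistent spellings in a handful of records. The records, field names and function names are all hypothetical, made up for illustration only.

```python
# Hypothetical customer records; None marks a missing value.
records = [
    {"id": 1, "city": "Mumbai", "age": 34},
    {"id": 2, "city": "mumbai ", "age": None},
    {"id": 3, "city": None, "age": 29},
    {"id": 4, "city": "Delhi", "age": 41},
]

def incompleteness(rows, field):
    """Share of rows in which the given field is missing."""
    missing = sum(1 for r in rows if r.get(field) is None)
    return missing / len(rows)

def inconsistent_spellings(rows, field):
    """Find raw values that differ only by letter case or surrounding spaces."""
    groups = {}
    for r in rows:
        raw = r.get(field)
        if raw is not None:
            groups.setdefault(raw.strip().lower(), set()).add(raw)
    # Keep only normalized values that appear under more than one raw spelling.
    return {key: raw_values for key, raw_values in groups.items() if len(raw_values) > 1}

print(incompleteness(records, "city"))
print(inconsistent_spellings(records, "city"))
```

Here one record in four has no city, and "Mumbai" appears under two different spellings, which is the kind of inconsistency that puts data "in doubt".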

Internal Sources of Big Data


• Corporate ERP Modules
• Internal Documents
• Sensors, controllers
• In-house call centers
• Website logs

External Sources of Big Data


• Social Media
• Official Statistics
• Weather forecast
• Publicly available data sets for machine learning
Big Data Analytics Decisions
Big Data according to IBM
Overview

Big data analytics is the use of advanced analytic techniques against very large, diverse big data
sets that include structured, semi-structured and unstructured data, from different sources, and in
different sizes from terabytes to zettabytes.
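
To make the three forms of data concrete, here is a minimal sketch using only the Python standard library. The sample order data, JSON event and review text are invented for illustration and do not come from IBM.

```python
import csv
import io
import json

# Structured: fixed columns, every row has the same fields.
csv_text = "order_id,amount\n101,250.0\n102,99.5\n"
rows = list(csv.DictReader(io.StringIO(csv_text)))
total = sum(float(r["amount"]) for r in rows)

# Semi-structured: self-describing keys, but nesting and fields can vary per record.
json_text = '{"user": "a01", "tags": ["sale", "mobile"], "device": {"os": "android"}}'
event = json.loads(json_text)
os_name = event.get("device", {}).get("os")

# Unstructured: free text with no schema; even a word count needs custom processing.
review = "Delivery was late but the product is great"
word_count = len(review.split())

print(total, os_name, word_count)
```

The structured rows can be summed directly, the JSON event has to be navigated key by key, and the free-text review needs its own processing before anything can be measured.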

What is big data exactly? It can be defined as data sets whose size or type is beyond the ability of
traditional relational databases to capture, manage and process the data with low latency.
Characteristics of big data include high volume, high velocity and high variety. Sources of data
are becoming more complex than those for traditional data because they are being driven by
Artificial Intelligence (AI), mobile devices, social media and the Internet of Things (IoT). For
example, the different types of data originate from sensors, devices, video/audio, networks, log
files, transactional applications, web and social media — much of it generated in real time and at
a very large scale.
With big data analytics, you can ultimately fuel better and faster decision-making, modelling and
predicting of future outcomes and enhanced business intelligence. As you build your big data
solution, consider open-source software such as Apache Hadoop, Apache Spark and the entire
Hadoop ecosystem as cost-effective, flexible data processing and storage tools designed to
handle the volume of data being generated today.

Benefits of Big Data Analytics


Faster, better decision making
Businesses can access a large volume of data and analyze a wide variety of data sources to gain
new insights and take action. Start small and scale to handle data from historical records
and in real time.

Cost reduction and operational efficiency


Flexible data processing and storage tools can help organizations save costs in storing and
analyzing large amounts of data. Discover patterns and insights that help you identify ways to do
business more efficiently.

Improved data-driven go-to-market

Analyzing data from sensors, devices, video, logs, transactional applications, web and social
media empowers an organization to be data-driven. Gauge customer needs and potential risks
and create new products and services.

CIA-1 Guidelines (Topic: Applications of Business Analytics in "Accounting")

25 mins
+ 5 mins buffer
In case of a technical glitch, present without PPT
Include examples, pictures, cases

Unit- 2: Part B DBMS and RDBMS

i. Database Terminologies
ii. DBMS and RDBMS
iii. RDBMS Data Structure
iv. Columnar Data Structures

Database Terminologies
i. Data Warehouse (structured, enterprise-wide)
ii. Data Mart (structured, subject-specific subset)
iii. Database

Data Catchment Area
