Implementing Data Governance
Implementing Data Governance
Implementing Data Governance
Implementing
Data Governance
How to Develop a Strategy,
Determine a Value Proposition,
and Build a Business Case
BUSINESS CASE
VA L U E P R O P O S I T I O N
STRATEGY
AUTHORS:
Contents
3 Executive Summary
31 Conclusion
Executive Summary
Traditional Since data was first stored and managed, systems that enable
data governance users to locate and work collaboratively on it have been essen-
strategies and toolsets tial—think of the antiquated referencing systems that still exist in
cannot effectively some libraries today.
manage the terabytes
of data accumulated However, traditional data governance strategies and toolsets
since the advent of cannot effectively manage the terabytes of data accumulated
the internet. since the advent of the internet.
Zettabytes 180
160
140
120
100
80
60
40
20
Data created 0
2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020
2021 2022 2023 2024 2025
1
Reinsel, David, Gantz, John, Rydning, John.
“The Digitization of the World From Edge to Core.” Seagate, November, 2018,
https://www.seagate.com/files/www-content/our-story/trends/files/dataage-idc-report-final.pdf
(PDF version downloaded November, 2020)
Splitting the storage and management of data like this can only lead
to confusion—unless there is a substantial governance strategy in
place.
With so much
data and so many Organizations face similar obstacles when rolling out company-wide
advanced analysis tools data-focused solutions too. For example, single departments may
available, it’s possible to require specific datasets at a particular time. This data is likely to
obtain more and more concern a different customer group or product to another depart-
value from combined ment.
data sets.
Even if the product or customer data is from the same source, it
could have undergone subtle changes that expose personal identifi-
able information (PII) or alter its meaning.
2
European Commission. “Data Protection in the EU.” Accessed November, 2020,
https://ec.europa.eu/info/law/law-topic/data-protection/data-protection-eu_en
3
UNCTAD. “Data Protection and Privacy Legislation Worldwide.” Accessed November, 2020,
https://unctad.org/page/data-protection-and-privacy-legislation-worldwide
4
Dataversity. “The Difference
Between Data Governance & Data
Management.” Accessed November,
2020, https://www.dataversity.
net/the-difference-between-data-
governance-data-management/
Data Architecture
Data architecture group Define the current state of data in the organization
responsibilities
Provide a standard business vocabulary for data and components
From a governance perspective, this group plays a vital role in the overall
data governance strategy. In most cases, this group is either:
Oversee projects
Governance Create standards for data modeling and ensure the entire organi-
responsibilities zation follows these standards
Key goals Manage the availability of data throughout its life cycle
From a governance Various storage metrics, like the number of databases, transactional
perspective, the following statistics, capacity metrics, storage metrics, number of requests, and
should be available improvement services
Information asset tracking tools that ensure the company meets license
requirements and the cost of ownership is understood
Data Security
But this challenge doesn’t stop at external threats. Data security proto-
cols should also prevent unauthorized users within an organization from
intentionally or unintentionally accessing or manipulating prohibited data
sets too.
Data security goals Enable appropriate and prevent inappropriate access to enterprise data
assets
Understand and comply with all relevant regulations and policies for
privacy, protection, and confidentiality
Ensure that the privacy and confidentiality needs of all stakeholders are
both enforced and audited
To achieve these goals, IT security teams use various tools and tech-
niques like encryption, antivirus software, malware attack prevention,
and more. Database administrators and IT security teams are usually
responsible for managing data security.
Data integration Provide data securely, with regulatory compliance, in the format and
goals time frame required
Data integration Data Sharing Agreements: Before developing interfaces or the provision
requirements of data electronically, there was a requirement to establish a data shar-
ing agreement or memorandum of understanding (MOU). This agree-
ment stipulates the responsibilities and acceptable use of data to be
exchanged and is approved by the business data stewards of the data in
question.
Data Lineage: Data lineage is a crucial requirement for data governance
as, without it, you cannot conduct any impact analysis when making data
changes. Analysts must document the data lineage and confirm how
data is flowing from one system to another.
Metrics of Data Integration: Metrics are required to measure the scale
and benefits of a data integration solution on availability, usage, volume,
cost, and speed.
Data exists in many formats such as PDF, text file, JPG, or one of
many other document types. Everything else, like MP3 or MP4,
can be considered content and is stored in any format other than
RDBMS (Relational Database Management Systems).
You must have governance in place for unstructured data for the
following reasons:
The following are the Define MDM drivers and requirements by analyzing your business
MDM activities
Evaluate and assess all data sources
Data Warehousing
and Business Intelligence
From here, data analysts are theoretically able to access any data
they need to make crucial business decisions. Traditionally, IT teams
used an Extract, Transform, and Load process (ETL) to upload and
store data in a data warehouse. In this manner, data is moved in
batches and on daily schedules. But there’s a limit to how much you
can move at once, so traditional data warehouses often require up-
dating. This method also requires a lot of resources like CPU, memo-
ry, and bandwidth.
Metadata
Metadata is data in the fine print, the information used to find and
categorize data. As well as making data discoverable, you can use
metadata to find common relationships between data sets too.
Data Quality
Good data quality improves the overall usage of data and makes
data-driven decisions a reality. Good data quality is one of the
primary objectives of a data governance program. If you follow all
of the processes explored in this section, you end up with quality
data. Teams need to tag data accurately for provenance, it needs
to be stored safely and securely, and it must be well-referenced.
Data quality team Develop a governed approach to make data fit for the needs of
objectives consumers
In the early days of data governance, there was a great deal of focus on
developing specific data architecture. Now, using modern data gover-
nance technologies, like the OvalEdge data catalog and governance tool,
architecture diagrams are automatically built using the raw data.
Individual depart- When presented and managed correctly, it’s possible to convert data
ments and employees can into trusted insights that lead to better business decisions. This pro-
use the wealth of data and cess also drives innovation in an organization.
insights at their fingertips
to develop game-changing Individual departments and employees can use the wealth of data
ideas with company-wide and insights at their fingertips to develop game-changing ideas with
implications. company-wide implications. But before an organization can innovate
with its data, several processes need to be completed, or at least
activated, first.
DATA LITERACY
Data literacy is all about education. It’s the process by which an or-
ganization puts in place measures to ensure all data users within that
organization receive education to a level that enables them to con-
sume data confidently.
To build a culture where users can effectively utilize data, the way you distrib-
ute, store, and manage it must be transparent. This transparency leads to trust
in data, and when users consider data trustworthy, they will do more with it.
You don’t have to make all the data in your organization available to
everyone to be transparent. This action would lead to scenarios where
protected or confidential data is made available to unauthorized users, in
opposition to internal and external regulatory requirements.
Instead, a company should clearly state where and what its data is,
where it’s coming from, who is using it, who owns it, and whom to con-
tact if you need access to it. The OvalEdge data catalog is one such way
for organizations to present the data available to users while withholding
access to restricted information.
With technology like this in place, organizations can build trust by dis-
playing the location and content of the available data without necessarily
giving everyone access.
ACCESS
In an ideal scenario, all users have equal access to the data they re-
quire—as long as no restrictions are in place to protect PII or other
information. This scenario is achievable through smart cataloging and
classification.
The aim is to make data searchable so users can access it easily. When
dealing with teams of people who are becoming more data literate but
aren’t proficient data analysts, accessibility is incredibly important. Not
only does it instill confidence in newly literate business users, but it also
encourages experimentation.
SELF-SERVICE ANALYTICS
The more systems that are put in place to streamline the data gover-
nance process and track KPIs, the better the quality of data becomes.
Every company department has a specific business purpose for the data
they access. When data literate staff can access and analyze this data, they
can determine the correct KPIs required to track how well it’s performing.
Again, modern smart technology enables users to set and access these
KPIs once data is unified and organized in one place.
Laws governing As we mentioned earlier on in this whitepaper, the first data governance
data privacy now exist strategies were developed by the banking industry to aid compliance. Al-
in different countries, though not the only factor, compliance is still a driving force behind data
regions, and even states governance practices today.
The difference is, the number of laws and regulations has erupted, and
so too have the internal data policies of many organizations. Laws gov-
erning data privacy now exist in different countries, regions, and even
states, and as businesses operate at a global level, it’s crucial to comply
with these laws—regardless of where you are based.
Today, failing to Today, failing to follow data protection regulations could result in stag-
follow data protection gering fines. For example, any organization that breaks the GDPR could
regulations could result face fines up to 20 million euros or 4% of global annual revenue5.
in staggering fines.
There are three key areas to consider if you wish to address compliance
issues in modern data governance: standardization, knowledge, and
lineage.
5
GDPR.EU. “What are the GDPR Fines?” Accessed November, 2020, https://gdpr.eu/fines/
STANDARDIZATION
CLASSIFICATION
The next part is knowing where to put it. With a data catalog like
OvalEdge’s, users can easily search for and find information about the
data they are using. This process is incredibly useful when it comes to
ensuring compliance.
LINEAGE
Lineage is vital for tracking data. With processes in place to confirm data
lineage, it’s possible to see if and how anyone has altered the data, who
it belongs to, and what important information it contains.
Most business Today, most business executives expect Chief Information Officers
executives expect Chief (CIO) and Chief Data Officers (CDO) to do more with fewer resourc-
Information Officers (CIO) es. Budgets are getting cut, and there is a powerful drive to trans-
and Chief Data Officers form existing data models into modern systems.
(CDO) to do more with
fewer resources. This outcome is now the key objective, but data teams will only real-
ize this goal by following several fundamental processes first.
DATA DISCOVERY
IMPACT ANALYSIS
METADATA MANAGEMENT
You must fully understand the data you are using if you want to get
the most out of it. For this reason, one of the most important driv-
ers of efficient data analysis is managing this metadata.
The vital point that data and IT teams need to grasp is that more
metadata does not equate to more useful information. Instead, the
primary utility of metadata comes from how it’s managed.
ROI for modern Determining an exact return on investment (ROI) for a data governance
data governance is program is difficult. There are some areas where it’s possible to calculate
measured by use case. ROI and others where it isn’t—right now. However, when looking at ROI as
The most important of a whole, you can combine all of these elements from different use cases
these are innovation and to confirm the benefits of stringent data governance provisions.
data-driven decisions
As we have explained, modern data governance is value-driven and
always use-case specific. Smart technology calculates and comprehends
pain points and develops solutions based on value.
ROI for modern data governance is measured by use case. The most
important of these are innovation and data-driven decisions, the ROI on
which can be exponential if you implement strategies correctly. Howev-
er, the ROI of data-driven innovation is only calculable a reasonably long
time after the event.
For this reason, let’s start by documenting the ROI of increased efficien-
cy, the use case that’s easiest to calculate.
Efficiency
Calculating the Calculating the ROI of data governance in regards to efficiency is straight-
ROI of data governance forward—you should recuperate your initial investment relatively quickly.
in regards to efficiency
is straightforward— Improved efficiency within data teams leads to time saved within these
you should recuperate specific teams and, because there are platforms in place to make data
your initial investment processes more efficient and lead to increased data literacy, organiza-
relatively quickly. tions record savings downstream too.
Self-service has a significant impact on ROI. When users can access and
use data themselves without the need to go via a member of the data or
IT team, the ROI is staggering. One report by Forrester included analysis
from seven companies that had used a modern data governance tool simi-
lar to the OvalEdge data catalog. Here are the findings over three years:
Compliance
The EU’s GDPR It’s almost impossible to calculate the ROI through compliance. However,
has the highest penalty, what you can do is calculate the risks you mitigate by ensuring that you
but there are some other have a data auditing strategy in place. The best way to illustrate this is to
significant fines too. look over the most comprehensive data protection regulations and eval-
uate what you stand to lose if you don’t comply with them—bear in mind
any internal data or privacy protection protocols too.
Unsurprisingly, the EU’s GDPR has the highest penalty, but there are
some other significant fines too.
6
Forrester. “A Forrester Total Economic ImpactTM Study Commissioned By Alation.” Alation, October,
2019, https://www.alation.com/wp-content/uploads/Forrester-TEI-Alation-Final-10.08.2019.pdf (PDF
version downloaded November, 2020)
7
State of California Department of Justice. “California Consumer Privacy Act (CCPA).” Accessed
November, 2020, https://www.oag.ca.gov/privacy/ccpa
8
Dubai International Financial Centre. “Data Protection Law DIFC Law No. 5 of 2020.” Dubai
International Financial Centre, October, 2020, https://www.difc.ae/files/6115/9358/6486/Data_
Protection_Law_DIFC_Law_No.5_of_2020.pdf (PDF version downloaded November, 2020)
Failure to follow the Lei Geral de Proteção de Dados (LGPD)9 can result in
penalties equal to 2% of a private legal entity’s, group’s, or conglomerate’s
Brazillian revenue minus taxes for the previous tax year, or a fine of 50
million reals ($9 million).
Data-Driven Decisions
Value through innovation is the most difficult ROI to calculate. Firstly, this
is a company by company consideration and every single entity will use
its data to innovate in different ways. How much they get from the data
is dependent on how much they are willing to invest in data governance
strategies. Secondly, data-driven decisions drive value over the long-term
so it takes at least a year—more likely two to three years—to get to a
stage where it’s possible to analyze outcomes.
Business leaders Business leaders make decisions every day. When these decisions are
make decisions every backed by trusted insights, it leads to better outcomes. This directly af-
day. When these fects a company’s top or bottom line. In terms of innovation by creating
decisions are backed by new data products, this is entirely determined by the sporadic nature of
trusted insights, it leads innovation and the independent speeds that it progresses at.
to better outcomes.
Even with a solid data delivery platform in place and all the data required
to innovate at a user’s fingertips, it’s very difficult to predict when and
how innovation will occur.
9
GDPR.EU. “What is the LGPD? Brazil’s version of the GDPR.” Accessed November, 2020,
https://gdpr.eu/gdpr-vs-lgpd/
10
Singapore Statutes Online. “Personal Data Protection Act 2012.” Accessed November, 2020,
https://gdpr.eu/gdpr-vs-lgpd/
11
Alkudmani, Fares. “GDPR: The 6 Biggest Fines Enforced by Regulators So Far.” Secure Privacy (blog),
January, 2020, https://secureprivacy.ai/gdpr-the-6-biggest-fines-enforced-by-regulators-so-far/
Over time, teams will build more use cases with the technology avail-
able to them and eventually there will come a pivot point where this
new use case, say a recommendation engine, for example, is rolled
out.
Even at this point, however, it’s better to analyze the ROI only after
lots of people have begun to use the technology. Although it’s difficult
to calculate ROI in regards to innovation, what is certain is that data is
the foundation for digital transformation and every company is aware
of its importance.
Imagine if you owned a plot of land and underneath this land, unbe-
knownst to you, was an untapped reserve of crude oil. You know the
value of crude oil but you don’t know that this great wealth is right
below you. Until you explore it you will never be able to realize its
value.
Calculating ROI:
OvalEdge Case Study
Conclusion
With big data playing such an integral part in the growth, success,
and evolution of businesses in every sector, data governance is
already one of the most important considerations for management
teams.