05 MIS11e_ch03wKeyTermsConceptsReviewedEx
05 MIS11e_ch03wKeyTermsConceptsReviewedEx
Bidgoli, MIS, 11th Edition. © 2023 Cengage. All Rights Reserved. May not be scanned, copied or duplicated,
or posted to a publicly accessible website, in whole or in part.
Learning Objectives
2
Databases (1 of 4)
• Database
• Collection of related data that can be stored in a central location or in
multiple locations
• Usually a group of files
• File
• Group of related records
• All files are integrated → information can be linked
• Record
• Group of related fields
• Data hierarchy
• Structure and organization of data, which involves fields, records, and files
3
Databases (2 of 4)
– For the above example, fields consist of social security number, student
name, and address
– All the fields storing information for Mary Smith, for instance, constitute a
record
– All the three records make up the student file
4
Databases (3 of 4)
5
Databases (4 of 4)
• Internal data
• Collected from within an organization
• E.g. transaction records, sales records, and so forth
• Stored in the organization’s internal databases
• External data
• Comes from a variety of sources
• E.g. competitors, customers, suppliers, etc.
• Often stored in a data warehouse
8
Logical Database Design (1 of 2)
• Data model
• Determines how data is created, represented, organized, and
maintained
• Includes:
• Data structure
• Operations
• Integrity rules
10
The Relational Model (1 of 5)
• Relational model
• Uses a two-dimensional table of rows and columns of data
• Rows are records and columns are fields (i.e., attributes)
• Data dictionary
• Stores definitions of each table and the fields in it for the logical
structure of a relational database
• Field name – e.g. student name, age, admission data, etc.
• Field data type – e.g. text, date, and number
• Default value – e.g. value entered if none is available
• Validation rule – e.g. determine whether a value is valid
11
Relational Model (2 of 5)
• Primary key
• Unique identifier (e.g. student ID) for every record
• Foreign key
• Primary key for one table appears in other tables
• Establishes relationships (i.e. data can be linked and retrieved amongst
tables) between tables
• Normalization
• Process to improve database efficiency
• Eliminates redundant data and ensures only related data is stored in a
table
• E.g. storing customer names in only one table
12
Relational Model (3 of 5)
13
Relational Model (4 of 5)
14
Relational Model (5 of 5)
15
Components of a DBMS
16
Database Engine
17
Data Definition
18
Data Manipulation
20
Data Administration
• Include:
• Data-driven Web sites
• Natural language processing (details will be covered in Chapter
13)
• Distributed databases
22
Data-Driven Web Sites
• Distributed database
• Data is stored on multiple servers placed throughout an organization
(in contrast to central database for all users)
• Main reasons for choosing
• Minimizes the effects of computer failures
• Helps reduce communication costs for remote users
• Supports distributed processing
• Not limited by data’s physical location
• Security issues are concerned
• Multiple access points from inside and outside the organization
24
Data Warehouses
• Data warehouse
• Collection of data (from a variety of sources) used to support
decision-making applications and generate business intelligence
• Stores multidimensional data (i.e. hypercubes)
• Characteristics of data stored in data warehouse
• Subject oriented (focused on a specific area)
• Integrated (comes from different sources)
• Time variant (categorized based on time, i.e. historical)
• Type of data (capture aggregated data)
• Purpose (for analytical use)
25
3.6 A Data Warehouse Configuration
Four major components:
i. Input
ii. Extraction, transformation, and loading (i.e. ETL)
iii. Storage
iv. Output
26
Input
27
Extraction, Transformation, and Loading (ETL)
• Extraction
• Collecting data from a variety of sources
• Converting data into a format that can be used in transformation
processing
• Transformation processing
• Make sure data meets the data warehouse’s needs
• Loading
• Process of transferring data to the data warehouse
28
Storage
29
Output (1 of 2)
• Data-mining analysis
• Discover patterns and relationships
• Reports for decision making
====================================================
• A data warehouse can allow you to do:
• Cross-reference segments of an organization’s operations for
comparison purposes
• Find patterns and trends that can’t be found with databases
• Analyze large amounts of historical data quickly
32
Data Mart (1 of 2)
• Data mart
• Smaller version of data warehouse
• Used by single department or function
• Advantages over data warehouses
• Users are targeted better as it is designed for a specific
department or division
• Faster data access because of smaller size
• Less expensive
• Easier to create because of its size and simplicity
33
Data Marts (2 of 2)
• Disadvantages
• Limited scope than data warehouses
• Consolidating information from different departments or
functional areas is more difficult
34
The Big Data Era (1 of 2)
35
The Big Data Era (2 of 2)
36
Database Marketing (1 of 2)
37
Database Marketing (2 of 2)
38
Summary
39
Key Terms
40
Key Concepts
• The join operation in a relational database combines two tables based on a
common field, which helps to reduce data redundancy in conventional “flat
files” systems.
• A distributed database stores data on multiple servers, so as to minimize the
effects of computer failure, reduce communication costs for remote users,
and support distributed processing.
• A data warehouse collects data from a variety of sources to support (through
OLAP and data-mining analysis) decision-making applications and generate
business intelligence.
• Data-mining analysis in the data warehouse helps to discover patterns,
relationships, and trends that cannot be found with single databases alone.
• Big data is powerful not only because its volume is big, but also because it
includes a variety of both structured and unstructured data.
41
Reviewed Exercise
42
43