Basis Midterm Database

Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

Information Technology Concepts: Database System and Big Data

A. Why Learn About Organizing Data?

B. Data Fundamentals

C. Data Modeling and Database Characteristics

D. Relational Database Management Systems (DBMSs)

E. Big Data

F. F. Technologies Used to Process Big Data


A. Why Learn About Organizing Data?

* tools and processes that enable users to manage all this data so that it
can be used to uncover new insights and make effective decisions.

It ensures efficiency, accuracy, and the ability to


extract valuable insights that can inform better
decision-making

A database is a well-designed, organized, and carefully managed collection of data

database management system (DBMS): A group of programs used to access and


manage a database as well as provide an interface between the database and its
users and other application programs

hierarchy of data: Bits, characters, fields, records, files, and databases


Data Fundamentals:
bit: A binary digit (i.e., 0 or 1)
that represents a circuit that is
either on or off.

character: A basic building block


of most information, consisting of
uppercase letters, lowercase
letters, numeric digits, or special
symbols.

field: Typically a name, a number,


or a combination of characters
that describes an aspect of a
business object or activity.

record: A collection of data


fields all related to one object,
activity, or individual.

file: A collection of related


records.
Data is one of the most valuable resources that a firm possesses.

It is organized into a hierarchy that builds from the smallest element to the
largest.
The smallest element is the bit, a binary digit.
A byte (a character such as a letter or numeric digit) is made up of eight
bits. A group of characters, such as a name or number, is called a field (an
object). A collection of related fields is a record; a collection of related
records is called a file. The database, at the top of the
hierarchy, is an integrated collection of records and files.

An entity is a generalized class of objects (such as a person, place, or thing)


for which data is collected, stored, and maintained. An attribute is a
characteristic of an entity. Specific values of attributes—called data items—
can be found in the fields of the record describing an entity.
A data key is a field within a record that is used to identify the record. A
Data Entities, Attributes, and Keys

An entity is a generalized class of objects (such as a person, place, or thing)


for which data is collected, stored, and maintained. An attribute is a
character istic of an entity. Specific values of attributes—called data items—
can be found in the fields of the record describing an entity. A data key is a
field within a record that is used to identify the record. A primary key
uniquely identifies a record, while a secondary key is a field in a record that
does not uniquely
identify the record.
Data modelling is a key aspect of organizing data and
information.
When building a database, an organization must consider content, access,
logical structure, physical organization, archiving, and security of the database.

One of the tools that database designers use to show the logical structure and
relationships among data is a data model. A data model is a map or diagram
of entities and their relationships. Enterprise data modeling involves analyzing
the data and information needs of an entire organization and provides a
roadmap for building database and information systems by creating a single
definition and format for data that can ensure compatibility and the ability to
exchange and integrate data among systems.
An enterprise data model involves analyzing the data and information
needs of an entire organization and provides a roadmap for building
database and information systems by creating a single definition and
format for data that can ensure compatibility and the ability to
exchange and integrate data among systems.
Entity-relationship (ER) diagrams can be used to show the relationships
among entities in the organization. (key components : entities, attributes,
relationships)
Here are some of the benefits of using
ERDs:

•Improved database design: ERDs help


visualize the structure of a database,
making it easier to identify potential
flaws or inefficiencies before
implementation.

•Enhanced communication: They


provide a clear and common
language for both technical and non-
technical stakeholders to understand
the database structure.

•Efficient data management: ERDs


can help optimize data storage and
retrieval by structuring the database
in a way that facilitates efficient
Relational Database Model
The relational database model is a simple but highly useful way to organize data into collections of two-
dimensional tables called relations. Each row in the table represents an entity, and each column
represents an attribute of that entity
Manipulating Data
Each attribute can be constrained to a range of
allowable values called its domain. The domain for a
particular attribute indicates what values can be
placed in each column of the relational table.
After entering data into a relational database, users can
make inquiries and
analyze the data. Basic data manipulations include selecting,
projecting, and
joining. Selecting
Projecting involves
involves eliminating
eliminating rows
columns in aaccording
table. to
certain criteria.
Joining involves combining two or more tables.

Linking, the ability to combine two or more tables through


common data attributes to form a new table with only the
unique data attributes, is one of the keys to the flexibility and
power of relational databases.
Data cleansing (data cleaning or data
scrubbing): The process of detecting and then
correcting or deleting incomplete, incorrect,
inaccurate, or irrelevant records that reside in
a database.
The goal of data cleansing is to improve the quality of the
data used in decision making. The “bad data” may have
been caused by user data-entry errors or by data
corruption during data transmission or storage. Data
cleansing is different from data validation, which involves
the identification of “bad data” and its rejection at the
time of data entry.
DBMS is a group of programs used as an interface between a database and application
programs or between a database and the user.
Structured Query Language (SQL): A special-purpose programming language for
accessing and manipulating data stored in a relational database. (specific language used
to interact with relational databases). It allows you to perform tasks like fetching data,
updating records, and creating new tables. Think of SQL as the tool you use to
communicate with a database
ACID (Atomicity, Consistency, Isolation, Durability) is a set of properties that define
how a database transaction should behave. ACID ensures data integrity by
guaranteeing certain outcomes for database operations. Imagine ACID as the rules that
govern how data is handled within a database.

In short:
•SQL is a language for relational databases.
•ACID defines properties for database transactions.
•Most SQL databases are ACID-compliant, but not
all.
•NoSQL databases often have different consistency
models than ACID.
Database Activities
Databases are used to provide a user view of the database, to add and modify data, to
store and retrieve data, and to manipulate the data and generate reports.
schema: A
description
that defines
the logical and
physical
structure of
the database
by identifying
the tables, the
fields in each
table, and the
relation ships
between fields
and tables.

data definition language (DDL): A


collection of instructions and com
ands used to define and describe data
and relationships in a specific
database. A DDL allows the database’s
creator to describe the data and The use of the letter X - reveals where specific information
concerning the database should be entered. File description,
relationships that are to be contained area description, record description, and set description are
terms the DDL defines and
in the schema. In general, a DDL uses in this example. Other terms and commands can also be
describes logical access paths and used, depending on the DBMS employed.

logical records in the database.


data dictionary: A detailed description of all the data used in the
database.
● Name of the data item
● Aliases or other names that may be used to
describe the item
● Range of values that can be used
● Type of data (such as alphanumeric or numeric)
● Amount of storage needed for the item
● Notation of the person responsible for updating it
and the various users who can access it
● List of reports that use the data item

A data dictionary is a valuable tool for maintaining an efficient


database that stores reliable information with no redundancy,
and it simplifies the process of modifying the database when
necessary. Data dictionaries also help computer and system
programmers who require a detailed description of data
elements stored in a database to create the code to access the
data.
Storing and Retrieving Data
One function of a DBMS is to be an
interface between an application program
and the database. When an application
program needs data, it requests the data
through the DBMS.

concurrency control: A method of


dealing with a situation in which
two or more users or applications
need to access the same record at
the same time.
Manipulating Data and Generating
Reports
After a DBMS has been installed,
employees, managers, and other
authorized users can use it to review
reports and obtain important
information. Using a DBMS, a company
can manage this requirement. Some
databases use Query by Example (QBE),
which is a visual approach to developing
database queries or requests. With QBE,
you can perform queries and other
database tasks by opening windows and
clicking the data or features you want—
similar to the way you work with
Windows and other GUI (graphical user
interface) operating systems and
applications
data manipulation language (DML): A specific language, provided with a
DBMS, which allows users to access and modify the data, to make queries,
and to generate reports.
This specific language, provided with the DBMS, allows managers and other database users to access
and modify the data, to make queries, and to generate reports. Again, the application programs go
through schemas and the DBMS before getting to the data stored on a device such as a disk.

After a database has been set up and loaded with data, it can produce desired reports, documents,
and other outputs

A DBMS can produce a wide variety of documents, reports, and other out
put that can help organizations make decisions and achieve their goals. Often,
organizations have standard reports that are run on a regular basis. The
most common reports select and organize data to present summary
information about some aspect of company operations.
database administrators (DBAs): Skilled and trained IS professionals who hold
discussions with business users to define their data needs; apply database
programming languages to craft a set of databases to meet those needs; test
and evaluate databases; implement changes to improve the performance of
databases; and assure that data is secure from unauthorized access
The role of the DBA is to plan, design, create, operate, secure, monitor,
and maintain databases. Typically, a DBA has a degree in computer science
or management information systems and some on-the-job training with a
particular database product or more extensive experience with a range of
database products
data administrator: An individual responsible for defining and implementing
consistent principles for a variety of data issues. (including setting data
standards and data definitions that apply across all the databases in an
organization)
database as a service (DaaS): An arrangement where the database is stored on a
service provider’s servers and accessed by the service subscriber over a network,
typically the Internet, with the database administration handled by the service
provider. (examples Amazon, Database.com, Google, Heroku, IBM, Intuit,
Microsoft, MyOwnDB, Oracle, and Trackvia)
Using Databases with Other Software

Database management systems are often used with other software and to interact with users
over the Internet.

A front-end application is one that people interact with directly.

A back-end application interacts with other programs or applications; it


only indirectly interacts with people or users.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy