Unit 6 Backup and Recovery
Unit 6 Backup and Recovery
Backup methods:
Below are various backup methods:
Full Backup
As the name suggests, a full backup of a data set will copy the original set plus the new
changes every time the backup is run. It's a simple process, just copying the whole data
block every time, regardless of what's changed. This method needs a lot of storage space,
so it's best for smaller enterprises. Restoring from a full backup is quick because all the
data on a drive is saved in one location.
Pros
➢ Quick restore time
➢ Managing storage is simple because all the data is stored in one location
➢ File search is very simple.
Cons
➢ Requires the most storage space.
➢ Depending on their size, it takes a long time to back up files
➢ The need for additional storage space makes it the most expensive backup
method
➢ The risk of data loss is high since all the data is stored in one place
Incremental Backup
Incremental backup was introduced to save space and time compared to full data
copying. It only backs up what's changed since the last backup, making it more efficient.
However, recovering data might take longer because you need to use all the incremental
backups in the chain. And each increment depends on the previous one. If any of the
increments is corrupted, you may not be able to restore any data from that point onward.
Pros
➢ Very quick to backup
➢ Low storage requirements in comparison to full backups
Cons
➢ Restoring takes time because data has to be gathered from various backups.
➢ You can only recover successfully if all backup files are undamaged.
➢ Searching for files is difficult - you have to check multiple backup sets to find a
specific file.
Differential Backup
A differential backup copies all data that has changed since the initial full backup. Even if
there are no updates between backups, the same data is copied again. Restoring data
using the differential model is quick, requiring only two steps: recovering the initial full
copy and applying the last differential set.
Pros
➢ Restores faster than incremental backups since only data from the recent full
backup and the latest differential backup are required.
➢ Requires less storage space than full backups.
Cons
➢ Restoring is slower than full backups
➢ Takes up more storage space than incremental backups
When should you use differential backup?
Small and medium-sized organizations handling large volumes of valuable data, but
unable to do frequent backups, will benefit from using the differential backup method.
CD Technology:
CDs are optical discs used for data storage. CD-Rs can only be written to once, while
CD-RWs support multiple writes. They typically store 700MB of data. CDs are portable,
lightweight, and cost-effective for both small and large backups. Most modern PCs have
CD-R/RW drives.
However, if your computer doesn't have one, you'll need to invest in a CD-R/RW drive.
CD-RWs can suffer from human error and lose reliability over time due to degradation
from handling and environmental factors. Blu-ray discs, introduced in 2006, can store up
to 25GB of data but require a specific drive and software for burning data.
DVD Technology:
DVDs are optical discs used for data storage, like CDs, but they can store more data.
Single-layer DVDs can hold up to 4.7GB, while double-layer DVDs can hold up to 8.5 GB.
There are rewritable (DVD-RW) and recordable (DVD) formats available. DVDs are
portable and convenient for transporting data.
You should buy a DVD writing drive if your PC doesn't have one. DVD-RWs lose reliability
with each recording, so they're not ideal for long-term storage. Like CDs, DVDs can fail
due to outside factors and are easy to misplace because they're small and portable.
Online storage:
Businesses offer online data backup services, storing your data in their data centers. This
protects your data from disasters at your home or office and allows you to access it from
multiple locations. However, there are concerns about data security. If the company's
servers go down, you may lose access to your data, causing potential loss of time and
money. Hacking is also a risk, potentially leading to identity theft or exposure of trade
secrets. If the data storage company fails, you could lose your backup and have to start
over, and the monthly costs can be high.
So how is it work?
Grandfather: A monthly full backup is done once a month, such as on the last day of the
month, and the data is stored off-site or in the cloud.
Father: A weekly full backup is done once a week, for example, every Monday, and the
data is saved locally. You can choose any specific day that works for you.
Son: Perform a daily incremental or differential backup, done once a day or even more
frequently, like hourly, and save it locally.
Some backups are removed for safety or disaster recovery. The grandfather-father-son
backup method follows the first in, first out rule, deleting the oldest backups first to make
room for new ones.
RAID Levels
RAID is classified into different levels based on the performance and reliability
parameters. The following are the important RAID levels:
RAID level 0
In RAID 0, data is divided into blocks and written to all drives simultaneously, improving
I/O performance by using multiple disks together. It fully utilizes all disk storage and is
simple to set up.
However, RAID 0 lacks fault tolerance. If one drive fails, all data is lost because it's spread
across multiple drives.
It's great for tasks where speed matters most, and data loss isn't a concern.
RAID level 1
In RAID level 1, data is copied to another disk, known as mirroring. This offers fast reading
speeds and better fault tolerance since there's a backup. If one disk fails, the mirrored
disk is used.
However, RAID 1 needs double the resources. It's best for vital data storage, like
accounting systems, where security is crucial.
RAID level 10 (1+0)
This method blends RAID 0 and RAID 1 benefits into one system. It offers security by
copying all data to secondary drives while enhancing data transfer speeds through
striping across drive sets.
The main advantage of RAID 10(1+0) is that recovery from failures because of data
mirroring. However, like RAID 0, it sacrifices half of the storage capacity for mirroring.
RAID level 5
This level uses block-level striping with parity. Parity info is spread across drives, allowing
the array to function if one drive fails. It's faster than a single drive but slower than RAID
0. RAID 5 needs at least three disks, but using five is often advised for better performance.
RAID 5 isn't recommended for systems with lots of writing because writing parity data
slows it down. Rebuilding a RAID 5 array after a disk failure can also take a while.
RAID level 6
RAID 6 is like RAID 5, but with a difference: in RAID 6, parity data is written to two disks.
This means you need at least four disks for RAID 6. Plus, RAID 6 can handle two drive
failures at once.
While RAID 6 offers more security, it's slower in writing because of extra parity data. It's
chosen over RAID 5 for better fault tolerance. RAID 6 balances storage efficiency,
security, and performance well.
Recovery Techniques
Different Types of Data Recovery Systems
Organizations use different data recovery systems to retrieve lost data to choose the right
one for your company, evaluate your data and systems thoroughly first. Once you've
assessed your data, you can select the appropriate recovery method for your business
from various options available.
It also maintains a log of data changes since the last backup and uses two backup
methods. CDP systems are easy to install, and recovery time is quick if they have disk
storage. Many companies are adopting CDP because it continuously monitors and backs
up data updates onsite and offsite.