SRM Mod 1 DR Intro
SRM Mod 1 DR Intro
Recovery
Module 1
1-1
Course Map
SRM
Foundations
SRM Installation
and Configuration
Introduction to
Disaster Recovery
Array
Managers
SRM Operations
SRM Alarms
and Site Status
Troubleshooting
SRM Overview
and Architecture
Inventory
Mappings
SRM Planning
SRM Installation
and Configuration
SRM Installation
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
Protection
Groups
Recovery
Plans
1-2
VDM
Load-Balancing
SRM
Testing
and Multi-Server
and Failover
Failover Testing
and Failover
Failback
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-3
Lesson Topics
What Is a Disaster?
What Is Disaster Recovery?
Risk Assessment
Recovery Sites and Data Replication
Remote Site Separation
Physical Disaster Recovery Process
Complications of Traditional Recovery
Recovery Point Objective (RPO)
Recovery Time Objective (RTO)
Business Continuity
Organizational Impact
Challenges of Disaster Recovery
Service Disruptions
Regulatory Compliance
Disaster Recovery Planning and BCP
Failover and Failback
Disaster Recovery Planning
Runbooks
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-4
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-5
What Is a Disaster?
Complete loss of a datacenter
Often caused by a natural disaster
Loss might include destruction of the facility, or it might just
render it unusable for a significant amount of time.
Declaration of a disaster usually requires consensus from
multiple parts of the organization (at the CEO/CFO level).
What is not a disaster?
Failure of an individual system
A temporary service interruption
Corporate
Data Center
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-6
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-7
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-8
Risk Assessment
Most businesses are at risk for at least one of the following:
Acts of nature
Fire
Wildfire
Earthquake or volcanic eruption
Tornado
Hurricane
Flooding and water damage
Man-made disasters
Acts of terrorism
Accidents and mistakes
1-9
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-10
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-11
Recovery
Production
WAN
Hardware configuration
System disk
Application installation
Application data
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
Transfer data to
the recovery site.
Backup tapes
CDs/DVDs
Images (e.g., Ghost)
Replication
1-12
OS recovery
Configuration
Data recovery
Testing
Recovery
Tier
RPO
RTO
Cost
Immediate
Immediate
$$$
24+ hrs
48+ hrs
$$
7+ days
5+ days
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-13
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-14
RTO includes:
Fault detection
Recovering data
Bringing applications back online
1-15
CEO / CFO
Business continuity
High availability
Department /
IT
Department/IT
IT
Reactive procedures
Disaster recovery
Backups
Disaster
Recovery
Business
Continuity
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-16
Reduce risk
Control cost
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-17
Service Disruptions
DR plans could be used during service disruptions.
Planned
Maintenance
Shared resource contention
Unplanned
Application-level failure
APPLICATION
Hardware-level failure
Datacenter-level or site-level failure
Natural disaster
HARDWARE
DATACENTER
GEOGRAPHIC
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-18
Regulatory Compliance
What compliance guidelines control your business?
Recovery time objectives (RTOs)
Recovery point objectives (RPOs)
Manual vs. automatic
Failback requirements
Security and access controls
Technologies to use
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-19
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-20
Failback
Businesses usually cannot run on the recovery site forever.
Failback must be done in an orderly manner to prevent further service
disruptions.
Failback might be even harder than failover.
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-21
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-22
1-23
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-24
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-25
Runbooks
A runbook is a specific set of step-by-step procedures to
guide an operator or a system administrator in how to do
the following:
Rebuild a server starting with the operating system
Restore key user account information (user IDs and passwords)
Recreate infrastructure components such as LDAP directory-based
organizational units (OUs), groups, folders, trees, security rights, and
privileges
Reload key application software
Reconfigure the application software
Reload the application data (often by recovering from backup media)
Reopen the system for end users to return to work
1-26
Create Runbooks
Create a runbook for each identified asset.
Order the runbooks by priority so that they will be executed
in the correct order.
Order and priority are based on RPOs and RTOs.
Make sure that you account for system, department, and
function interdependencies when you plan the runbook
order.
1 Infrastructure:
DNS
DHCP
AD
2 Production:
Manufacturing
control
Process control
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
3 Customer-facing:
Web site
Order center
Help desk
1-27
4 Financials:
Payroll
Accounting
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-28
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-29
Module Summary
Disaster recovery is the process of successfully
developing, testing, and implementing disaster recovery
plans.
A disaster recovery plan contains procedures to implement
during and immediately following a disaster.
Identify and explain the following terms and concepts:
Disaster recovery is not a product.
Risk assessment
Loss criteria
Remote sites
MTD
Runbooks
Testing
VMwareSiteRecoveryManagerRevA
Copyright2008VMware,Inc.Allrightsreserved.
1-30
Questions?
1-31