-
Redundancy Planning
-
Redundant Power
-
Undesirable Power Conditions
-
Surge:
- An unexpected increase in the amount of voltage provided.
-
Spike:
- A short transient in voltage that can be due to a short circuit, tripped circuit breaker, power outage, or lightning strike.
-
Sag:
- An unexpected decrease in the amount of voltage provided.
-
Brownout:
- When the voltage drops to such an extent that it typically causes the lights to dim and causes computers to shut off.
-
Blackout:
- When a total loss of power for a prolonged period occurs.
-
Redundant Power Supplies
- An enclosure that contains two complete power supplies, the second of which turns on when the first fails.
-
Uninterruptible Power Supplies
- Takes the functionality of a surge suppressor and combines that with a battery backup, protecting computers not only from surges and spikes, but also from sags, brownouts, and blackouts.
-
Generators
-
Backup Generator:
- Part of an emergency power system used when there is an outage of regular electric grid power.
-
Standby Generator:
- Systems that turn on automatically within seconds of a power outage.
-
Redundant Data
-
Disk Duplexing:
- When each disk is connected to a separate controller.
- RAID
-
Redundant Networking
-
Redundant ISP:
- Secondary connections to another ISP; for example, a backup T-1 line.
-
Redundant Servers
-
Cluster:
- Two or more servers that work with each other.
-
Redundant Sites
-
Hot Site
- Offsite backups that may replace the current active site in case of an adverse security incident
- Most expensive
-
Cold Site
- Offsite backups in storage that cannot immediately be brought online in case of a security incident
- Least expensive
-
Warm Site
- An offsite location containing the offline hardware that would be required to replace the current active site. Data needed to replace the current active site is not immediately available, but may be accessed.
- Middle ground in terms of cost
-
Redundant People
- Separation of Duties
- Job Rotation
-
Failure and Recovery Concepts
-
Single Point of Failure
- An element, object, or part of a system that, if it fails, will cause the whole system to fail.
-
Mean Time Between Failures (MTBF)
- The average number of failures per million hours of operation for a product, based on historical baselines. It can be very helpful when making quantitative assessments.
-
Mean Time to Repair (MTTR)
- The time needed to repair a failed device
-
Mean Time to Failure (MTTF)
- A basic measure of reliability for devices that cannot be repaired.
-
Disaster Recovery
- Mission Essential Functions
- Identification of Critical Systems
-
Disaster Recovery Planning
-
Disaster Recovery Plans (DRPs):
- A plan that details the policies and procedures concerning the recovery and/or continuation of an organization's technology infrastructure.
-
Business Impact Analysis:
- The examination of critical versus noncritical functions; part of a business continuity plan (BCP).
-
Recovery Time Objective (RTO):
- In business impact analysis, the acceptable amount of time to restore a function.
-
Recovery Point Objective (RPO):
- In business impact analysis, the acceptable latency of data.