Cover image for Blueprints for high availability : designing resilient distributed systems
Blueprints for high availability : designing resilient distributed systems
Marcus, Evan, 1962-
Personal Author:
Publication Information:
New York : John Wiley & Sons, [2000]

Physical Description:
xxii, 344 pages : illustrations ; 25 cm
General Note:
Includes index.
Added Author:
Format :


Call Number
Material Type
Home Location
Item Holds
QA76.9.D5 M357 2000 Adult Non-Fiction Central Closed Stacks

On Order



"Rely on this book for information on the technologies and methods you'll need to design and implement high-availability systems...It will help you transform the vision of always-on networks into a reality."-Dr. Eric Schmidt, Chairman and CEO, Novell Corporation
Your system will crash! The reason could be something as complex as network congestion or something as mundane as an operating system fault. The good news is that there are steps you can take to maximize your system availability and prevent serious downtime. This authoritative book will provide you with the tools to deploy a system with confidence. The authors guide you through the building of a network that runs with high availability, resiliency, and predictability. They clearly show you how to assess the elements of a system that can fail, select the appropriate level of reliability, and provide steps for designing, implementing, and testing your solution to reduce downtime to a minimum. All the while, they help you determine how much you can afford to spend by balancing costs and benefits. This book of practical, hands-on blueprints:
* Examines what can go wrong with the various components of your system
* Provides twenty key system design principles for attaining resilience and high availability
* Discusses how to arrange disks and disk arrays for protection against hardware failures
* Looks at failovers, the software that manages them, and sorts through the myriad of different failover configurations
* Provides techniques for improving network reliability and redundancy
* Reviews techniques for replicating data and applications to other systems across a network
* Offers guidance on application recovery
* Examines Disaster Recovery

Author Notes

EVAN MARCUS is a Senior Systems Engineer at VERITAS Software Corporation and co-designed a key piece of the first commercial Sun-based software for High Availability. He has been the company's consultant for successful implementations of VERITAS High Availability Products around the world.

HAL STERN is a Distinguished Systems Engineer at Sun Microsystems. He has led reliability and improvement teams for several financial services clients and focuses on performance, reliability, and networked system architecture. He is also the author of Managing NFS and NIS.

Table of Contents

What Is Resiliency?
Twenty Key System Design Principles
Highly Available Data Management
Redundant Server Design
Failover Management
Failover Configurations and Issues
Redundant Network Services
Data Service Reliability
Replication Techniques
Application Recovery
Backups and Restores
System Operations
Disaster Recovery
Parting Shot