Definitions - Reliability Pillar

Definitions

This whitepaper covers reliability in the cloud, describing best practice for these four areas:

  • Foundations

  • Workload Architecture

  • Change Management

  • Failure Management

To achieve reliability you must start with the foundations—an environment where service quotas and network topology accommodate the workload. The workload architecture of the distributed system must be designed to prevent and mitigate failures. The workload must handle changes in demand or requirements, and it must be designed to detect failure and automatically heal itself.