Reliability - Change Enablement in the Cloud

Reliability

Change implementation can have a direct impact on the availability of workloads and the ability to recover from major incidents or disasters. Change automation is foremost in maximizing application availability. If you have any manual processes, you lose critical time awaiting those manual actions. Theoretically, the smaller in size a change is, the lower the potential impact of that change on the business.

Use deployment patterns that reduce risk, such as blue-green or canary deployments. Perform comprehensive testing in pipelines, including load, performance under load, and resiliency testing. Effective monitoring of the key performance indicators (KPIs) is a requirement, and automated rollback should be initiated if those KPIs indicate thresholds are likely to be exceeded.

Testing disaster recovery thoroughly helps you meet recovery objectives. Use automation to backup data. Regularly restore and recover to validate your recovery process and procedures.

These considerations improve the reliability of workloads and decrease business risk. Cloud change enablement practices should reflect this reduction in risk, and organizations should consider that because the risk is minimized, and reversible, they can be processed as standard changes.