Archetypes for Reliable Systems
May 01, 2023
31 min
Free
reliability
kubernetes
cloud-architecture
distributed-systems
service-level-objectives
site-reliability-engineering
platform-engineering
multi-cluster
failure-domains
availability
ci-cd
slo
Description
This presentation introduces a model and an implementation for designing and running cloud-based internet services at different levels of reliability. It builds on "Deployment Archetypes for Cloud Applications" to help customers define their reliability needs (availability, failure-domain resilience, and recovery time and recovery point objectives, RTO/RPO), and then provides a Kubernetes-based deployment strategy that meets them. The approach supports multi-tenant, multi-application, and multi-cluster deployments with integrated CI/CD, micro-segmentation, policy management, traffic routing, SLOs, and monitoring. Application teams own their services, while infrastructure teams roll out updates without service interruption.
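As a rough illustration of what defining an availability need means in practice, the sketch below converts an availability target into the downtime it permits over a time window. The function name, the 30-day default window, and the sample targets are assumptions made for this example; they are not taken from the presentation.

```python
def allowed_downtime_minutes(availability_pct: float, window_days: int = 30) -> float:
    """Return the downtime budget, in minutes, implied by an availability target.

    availability_pct is a percentage such as 99.9; window_days is the length
    of the measurement window (30 days is an assumed default).
    """
    if not 0 <= availability_pct <= 100:
        raise ValueError("availability_pct must be between 0 and 100")
    window_minutes = window_days * 24 * 60
    # The budget is the fraction of the window the service may be unavailable.
    return window_minutes * (1 - availability_pct / 100)


if __name__ == "__main__":
    # Hypothetical targets, chosen only to show how the budget shrinks.
    for target in (99.0, 99.9, 99.99):
        budget = allowed_downtime_minutes(target)
        print(f"{target}% over 30 days allows {budget:.2f} minutes of downtime")
```

Under these assumptions, a 99.9% target over 30 days leaves roughly 43 minutes of downtime, and each additional nine cuts that budget by a factor of ten, which is why the choice of target tends to drive the choice of deployment archetype.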