High Availability and Fault Tolerance Flashcards
AWS Certified CloudOps Engineer Associate SOA-C03 Flashcards

| Front | Back |
| Difference between RTO and RPO | Recovery Time Objective focuses on downtime while Recovery Point Objective focuses on data loss tolerance. |
| How do EC2 status checks contribute to fault tolerance | By monitoring instance health and automatically recovering unhealthy instances. |
| How do Route 53 health checks support High Availability | By monitoring resources and routing traffic away from failed endpoints. |
| How does Amazon DynamoDB achieve high availability | By replicating data across multiple zones and offering eventually consistent reads. |
| How does Amazon SQS achieve fault tolerance | Queue messages are replicated and durable across multiple Availability Zones. |
| How does Elastic Load Balancing support High Availability | By distributing incoming traffic across multiple targets in multiple Availability Zones. |
| How does S3 cross-region replication help in disaster recovery | By automatically copying objects across regions for redundancy. |
| Key benefit of Elastic File System (EFS) for fault tolerance | EFS provides scalable, multi-AZ file storage that persists even during failures. |
| Key difference between Multi-AZ and Read Replica in RDS | Multi-AZ enhances availability; Read Replica is primarily for scaling workloads. |
| Use of AWS Lambda for Fault Tolerance | Lambda's automatic scaling and multi-AZ availability ensure high fault tolerance for applications. |
| What does AWS Auto Scaling do | Automatically adjusts capacity to maintain consistent performance and availability. |
| What does AWS CloudFormation ensure in high availability setups | Automated resource creation and updates across multiple regions/zones. |
| What is a quorum-based failover in databases | A failover model requiring a majority vote from nodes to confirm decision-making. |
| What is AWS Global Accelerator | A service that improves application availability through intelligent routing and global endpoints. |
| What is Fault Tolerance | The ability of a system to continue functioning even when one or more components fail. |
| What is High Availability | The ability of a system to remain operational and accessible for maximum time with minimal downtime. |
| What is Multi-AZ in AWS RDS | A deployment configuration that provides automatic failover to a secondary database in a different availability zone. |
| What is Pilot Light Disaster Recovery | A minimal backup environment with essential services ready to scale when needed. |
| What is the principle of a hot standby for disaster recovery | Maintaining a fully operational secondary environment that can take over instantly. |
| Why use AWS Backup | To centralize and automate backup for resources across various AWS services ensuring data resilience. |
About the Flashcards
Flashcards for the AWS Certified CloudOps Engineer Associate exam offer a focused refresher on cloud resilience-covering high availability principles, fault-tolerant design, and the metrics that drive recovery planning. Review how services such as Elastic Load Balancing, Auto Scaling, RDS Multi-AZ, DynamoDB, Route 53, and Global Accelerator work together to minimize downtime and data loss.
The deck drills key disaster-recovery patterns like pilot light, hot standby, and cross-region replication while reinforcing critical comparisons-RTO vs RPO, Multi-AZ vs Read Replica, health checks, and quorum-based failover. Concise cards help you memorize definitions, identify exam-style scenarios, and apply AWS automation tools that ensure resilient, scalable applications.
Topics covered in this flashcard deck:
- High availability concepts
- Fault tolerance strategies
- Disaster recovery patterns
- AWS scalability services
- RPO and RTO metrics
- Multi-AZ & cross-region designs