AWS Storage and Data Management Flashcards
AWS Certified Data Engineer Associate DEA-C01 Flashcards

| Front | Back |
| --- | --- |
| How can you optimize Redshift performance | Use distribution keys, sort keys, and compression. |
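| How does Amazon S3 Glacier differ from S3 Standard | Glacier storage classes are designed for long-term, low-cost archival storage; Flexible Retrieval and Deep Archive trade that lower cost for retrieval delays of minutes to hours. |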
| How does Amazon Redshift handle complex queries | Parallel processing and columnar storage for high-speed analysis. |
| How does Amazon S3 Transfer Acceleration work | Speeds up long-distance transfers to and from a bucket by routing them through AWS edge locations and the AWS network. |
| How does DynamoDB achieve scalability | Uses partitioning and auto-scaling to handle large workloads seamlessly. |
| How does DynamoDB Streams help in real-time data processing | Captures a time-ordered log of item-level changes, retained for 24 hours, for downstream applications. |
| How does S3 Versioning help | Tracks changes to objects and maintains previous versions for recovery. |
| What are Reserved Instances in RDS | Discounted pricing in exchange for a one- or three-year RDS usage commitment. |
| What are S3 storage classes | Tiers designed for different access patterns, including Standard, Intelligent-Tiering, and Glacier. |
| What are the different backup options in RDS | Automated backups with point-in-time recovery, plus manual DB snapshots. |
| What does S3 Select do | Uses SQL expressions to retrieve only a subset of an object's data instead of the whole object. |
| What is Amazon Aurora | High-performance relational database compatible with MySQL and PostgreSQL. |
| What is Amazon DynamoDB | NoSQL database for low-latency applications with support for key-value and document data models. |
| What is Amazon Redshift | Cloud-based data warehouse optimized for large-scale analytical queries. |
| What is Amazon S3 | Scalable object storage for storing and retrieving any amount of data. |
| What is AWS DataSync | Service for seamless movement of data between on-premises storage and AWS. |
| What is Elastic Block Store (EBS) | Block storage for use with EC2 instances, optimized for performance and reliability. |
| What is Lifecycle Management in Amazon S3 | Automates transition and expiration of objects based on rules. |
| What is the difference between RDS Multi-AZ and Read Replicas | Multi-AZ provides failover, while Read Replicas improve read scalability. |
| What is the purpose of S3 Multipart Upload | Improves upload efficiency for large objects by breaking them into parts. |
| What is the Query operation in DynamoDB | Retrieves items that share a partition key value, optionally narrowed by a condition on the sort key. |
| What purpose does S3 Object Lock serve | Enforces data immutability for compliance or business needs. |
| What type of storage does Amazon RDS use | EBS volumes: General Purpose SSD, Provisioned IOPS SSD, and legacy magnetic storage. |
| When should you use an S3 bucket policy | To define access permissions for the entire bucket. |
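The Multipart Upload card says large objects are broken into parts. A minimal sketch of that splitting step follows, in plain Python with no AWS calls. The function name `plan_parts` and the 8 MiB default part size are assumptions made for illustration; the 5 MiB minimum part size and the 10,000-part ceiling are S3's documented limits.

```python
MIB = 1024 * 1024
MIN_PART_SIZE = 5 * MIB   # S3 minimum for every part except the last
MAX_PARTS = 10_000        # S3 maximum number of parts per upload


def plan_parts(object_size, part_size=8 * MIB):
    """Return (part_number, offset, length) tuples that cover the object.

    Hypothetical helper: it only plans byte ranges and uploads nothing.
    """
    if object_size <= 0:
        raise ValueError("object_size must be positive")
    if part_size < MIN_PART_SIZE:
        raise ValueError("part_size is below the 5 MiB minimum")

    parts = []
    offset = 0
    part_number = 1  # S3 part numbers start at 1
    while offset < object_size:
        # The last part may be shorter than part_size.
        length = min(part_size, object_size - offset)
        parts.append((part_number, offset, length))
        offset += length
        part_number += 1

    if len(parts) > MAX_PARTS:
        raise ValueError("too many parts; choose a larger part_size")
    return parts


# A 20 MiB object with 8 MiB parts yields two full parts and one 4 MiB part.
plan = plan_parts(20 * MIB)
```

Each planned range could then be uploaded independently and in parallel, which is where the efficiency gain for large objects comes from.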
About the Flashcards
Flashcards for the AWS Certified Data Engineer Associate exam help you review the AWS storage and database services most likely to appear on test day. Each card explains core features of Amazon S3, including storage classes, versioning, and Object Lock, so you can recall how different tiers, lifecycle rules, and security controls align with varying workloads.
The deck also covers DynamoDB partitioning, streams, and query operations; RDS, Aurora, and EBS fundamentals like Multi-AZ, read replicas, backups, and reserved instances; plus Redshift performance tuning, DataSync, and S3 Transfer Acceleration. Reviewing these terms reinforces architectural decisions around scalability, cost optimization, fault tolerance, and analytics demanded by the certification.
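As a rough illustration of the DynamoDB query operations mentioned above, the sketch below builds the request parameters for a Query in the format used by the low-level DynamoDB API, without contacting AWS. The helper name `build_query_params`, the table name `Orders`, and the attribute names `customer_id` and `order_date` are all hypothetical, and string-typed keys are assumed.

```python
def build_query_params(table, pk_name, pk_value, sk_name=None, sk_prefix=None):
    """Build Query parameters: partition key equality, optional sort key prefix.

    Hypothetical helper; assumes both keys are string ("S") attributes.
    """
    params = {
        "TableName": table,
        # Query requires an equality condition on the partition key.
        "KeyConditionExpression": "#pk = :pk",
        "ExpressionAttributeNames": {"#pk": pk_name},
        "ExpressionAttributeValues": {":pk": {"S": pk_value}},
    }
    if sk_name is not None and sk_prefix is not None:
        # The sort key condition is optional and narrows the result set.
        params["KeyConditionExpression"] += " AND begins_with(#sk, :sk)"
        params["ExpressionAttributeNames"]["#sk"] = sk_name
        params["ExpressionAttributeValues"][":sk"] = {"S": sk_prefix}
    return params


# All orders for one customer placed in 2024 (names are illustrative only).
params = build_query_params("Orders", "customer_id", "C123", "order_date", "2024-")
```

With boto3 installed and credentials configured, a dictionary like this could be passed as `client.query(**params)`; that call is not made here.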
Topics covered in this flashcard deck:
- Amazon S3 storage classes
- S3 policies & lifecycle
- DynamoDB scaling features
- RDS and Aurora management
- Redshift performance tuning
- DataSync and S3 Transfer Acceleration
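For the S3 lifecycle topic, here is a minimal sketch of a lifecycle configuration that transitions objects to cheaper storage classes and then expires them. It only builds the configuration dictionary in the shape accepted by the S3 API. The helper name `build_lifecycle_config`, the rule ID, the `logs/` prefix, and the day counts are assumptions chosen for illustration.

```python
def build_lifecycle_config(prefix, ia_days, glacier_days, expire_days):
    """Build a one-rule lifecycle configuration (hypothetical helper).

    Objects under `prefix` move to Standard-IA, then to Glacier,
    and are finally deleted after `expire_days`.
    """
    if not ia_days < glacier_days < expire_days:
        raise ValueError("expected ia_days < glacier_days < expire_days")
    return {
        "Rules": [
            {
                "ID": "archive-then-expire",  # illustrative rule name
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [
                    {"Days": ia_days, "StorageClass": "STANDARD_IA"},
                    {"Days": glacier_days, "StorageClass": "GLACIER"},
                ],
                "Expiration": {"Days": expire_days},
            }
        ]
    }


config = build_lifecycle_config("logs/", ia_days=30, glacier_days=90, expire_days=365)

# With boto3 and credentials available, the dictionary could be applied with:
#   s3.put_bucket_lifecycle_configuration(
#       Bucket="example-bucket",  # placeholder bucket name
#       LifecycleConfiguration=config,
#   )
```

Keeping the rule as plain data makes it easy to review and version before it is applied to a bucket.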