AWS Certified Data Engineer Associate DEA-C01 Practice Question

An IoT application ingests 50,000 1 KB JSON events per second from 100,000 sensors. Each event has a device_id, timestamp, and payload. Engineers must query the past 24 hours of data for a given device_id in single-digit milliseconds. Records older than 24 hours are exported nightly to Amazon S3 for analytics. Which storage solution best supports this hot dataset with minimal operational effort?

  • Use an Amazon RDS for MySQL database with Auto Scaling read replicas to handle the throughput.

  • Load the incoming data into an Amazon Redshift cluster and run SQL queries for each device lookup.

  • Write the events to Amazon S3 in hourly Parquet files and query them with Amazon Athena when a device lookup is needed.

  • Store the hot dataset in an Amazon DynamoDB table with device_id as the partition key and a sort key on timestamp.

AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot