AWS Certified Data Engineer Associate DEA-C01 Practice Question
A workload must ingest 20 MB/s of 20 KB JSON messages produced by thousands of IoT devices and make each record available to a downstream analytics application within a few hundred milliseconds. Which solution meets the throughput and latency requirements in the most cost-effective way?
Publish the events to an Amazon EventBridge bus and have a rule invoke the analytics application for each event.
Send the messages to an Amazon Kinesis Data Streams stream sized with at least 20 shards, then have the analytics application consume from the stream.
Buffer records on each device and write multipart objects directly to an Amazon S3 bucket, then trigger processing with S3 event notifications.
Send the data to an Amazon Kinesis Data Firehose delivery stream with default buffering and deliver it to the analytics application.
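Amazon Kinesis Data Streams is designed for high-throughput, low-latency streaming ingestion. Each shard accepts writes of up to 1 MB/s or 1,000 records per second, whichever limit is reached first. At 20 KB per message, 20 MB/s is roughly 1,000 records per second across the whole stream, so the byte limit governs and 20 shards supply the required 20 MB/s of write capacity. Records are typically readable by consumers in less than one second; AWS documents average propagation delays of about 200 ms for a single shared-throughput consumer and about 70 ms with enhanced fan-out, which fits the requirement of a few hundred milliseconds.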
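The shard count can be checked with a small calculation. The sketch below is illustrative only: the function name `required_shards` is hypothetical, the per-shard limits are the 1 MB/s and 1,000 records/s write limits quoted in the explanation, and decimal units (1 MB = 1,000 KB) are assumed for simplicity.

```python
import math


def required_shards(throughput_mb_s, record_kb,
                    shard_mb_s=1.0, shard_records_s=1000):
    """Minimum provisioned shards for a given write workload.

    Assumes the per-shard write limits quoted in the explanation
    (1 MB/s and 1,000 records/s) and decimal units (1 MB = 1,000 KB).
    """
    # Records per second implied by the byte rate and the record size.
    records_per_s = throughput_mb_s * 1000 / record_kb
    # Shards needed to satisfy each limit independently.
    by_bytes = math.ceil(throughput_mb_s / shard_mb_s)
    by_records = math.ceil(records_per_s / shard_records_s)
    # The stricter of the two limits determines the shard count.
    return max(by_bytes, by_records)


if __name__ == "__main__":
    # Workload from the question: 20 MB/s of 20 KB messages.
    print(required_shards(20, 20))
```

For this workload the byte limit dominates: about 1,000 records/s spread over 20 shards is only about 50 records/s per shard, far below the per-shard record limit. A workload of very small records could instead be bound by the record-count limit.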
Amazon Data Firehose (formerly Kinesis Data Firehose) adds a delivery buffer: 300 s by default for S3 and 0-60 s for most other destinations. Even with the newer zero-buffering mode, AWS states that most deliveries occur within about five seconds, so Firehose cannot guarantee availability within a few hundred milliseconds.
Writing individual objects directly to Amazon S3 and reacting to S3 event notifications also misses the latency target, because notifications are usually delivered within seconds and can occasionally take a minute or more.
Amazon EventBridge charges $1 per million events and has default PutEvents rate limits of 600-10,000 TPS (Region-dependent). At roughly 1,000 events per second, the per-event charge adds up to significantly more than a provisioned Kinesis stream sized for this workload, and in some Regions the default quota would have to be raised first.
For these reasons, a properly sized Kinesis Data Streams solution is the most cost-effective way to meet both the throughput and latency requirements.
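To make the cost argument concrete, here is a rough monthly estimate. Only the $1 per million events figure for EventBridge comes from the explanation. The Kinesis prices (0.015 USD per shard-hour and 0.014 USD per million 25 KB PUT payload units) are assumptions based on typical us-east-1 list prices and should be checked against current AWS pricing. The function names are hypothetical, and consumer-side charges, data transfer, and extended retention are ignored.

```python
import math

SECONDS_PER_MONTH = 30 * 24 * 3600  # 30-day month
HOURS_PER_MONTH = 30 * 24


def eventbridge_monthly_cost(events_per_s, price_per_million=1.00):
    """Custom-event publishing cost, at the $1 per million rate from the text."""
    events = events_per_s * SECONDS_PER_MONTH
    return events / 1_000_000 * price_per_million


def kinesis_provisioned_monthly_cost(shards, records_per_s, record_kb,
                                     shard_hour_price=0.015,
                                     put_unit_price_per_million=0.014):
    """Provisioned-mode cost: shard hours plus PUT payload units.

    Both prices are ASSUMED example values, not taken from the source.
    A PUT payload unit is assumed to cover up to 25 KB of a record.
    """
    units_per_record = math.ceil(record_kb / 25)
    put_units = records_per_s * units_per_record * SECONDS_PER_MONTH
    shard_cost = shards * HOURS_PER_MONTH * shard_hour_price
    put_cost = put_units / 1_000_000 * put_unit_price_per_million
    return shard_cost + put_cost


if __name__ == "__main__":
    # Workload from the question: about 1,000 messages/s of 20 KB each.
    eb = eventbridge_monthly_cost(1000)
    kds = kinesis_provisioned_monthly_cost(20, 1000, 20)
    print(f"EventBridge: ~${eb:,.0f}/month")
    print(f"Kinesis Data Streams (20 shards): ~${kds:,.0f}/month")
```

Under these assumptions the EventBridge bill comes to about $2,592 per month against about $252 for the provisioned stream, roughly a tenfold difference. The exact ratio depends on Region and current prices.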