AWS Certified Data Engineer Associate DEA-C01 Practice Question
An e-commerce application writes clickstream events to an Amazon Kinesis Data Streams stream with 5 shards and an average ingest rate of 4 MB/s. A data engineering team must build a near-real-time ETL pipeline that enriches each record and delivers results to Amazon S3. Processing latency must stay below 1 second, and other existing consumers must remain unaffected. Which solution meets these requirements?
Create an Amazon Kinesis Data Firehose delivery stream with the existing Kinesis Data Streams stream as the source, enable data transformation with AWS Lambda, and deliver the data to Amazon S3.
Configure an AWS Glue streaming job that reads from the Kinesis Data Streams stream using Enhanced Fan-Out and writes enriched records to Amazon S3 in Apache Parquet format.
Enable Amazon DynamoDB Streams on a table, process the stream with an AWS Glue job, and store the enriched data in Amazon S3.
Develop a Java application on Amazon EC2 that continuously polls the stream with the GetRecords API, enriches each record, and uploads the output files to Amazon S3.
An AWS Glue streaming job can act as a registered Enhanced Fan-Out (EFO) consumer on the Kinesis Data Streams stream. EFO allocates each consumer a dedicated outbound throughput of up to 2 MB/s per shard with sub-second latency, so the Glue job will not share the 2 MB/s per-shard limit imposed on standard GetRecords calls and therefore avoids throttling the other consumers. Glue's streaming job natively supports EFO, allows in-stream enrichment, and writes directly to Amazon S3, satisfying latency and isolation requirements.
Kinesis Data Firehose does not guarantee sub-second latency and offers only limited, Lambda-based transformation, making it unsuitable for complex enrichment needs. A custom polling application that uses GetRecords would contend with existing consumers for the same 2 MB/s per-shard limit, risking throttling and increased latency. DynamoDB Streams is unrelated to the Kinesis stream and would not ingest the clickstream data at all.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is Enhanced Fan-Out (EFO) in Kinesis Data Streams?
Open an interactive chat with Bash
How does AWS Glue streaming jobs interact with Kinesis Data Streams?
Open an interactive chat with Bash
Why is Kinesis Data Firehose unsuitable for sub-second latency requirements?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .