AWS Certified Data Engineer Associate DEA-C01 Practice Question
An e-commerce company collects user activity as JSON events in Amazon Kinesis Data Streams. Each record must be enriched with reference data and written to Amazon S3 exactly once, even after job restarts. Which solution implements stateful transactions while requiring the least custom code?
Configure Kinesis Data Firehose with a Lambda transformation to enrich the data and deliver it to S3.
Send the events to an Amazon MSK topic and use a Lambda function running as a container image to poll, enrich, and write the data to S3.
Invoke an AWS Lambda function from the Kinesis stream with a batch of records; have the function enrich each record and upload it to S3.
Create an AWS Glue streaming ETL job that consumes the Kinesis stream, enables Spark checkpointing in Amazon S3, enriches the events, and writes the results to S3.
AWS Glue streaming ETL jobs run on Apache Spark Structured Streaming, which checkpoints offsets and operator state in Amazon S3. If the job restarts, it resumes from the last committed checkpoint, ensuring each record is processed exactly once. Lambda functions that are triggered by Kinesis, Lambda transformations in Kinesis Data Firehose, and Lambda consumers of Amazon MSK are stateless; they receive batches at least once and must implement their own idempotency logic to avoid duplicates. Therefore, the Glue streaming job is the only option that provides built-in state management with minimal additional code.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is Spark checkpointing in AWS Glue?
Open an interactive chat with Bash
Why are AWS Lambda functions considered stateless for streaming data processing?
Open an interactive chat with Bash
How does AWS Glue manage exactly-once processing?
Open an interactive chat with Bash
What is Apache Spark Structured Streaming and how does it work?
Open an interactive chat with Bash
What is Spark checkpointing and why is it important?
Open an interactive chat with Bash
Why doesn't AWS Lambda provide built-in state management for stream processing?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .