AWS Certified Data Engineer Associate DEA-C01 Practice Question
An online retailer ingests 5,000 click-stream events per second into an Amazon Kinesis Data Stream. An AWS Lambda function enriches each event and loads it into Amazon Redshift. A parser bug corrupted the last 7 days of data. After fixing the code, the team must reload the last 7 days while new events keep arriving and maintain a replayable source of truth at low cost. Which solution meets these requirements?
Enable DynamoDB Streams on the Amazon Redshift table and use AWS Database Migration Service (AWS DMS) to reload the table with corrected data from the stream.
Add a Kinesis Data Firehose delivery stream that receives the same input and stores all raw events in an Amazon S3 bucket with a lifecycle policy, then run an AWS Glue job to reload the past seven days into Amazon Redshift after the fix is deployed.
Increase the Kinesis data stream retention period to seven days, create a consumer that starts with an AT_TIMESTAMP iterator at seven days ago, and invoke the updated Lambda until the iterator reaches the present time.
Turn on audit logging for Amazon Redshift, export the logs for the last seven days to Amazon S3, correct the records, and reload them into Amazon Redshift with the COPY command.
Writing the raw events to Amazon S3 through a Kinesis Data Firehose delivery stream creates a durable, low-cost archive that is independent of the processing consumer. The archive can be kept permanently or rolled off with an S3 lifecycle rule. After the bug is fixed, the team can run a one-time AWS Glue (or other) job that reads the historical objects in S3 and reloads them into Amazon Redshift without disrupting the live Lambda consumer. Extending Kinesis retention and manually replaying data adds shard-hour costs and custom consumer logic. DynamoDB Streams cannot be enabled on a Redshift table, and Redshift audit logs lack the raw click-stream records needed for reprocessing, so those options will not restore the missing data.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is Amazon Kinesis Data Firehose?
Open an interactive chat with Bash
How does AWS Glue integrate with Amazon Redshift?
Open an interactive chat with Bash
What is an S3 lifecycle policy and how does it work?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99 $11.99
$11.99/mo
Billed monthly, Cancel any time.
$19.99 after promotion ends
3 Month Pass
$44.99 $26.99
$8.99/mo
One time purchase of $26.99, Does not auto-renew.
$44.99 after promotion ends
Save $18!
MOST POPULAR
Annual Pass
$119.99 $71.99
$5.99/mo
One time purchase of $71.99, Does not auto-renew.
$119.99 after promotion ends
Save $48!
BEST DEAL
Lifetime Pass
$189.99 $113.99
One time purchase, Good for life.
Save $76!
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .