AWS Certified Data Engineer Associate DEA-C01 Practice Question
An analytics team receives hourly CSV files from external vendors. When a file lands in an S3 bucket, it must be validated, transformed with AWS Glue, and loaded into Amazon Redshift. The solution must be serverless, event-driven, include retry logic, and minimize operational overhead. Which architecture best meets these requirements?
Create a CloudWatch Events scheduled rule that runs every 5 minutes and invokes a Lambda function. The function lists recently added objects, kicks off an AWS Batch job to transform the data, and then loads the results into Redshift.
Deploy Apache Airflow on an EC2 Auto Scaling group and build a DAG that polls the S3 bucket every minute, then starts a Glue job and a Redshift COPY task.
Configure an S3 Event Notification to deliver ObjectCreated events to EventBridge, which triggers a Step Functions state machine. The state machine runs a Glue job for transformation, then uses the Redshift Data API to issue a COPY command. Step Functions built-in retries handle transient failures.
Set up Kinesis Data Firehose with the S3 bucket as the data source, enable transformation with a Lambda function, and configure the delivery stream to load directly into Amazon Redshift.
Sending S3 ObjectCreated events to EventBridge and using the event to start a Step Functions state machine provides a fully serverless, event-driven workflow. Step Functions can invoke a Glue job, wait for completion, and then use the Redshift Data API to run a COPY command. Built-in retry and error-handling policies satisfy the resiliency requirement with no servers to manage.
Airflow on EC2 introduces cluster management and relies on polling, not events. A CloudWatch scheduled rule is time-based rather than event-driven and requires custom logic for retries. Kinesis Data Firehose cannot accept S3 objects as a source, so it cannot react to new S3 files. Therefore, only the Step Functions approach aligns with the constraints.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is EventBridge and how does it help in serverless workflows?
Open an interactive chat with Bash
How does Step Functions handle retries and error handling in workflows?
Open an interactive chat with Bash
What is the role of the Redshift Data API in this architecture?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .