AWS Certified Data Engineer Associate DEA-C01 Practice Question

Hourly sales data lands as multiple CSV files under s3://sales/raw/YYYY/MM/DD/HH/. A _SUCCESS flag finishes the batch. The team must launch an AWS Glue job only after the flag arrives, retry transient errors with exponential backoff, and send one SNS alert if all retries fail, using as little custom code as possible. Which architecture satisfies these needs?

  • Deploy Amazon Managed Workflows for Apache Airflow and create a DAG that polls the S3 prefix every minute, runs the Glue job when files are complete, and includes Python-based retries and SNS alerts.

  • Create an Amazon EventBridge rule that matches the _SUCCESS object, then invoke an AWS Step Functions state machine that uses a Glue StartJobRun task with Retry and a Catch path that publishes to SNS.

  • Configure an S3 PUT event to invoke a Lambda function for every file. The function tracks files in DynamoDB and starts the Glue job when all files are present, implementing retries and SNS publishing in code.

  • Schedule the Glue job with an hourly Glue trigger, enable job-level retries, and use a CloudWatch alarm on job failures to notify SNS.

AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot