AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineer built an AWS Step Functions workflow that submits a Spark step to an Amazon EMR cluster to process yesterday's click-stream logs. About once a week the step fails because of transient network errors. The pipeline must retry automatically and page the on-call team only if the job still fails after several attempts. Which Step Functions design meets these needs with minimal operational effort?

  • Place the EMR step in a Parallel state with two identical branches so that at least one branch succeeds if the other encounters a transient failure.

  • Configure the EMR step task with a Retry policy that uses exponential back-off and a maximum of three attempts, and add a Catch path that publishes an SNS notification if all retries fail.

  • Use Amazon EventBridge to schedule a second execution of the entire state machine 30 minutes later; if the first run fails, the second run will process the same data.

  • Increase the number of core nodes on the EMR cluster and disable task timeouts to reduce the chance of transient failures interrupting the Spark step.

AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot