AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineering team ingests daily CSV files to an S3 landing bucket. A Step Functions workflow then triggers an AWS Glue ETL job that writes the data to Parquet. The team must first ensure that the order_id and total_amount columns contain no NULL values. If validation fails, the workflow should stop and alert the on-call engineer. With minimal code changes, which solution meets these requirements MOST effectively?

  • Add an Amazon Athena task to the workflow that runs a SELECT COUNT(*) query for NULL values in the two columns and branches execution based on the query result.

  • Create an AWS Glue DataBrew profile job with a ruleset that checks for missing values in the two columns, invoke it as the first Step Functions task, and use EventBridge to send an SNS alert if the job status is FAILED.

  • Enable Amazon Macie on the landing bucket and configure findings filters; stop the workflow and alert the team when Macie generates findings.

  • Load each file into an Amazon Redshift staging table with NOT NULL constraints on the two columns; proceed only if the COPY command succeeds, otherwise publish an SNS alert.

AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot