AWS Certified Data Engineer Associate DEA-C01 Practice Question

A company ingests a daily CSV of ecommerce transactions into Amazon S3. The data engineering team must fail the downstream ETL pipeline if more than 1% of rows have a null customer_id or if any order_date is before 2020-01-01. They want to define these checks once and reuse them on future datasets without coding. Which AWS feature should they use?

  • Create an AWS Glue DataBrew data quality ruleset and attach it to a DataBrew profile job.

  • Invoke an AWS Lambda function from Amazon EventBridge to scan the file and throw an error when invalid rows are found.

  • Configure an AWS Glue crawler with custom classifiers to reject files that do not meet the conditions.

  • Run a scheduled Amazon Athena query that selects rows violating the conditions and check if the result set is empty.

AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot