AWS Certified Data Engineer Associate DEA-C01 Practice Question

An e-commerce company lands daily CSV order files in Amazon S3, and an AWS Glue Spark job loads the data into Amazon Redshift. Each record must contain a non-null customer_id and an order_total greater than 0. If more than 0.5% of rows break either rule, the pipeline must halt and alert operations; otherwise the load continues. What is the most efficient way to add this validation with minimal new code?

  • Insert an AWS Glue Data Quality transform with a ruleset that stops the job when more than 0.5% of rows fail completeness and custom checks (see the sketch after these options).

  • Create CHECK constraints on the Redshift target table so the COPY command rejects any rows with null customer_id or non-positive order_total.

  • Run an AWS Step Functions workflow that executes an Athena query after loading to count invalid rows and rolls back the transaction if the 0.5% limit is exceeded.

  • Add a DataBrew profile job to scan the CSV files before every Glue run and trigger the Glue job only if the profile shows fewer than 0.5% invalid rows.
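For reference, the Glue Data Quality approach in the first option needs only a short DQDL ruleset plus the EvaluateDataQuality transform inside the existing Glue Spark job. The sketch below assumes AWS Glue 4.0, where the awsgluedq module ships with the runtime; the S3 path, evaluation-context name, and the exception used to halt the job are illustrative, not a definitive implementation.

from awsglue.context import GlueContext
from pyspark.context import SparkContext
from awsgluedq.transforms import EvaluateDataQuality

glue_context = GlueContext(SparkContext.getOrCreate())

# Daily CSV order files landed in S3 (path is illustrative).
orders = glue_context.create_dynamic_frame.from_options(
    connection_type="s3",
    connection_options={"paths": ["s3://example-bucket/orders/"]},
    format="csv",
    format_options={"withHeader": True},
)

# CSV columns arrive as strings; cast order_total so the numeric rule applies.
orders = orders.resolveChoice(specs=[("order_total", "cast:double")])

# DQDL ruleset: each rule must hold for at least 99.5% of rows, so the
# check fails exactly when more than 0.5% of rows break either rule.
ruleset = """
Rules = [
    Completeness "customer_id" >= 0.995,
    ColumnValues "order_total" > 0 with threshold >= 0.995
]
"""

# Evaluate the ruleset; the result is a DynamicFrame with one row per rule
# and an Outcome column of Passed or Failed.
rule_outcomes = EvaluateDataQuality.apply(
    frame=orders,
    ruleset=ruleset,
    publishing_options={
        "dataQualityEvaluationContext": "orders_dq",  # illustrative name
        "enableDataQualityResultsPublishing": True,
    },
)

# Halt before the Redshift load if any rule failed; the resulting job
# failure can drive an operations alert (e.g., via an EventBridge rule).
if rule_outcomes.toDF().filter("Outcome = 'Failed'").count() > 0:
    raise RuntimeError("Data quality gate failed: more than 0.5% of rows are invalid")

When the job is built visually in Glue Studio, the same transform exposes a fail-the-job data quality action, so the exception-raising step can be configured rather than hand-written.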
