AWS Certified Data Engineer Associate DEA-C01 Practice Question
An e-commerce company lands daily CSV order files in Amazon S3. An AWS Glue Spark job loads the data into Amazon Redshift. Each record must contain a non-null customer_id and an order_total greater than 0. If more than 0.5% of rows break either rule, the pipeline must halt and alert operations; otherwise, loading continues. What is the most efficient way to add this validation with minimal new code?
Create CHECK constraints on the Redshift target table so the COPY command rejects any rows with null customer_id or non-positive order_total.
Add a DataBrew profile job to scan the CSV files before every Glue run and trigger the Glue job only if the profile shows fewer than 0.5% invalid rows.
Run an AWS Step Functions workflow that executes an Athena query after loading to count invalid rows and rolls back the transaction if the 0.5% limit is exceeded.
Insert an AWS Glue Data Quality transform with a ruleset that stops the job when more than 0.5% of rows fail completeness and custom checks.
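Add an Evaluate Data Quality transform to the existing Glue job. Define a ruleset such as Completeness "customer_id" >= 0.995 and ColumnValues "order_total" > 0 with threshold >= 0.995, so that each rule passes only when at least 99.5% of rows satisfy it. Choose the action that fails the job without loading data to the target, so Glue stops before writing to Redshift when the ruleset fails. Data quality results can be published to Amazon CloudWatch and surfaced as Amazon EventBridge events, which can drive the operations alert. The other options fall short: Amazon Redshift does not support CHECK constraints, the DataBrew profile job adds a separate job and extra orchestration, and the Step Functions and Athena approach validates only after the data has been loaded.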
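To make the 0.5% threshold concrete, here is a minimal plain-Python sketch of the arithmetic the ruleset expresses. It is not the AWS Glue Data Quality API: the function names, the dict-shaped rows, and the choices to treat a null order_total as a failure and to let an empty file pass are all assumptions made for illustration. The RULESET string adapts the rules from the explanation to DQDL's Rules = [...] form, using >= so that a file with exactly 0.5% failing rows still passes.

```python
from fractions import Fraction

# DQDL ruleset text adapted from the explanation; shown for reference only,
# the functions below do not parse it.
RULESET = """Rules = [
    Completeness "customer_id" >= 0.995,
    ColumnValues "order_total" > 0 with threshold >= 0.995
]"""

# 0.5% kept as an exact fraction to avoid float rounding at the boundary.
MAX_FAILURE_RATE = Fraction(1, 200)


def failure_counts(rows):
    """Count rows failing each rule. Rows are dicts (hypothetical shape)."""
    null_ids = sum(1 for r in rows if r.get("customer_id") is None)
    # Assumption: a missing order_total counts as a failure of the value rule.
    bad_totals = sum(
        1 for r in rows
        if r.get("order_total") is None or r["order_total"] <= 0
    )
    return {"customer_id": null_ids, "order_total": bad_totals}


def should_halt(rows, max_failure_rate=MAX_FAILURE_RATE):
    """Return True when any single rule's failure rate exceeds the limit."""
    total = len(rows)
    if total == 0:
        return False  # assumption: an empty file does not trip the threshold
    return any(
        Fraction(failed, total) > max_failure_rate
        for failed in failure_counts(rows).values()
    )


# Usage: 1,000 rows with 5 null customer_id values is exactly 0.5%.
orders = [{"customer_id": i, "order_total": 10.0} for i in range(1000)]
for row in orders[:5]:
    row["customer_id"] = None
print(should_halt(orders))  # False: 0.5% is not more than 0.5%
```

Each rule is checked against the limit separately, which mirrors how a two-rule ruleset is evaluated. If the requirement is read as a combined limit across both rules, the counts would need to be merged per row before comparing.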