AWS Certified Data Engineer Associate DEA-C01 Practice Question
A company ingests sales records into Amazon S3 and processes them nightly with an AWS Glue Spark job. The business requires the pipeline to stop automatically if more than 2% of rows have a null order_id so that inaccurate reports are not produced. What is the simplest way to enforce this data quality rule?
Modify the ETL script to write counts of order_id nulls to Amazon DynamoDB and use a CloudWatch alarm to stop the job when the threshold is breached.
Create an AWS Glue Data Quality ruleset that checks the completeness of the order_id column and configure the Glue job to fail when more than 2% of rows violate the rule.
Load the data into an Amazon Redshift staging table that enforces a NOT NULL constraint on order_id and abort the COPY command if invalid rows are detected.
Trigger an AWS Lambda function with Amazon EventBridge after the job finishes to run an Athena query that counts null order_id values and manually rerun the job if the threshold is exceeded.
AWS Glue Data Quality lets you create rulesets such as completeness checks for specific columns. You can associate the ruleset with a Glue table and configure the ETL job to evaluate the rules before or during execution. When the null-value threshold is exceeded, the job automatically fails, preventing bad data from propagating. The other options either require building and maintaining custom code, external monitoring, or moving the data to another service, all of which add unnecessary operational overhead compared with the managed Glue Data Quality feature.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is AWS Glue Data Quality?
Open an interactive chat with Bash
What is a completeness rule in AWS Glue Data Quality?
Open an interactive chat with Bash
Why is AWS Glue's Data Quality feature better than the custom solutions in the other options?
Open an interactive chat with Bash
What is AWS Glue Data Quality?
Open an interactive chat with Bash
How do you configure a Glue job to enforce a Data Quality rule?
Open an interactive chat with Bash
Why are the other solutions less efficient compared to AWS Glue Data Quality?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .