AWS Certified Data Engineer Associate DEA-C01 Practice Question
A retail company runs an AWS Glue Spark job that reads daily order files from an S3 bucket, transforms the data, and loads it into an Amazon Redshift fact table. The business requires that the order_id column be unique each day; if any duplicate order_id values are found, the load must abort and an alert must be sent. Which approach most efficiently enforces this rule inside the existing Glue job while minimizing additional services?
Attach an AWS Lake Formation row-level access filter to drop duplicate order_id rows during the load.
Add an AWS Glue Data Quality transform to the job, create a ruleset that asserts order_id is unique, and configure the job to fail if the rule is violated.
Define a UNIQUE constraint on order_id in the Redshift table and let the COPY command reject duplicate rows after the data load.
Enable Amazon S3 Object Lock in compliance mode on the raw bucket so duplicate objects cannot be written.
AWS Glue Data Quality lets you insert a Data Quality transform into a Glue Spark job. In that transform you attach a ruleset, for example a uniqueness rule on the order_id column (in DQDL, IsUnique "order_id"). At run time the job evaluates the dataset before the load step; if any duplicate values are detected, you can configure the job to fail immediately and publish metrics to CloudWatch, where an alarm can trigger an alert. This keeps duplicate rows from reaching Amazon Redshift without adding new infrastructure.
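The check-then-abort behavior described above can be illustrated with a minimal sketch in plain Python. This is only a stand-in for what the Data Quality transform evaluates, not the Glue API itself: the function names, the exception class, and the list-of-dicts row format are hypothetical, and only the DQDL rule text (IsUnique on order_id) reflects the rule a real ruleset would contain.

```python
from collections import Counter

# DQDL ruleset text of the kind a Glue Data Quality transform evaluates.
# IsUnique is a built-in DQDL rule type; order_id is the column from the question.
RULESET = 'Rules = [ IsUnique "order_id" ]'


class DuplicateOrderIdError(Exception):
    """Hypothetical exception used in this sketch to abort the load."""


def find_duplicate_order_ids(rows):
    """Return the sorted order_id values that occur more than once."""
    counts = Counter(row["order_id"] for row in rows)
    return sorted(oid for oid, n in counts.items() if n > 1)


def check_before_load(rows):
    """Raise before the load step if any order_id is duplicated."""
    duplicates = find_duplicate_order_ids(rows)
    if duplicates:
        raise DuplicateOrderIdError(f"duplicate order_id values: {duplicates}")
    return rows  # safe to hand to the load step


if __name__ == "__main__":
    orders = [{"order_id": 101}, {"order_id": 102}, {"order_id": 101}]
    try:
        check_before_load(orders)
    except DuplicateOrderIdError as err:
        print(f"load aborted: {err}")
```

In a Glue script, an unhandled exception raised before the write step fails the job run, which is the point at which a CloudWatch alarm or similar notification can fire.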
Amazon S3 Object Lock protects objects from being deleted or overwritten but does not inspect their contents, so it cannot detect duplicate rows. Redshift UNIQUE constraints are informational only: Redshift does not enforce them, so the COPY command would still load duplicates without raising an error, and the load would not abort. Lake Formation row-level filters control access, not data quality, and cannot abort a job on duplicate values.
Exam domain: Data Operations and Support