AWS Certified Data Engineer Associate DEA-C01 Practice Question

Your company ingests daily CSV files into an S3 data lake, runs an AWS Glue Spark job to denormalize the data, and then loads the result into an Amazon Redshift table that has a primary key on order_id. Duplicate order_id values occasionally appear in the source data and cause the Redshift load to fail. You must add an automated step to the existing Glue workflow that verifies the order_id column contains only unique values and stops the workflow if the rule is violated. Which approach satisfies these requirements with minimal custom code?

  • Use AWS Database Migration Service with a full-load task followed by validation to compare S3 data with Redshift and detect any duplicate records before loading.

  • Enable versioning on the S3 bucket so duplicate files are stored as separate object versions, ensuring the downstream load receives only the latest data.

  • Create an AWS Glue Data Quality ruleset that applies an IsUnique rule to the order_id column, and configure the evaluation's failure action to stop the workflow when the rule is violated.

  • Turn on AWS Glue job bookmarks so previously processed rows are skipped and duplicates are automatically removed during the next job run.
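
For context, below is a minimal sketch of how a Glue Data Quality evaluation step could be embedded in a Glue Spark job like the one described. It assumes the denormalized data is available as a DynamicFrame; the database, table, and evaluation-context names (sales_db, orders_denormalized, orders_dq_check) are illustrative and not taken from the question.

from awsglue.context import GlueContext
from awsgluedq.transforms import EvaluateDataQuality
from pyspark.context import SparkContext

glue_context = GlueContext(SparkContext.getOrCreate())

# Assume the denormalized output of the existing job is available as a
# DynamicFrame; the catalog database and table names below are illustrative.
orders_dyf = glue_context.create_dynamic_frame.from_catalog(
    database="sales_db",
    table_name="orders_denormalized",
)

# DQDL ruleset: every order_id value must be unique.
ruleset = 'Rules = [ IsUnique "order_id" ]'

dq_results = EvaluateDataQuality.apply(
    frame=orders_dyf,
    ruleset=ruleset,
    publishing_options={
        "dataQualityEvaluationContext": "orders_dq_check",  # illustrative name
        "enableDataQualityCloudWatchMetrics": True,
        "enableDataQualityResultsPublishing": True,
    },
)

# Raise an error if any rule outcome is "Failed" so the job, and therefore the
# Glue workflow, stops before the Redshift load step runs.
failed_rules = dq_results.toDF().filter("Outcome = 'Failed'")
if failed_rules.count() > 0:
    failed_rules.show(truncate=False)
    raise Exception("Data quality check failed: order_id is not unique.")

In the Glue Studio visual editor, a similar effect can typically be achieved without custom code by adding an Evaluate Data Quality node with the same IsUnique rule and selecting its fail-on-error action.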

Domain: Data Operations and Support