AWS Certified Data Engineer Associate DEA-C01 Practice Question

An e-commerce company stores daily transaction CSV files in Amazon S3. The downstream ML pipeline fails whenever numeric columns contain null or non-numeric values. You need an automated, low-code solution that validates and cleans each new file, stores the corrected data in a curated S3 prefix, and provides a summary of invalid records. Which approach requires the least operational effort?

  • Invoke an Amazon Athena CTAS query from an AWS Lambda function each day to select only valid rows into a new table stored in a different S3 prefix and publish results to Amazon SNS.

  • Build a custom Docker image that uses pandas to clean the files and run it daily with AWS Batch, writing logs of invalid rows to Amazon CloudWatch Logs.

  • Spin up an Amazon EMR cluster running Apache Spark, develop a PySpark script to validate and cleanse the dataset, and schedule the job with AWS Step Functions.

  • Create an AWS Glue DataBrew project that applies data quality rules, schedule a recipe job to output cleaned data to a curated S3 prefix, and rely on the job run metrics for the invalid-row summary.
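Independent of which AWS service runs it, the core transformation the question describes is: coerce numeric columns, separate invalid rows, and report how many were dropped. A minimal pandas sketch of that logic (the column names and sample data are hypothetical, not from any AWS API):

```python
import io
import pandas as pd

def clean_numeric_csv(csv_text: str, numeric_cols: list[str]):
    """Coerce the given columns to numeric, drop rows with null or
    non-numeric values, and return (cleaned frame, invalid-row count)."""
    df = pd.read_csv(io.StringIO(csv_text))
    for col in numeric_cols:
        # Non-numeric entries become NaN so all bad values filter uniformly.
        df[col] = pd.to_numeric(df[col], errors="coerce")
    valid = df.dropna(subset=numeric_cols)
    return valid.reset_index(drop=True), len(df) - len(valid)

# Hypothetical sample: row 2 has a non-numeric price, row 3 a missing quantity.
raw = "order_id,price,quantity\n1,9.99,2\n2,oops,1\n3,4.50,\n4,1.25,5\n"
cleaned, invalid_count = clean_numeric_csv(raw, ["price", "quantity"])
print(invalid_count)  # → 2
```

A DataBrew recipe job applies equivalent steps declaratively, which is why it counts as "low-code": the validation rules, the curated output location, and the invalid-row metrics are all configured rather than programmed.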

Domain: Data Operations and Support