AWS Certified Data Engineer Associate DEA-C01 Practice Question
An AWS Step Functions pipeline ingests hourly JSON files from Amazon S3 created by 1,000 devices. Each file must contain 60 records, and every record must include the keys device_id, ts, and temperature. The team needs a serverless check for these completeness rules before loading data into Amazon Redshift. If a rule fails, the pipeline must halt and a CloudWatch alarm must alert the on-call staff. Which approach is most cost-effective?
Develop an AWS Lambda function that reads the file from S3, counts rows, checks each JSON object, and emits a custom CloudWatch metric that triggers an alarm on anomalies.
Add a Step Functions task that runs an Amazon Athena query to COUNT(*) rows and test for NULL values; raise a CloudWatch alarm if the query result violates thresholds.
Create an AWS Glue DataBrew profile job with data-quality rules for the required columns and a RowCount = 60 rule. Start the job from Step Functions and use an EventBridge rule that catches a FAILED validationState to raise a CloudWatch alarm and fail the state machine.
Load the file into Amazon Redshift with the COPY command configured with MAXERROR 0 and ACCEPTANYDATE, then trigger a CloudWatch alarm if the COPY operation fails.
AWS Glue DataBrew lets you define no-code data-quality rules, including a RowCount = 60 rule and column-completeness checks. A DataBrew profile job with the ruleset can be started from Step Functions by using the StartJobRun.sync service integration. When the job finishes, DataBrew emits a Ruleset Validation Result event; an EventBridge rule can detect validationState = FAILED, trigger a CloudWatch alarm (for example via SNS), and set the state machine to fail, stopping further processing. Athena SQL or custom Lambda code require more development effort, and COPY into Redshift validates only after loading begins, so they are less suitable and potentially more expensive.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is AWS Glue DataBrew and how does it help with data-quality checks?
Open an interactive chat with Bash
What is the role of AWS Step Functions in this pipeline solution?
Open an interactive chat with Bash
How does AWS EventBridge interact with failures in DataBrew jobs?
Open an interactive chat with Bash
What are AWS Glue DataBrew profile jobs?
Open an interactive chat with Bash
How does EventBridge integrate with AWS Step Functions?
Open an interactive chat with Bash
Why is AWS Glue DataBrew more cost-effective than Lambda or Redshift for this use case?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .