AWS Certified Data Engineer Associate DEA-C01 Practice Question

Your company stores raw click-stream events as gzip-compressed JSON files in an S3 bucket partitioned by dt=YYYY-MM-DD. Analysts report that some records occasionally lack the required session_id field. You must generate a curated dataset in another S3 prefix that contains only valid records, can be refreshed daily, and uses standard SQL while remaining fully serverless and cost-efficient. Which solution meets these requirements?

Provision an Amazon EMR cluster with Hive, schedule a daily HiveQL job that selects only records with a non-null session_id and writes the output to another S3 prefix, then terminate the cluster.
Load the raw files into Amazon Redshift Serverless each day, issue a SQL query to remove null session_id values, and UNLOAD the cleaned data back to a different S3 location.
Run a CREATE TABLE AS SELECT query in Amazon Athena that filters out rows where session_id IS NULL and writes the results to a new S3 prefix; use Athena Scheduled Queries to execute the statement daily.
Create an AWS Glue DataBrew project pointing at the S3 dataset, add a recipe step to delete rows with null session_id, and run the DataBrew job on a daily schedule.

AWS Certified Data Engineer Associate DEA-C01

Data Operations and Support

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Answer Description

Ask Bash

What is Amazon Athena and how does it work?

What is a CREATE TABLE AS SELECT (CTAS) query in Athena?

What are Athena Scheduled Queries and how do they work?

Monthly

$19.99 $11.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99 $26.99

One time purchase of $26.99,
Does not auto-renew.

Annual Pass

$119.99 $71.99

One time purchase of $71.99,
Does not auto-renew.

Lifetime Pass

$189.99 $113.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Report Issue

Answer Description

Ask Bash

What is Amazon Athena and how does it work?

What is a CREATE TABLE AS SELECT (CTAS) query in Athena?

What are Athena Scheduled Queries and how do they work?

Report Issue