AWS Certified Data Engineer Associate DEA-C01 Practice Question

An e-commerce company stores daily Parquet files of order events in Amazon S3, partitioned by dt=YYYY-MM-DD. A data engineer must deliver a daily report that shows the 7-day moving average of sales per product and allow analysts to perform ad-hoc joins with other S3 datasets from Amazon QuickSight. The solution must minimize infrastructure management and keep costs low. Which architecture meets these requirements?

  • Create an AWS Glue ETL job that loads the data into a provisioned Amazon Redshift cluster each night. Build a materialized view for the moving average and point QuickSight to the Redshift database.

  • Launch an Amazon EMR cluster running Apache Spark SQL on a daily schedule to compute the moving average, write the results back to S3, and expose the data to analysts through an Athena Federated Query connector to EMR.

  • Use an AWS Glue crawler to catalog the S3 folder, query the data with Amazon Athena, and create a SQL view that uses window functions to calculate the 7-day moving average. Grant QuickSight access to Athena and schedule the query daily with Amazon EventBridge.

  • Create an AWS Glue DataBrew project to compute the moving average, export the output to Amazon OpenSearch Service, and connect QuickSight to OpenSearch for dashboards.
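
For readers unfamiliar with the window-function calculation mentioned in one of the options, the following is a minimal sketch of a 7-day moving average per product. It runs the SQL against an in-memory SQLite database purely as a local stand-in so the example is runnable; it is not Athena itself. The table name daily_sales, its columns, and the helper seven_day_moving_average are hypothetical and not part of the question. The sketch also assumes the data has already been aggregated to one row per product per day with no missing days; a ROWS-based frame counts rows, not calendar days, so gaps would stretch the window.

```python
import sqlite3
from datetime import date, timedelta


def seven_day_moving_average(rows):
    """Return (dt, product_id, moving_avg_7d) tuples.

    rows: iterable of (dt, product_id, sales), assumed to hold exactly
    one row per product per day (hypothetical, pre-aggregated data).
    """
    conn = sqlite3.connect(":memory:")  # local stand-in for a query engine
    conn.execute(
        "CREATE TABLE daily_sales (dt TEXT, product_id TEXT, sales REAL)"
    )
    conn.executemany("INSERT INTO daily_sales VALUES (?, ?, ?)", rows)

    # The frame covers the current row plus the six preceding rows
    # for the same product, ordered by date.
    query = """
        SELECT dt,
               product_id,
               AVG(sales) OVER (
                   PARTITION BY product_id
                   ORDER BY dt
                   ROWS BETWEEN 6 PRECEDING AND CURRENT ROW
               ) AS moving_avg_7d
        FROM daily_sales
        ORDER BY product_id, dt
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


# Ten days of made-up sales for one product: 1.0, 2.0, ..., 10.0
start = date(2024, 1, 1)
sample = [
    ((start + timedelta(days=i)).isoformat(), "p1", float(i + 1))
    for i in range(10)
]
averages = seven_day_moving_average(sample)
for row in averages:
    print(row)
```

During the first six days the frame holds fewer than seven rows, so the average is taken over whatever days are available. Whether that is acceptable, or whether those early rows should be filtered out, depends on the reporting requirement.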

Data Operations and Support