AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineering team needs to build a daily pipeline that reads raw transaction files from Amazon S3, performs multi-table joins, and applies user-defined Python functions to generate analytics datasets. The team wants to store the transformation logic as PySpark scripts in Git and run the code on a fully managed, serverless service without administering clusters. Which AWS service meets these requirements with the least operational overhead?

  • Create an AWS Glue job that runs the PySpark script

  • Schedule an Amazon Athena query with Amazon EventBridge

  • Invoke the PySpark script from an AWS Step Functions workflow

  • Use AWS Glue DataBrew to build the transformations visually

AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot