AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineering team uses AWS Step Functions to launch a transient Amazon EMR 6.x cluster nightly to run a PySpark ETL step, after which the cluster terminates automatically. When a step fails, the cluster shuts down before engineers can view Spark driver and executor logs. The team must retain detailed logs and the Spark history UI for post-mortem analysis while adding minimal EC2 cost. Which action meets these requirements?

  • Enable CloudTrail data events on the input data bucket to capture Spark driver logs for later review.

  • Specify an Amazon S3 log URI and enable persistent application user interfaces for Spark when creating the EMR cluster.

  • Configure EMRFS Consistent View so logs are automatically synchronized to Amazon S3 after each task.

  • Enable termination protection and disable auto-termination so the cluster remains available for manual log retrieval via SSH.

AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot