AWS Certified Data Engineer Associate DEA-C01 Practice Question

An e-commerce company stores order data as daily Parquet files in an Amazon S3 prefix. A data engineer builds an AWS Glue Spark job that runs hourly to load new data into Amazon Redshift. The job must skip files that were already processed and should replay those files only if the job is rerun after a failure. Which AWS Glue feature should the engineer enable to meet these requirements?

  • Configure continuous logging to Amazon CloudWatch Logs.

  • Turn on speculative execution in the Spark configuration.

  • Set the job type to streaming and process the data with Apache Hudi.

  • Enable job bookmarks for the AWS Glue job.

AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot