AWS Certified Data Engineer Associate DEA-C01 Practice Question

An AWS Glue Spark job transforms 100 GB of CSV data in Amazon S3 and loads it into an Amazon Redshift cluster through a JDBC connection. The run time increased from 15 to 45 minutes. CloudWatch shows Glue CPU under 30 percent, but Redshift commit-queue waits and a saturated WLM load queue. Without adding more DPUs or cluster nodes, which action will MOST reduce job duration?

  • Reduce the number of WLM query slots from 5 to 2 to give each slot more memory.

  • Write the transformed data to Amazon S3 in Parquet format and have the job invoke an Amazon Redshift COPY command to load the files.

  • Enable Amazon Redshift Concurrency Scaling for the WLM queue used by the job.

  • Double the Glue worker count and upgrade from G.1X to G.2X workers.

AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot