AWS Certified Data Engineer Associate DEA-C01 Practice Question

A company's nightly AWS Glue 3.0 Spark job reads 3 TB of Parquet data from Amazon S3 and loads it into an Amazon Redshift table. The job used to finish in 40 minutes, but the most recent runs take more than 2 hours, and several tasks stay in the READY state for an extended time. To quickly identify stage bottlenecks such as partition skew or insufficient executor memory without increasing cost, which action should a data engineer perform first?

  • Increase the job's maximum DPUs and enable continuous logging to Amazon CloudWatch Logs.

  • Configure the CloudWatch log group for the job to stream to Amazon S3 and query the logs with Amazon Athena.

  • Open the job's Spark UI from the AWS Glue console and review stage and executor metrics in the Spark History Server.

  • Enable job bookmarks so the job can skip partitions that have already been processed.
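The third option refers to the Glue Spark UI, which AWS Glue enables through two documented job arguments, `--enable-spark-ui` and `--spark-event-logs-path`. A minimal sketch of building that argument map is below; the function name and the S3 log path are placeholders for illustration, not values from the question.

```python
# Hypothetical helper: build the AWS Glue default arguments that turn on
# Spark UI event logging, so stage and executor metrics can be reviewed
# in the Spark History Server. The S3 prefix is a placeholder.
def spark_ui_arguments(event_log_path: str) -> dict:
    """Return Glue job arguments that enable Spark UI event logs."""
    return {
        "--enable-spark-ui": "true",               # documented Glue job argument
        "--spark-event-logs-path": event_log_path,  # S3 prefix for Spark event logs
    }

args = spark_ui_arguments("s3://example-bucket/spark-logs/")  # bucket is hypothetical
print(args["--enable-spark-ui"])
```

These arguments can be set on the job's default arguments (for example via the console or an `update_job` call), after which completed runs become inspectable in the Spark History Server without re-provisioning capacity.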

Domain: Data Operations and Support