AWS Certified Data Engineer Associate DEA-C01 Practice Question
A company's nightly AWS Glue 3.0 Spark job reads 3 TB of Parquet data from Amazon S3 and loads it into an Amazon Redshift table. The job used to finish in 40 minutes, but the most recent runs take more than 2 hours, and several tasks stay in the READY state for an extended time. To quickly identify stage bottlenecks such as partition skew or insufficient executor memory without increasing cost, which action should a data engineer perform first?
Open the job's Spark UI from the AWS Glue console and review stage and executor metrics in the Spark History Server.
Enable job bookmarks so the job can skip partitions that have already been processed.
Configure the CloudWatch log group for the job to stream to Amazon S3 and query the logs with Amazon Athena.
Increase the job's maximum DPUs and enable continuous logging to Amazon CloudWatch Logs.
The AWS Glue console exposes a Spark History Server link for every completed or running job. Opening the Spark UI shows stage-level DAG details, task counts, executor memory usage, and shuffle statistics, allowing engineers to spot data skew or oversized partitions. This diagnostic step requires no additional DPUs or paid services. Simply raising DPUs or enabling continuous logging might mask the root cause and adds cost. Exporting logs to Amazon S3 for Athena analysis is slower and provides less granular executor information, while job bookmarks only control incremental loads and do not help troubleshoot performance.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is the Spark History Server in AWS Glue?
Open an interactive chat with Bash
What is partition skew in Spark jobs?
Open an interactive chat with Bash
What are DPUs in AWS Glue, and how do they affect job performance?
Open an interactive chat with Bash
What is the Spark History Server in AWS Glue?
Open an interactive chat with Bash
What are data skew and shuffle statistics in Spark jobs?
Open an interactive chat with Bash
How do job bookmarks work in AWS Glue?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .