AWS Certified Data Engineer Associate DEA-C01 Practice Question
You manage an Amazon EKS cluster that runs containerized Apache Spark batch jobs that transform data in Amazon S3. The cluster uses a fixed managed node group of twenty m5.xlarge On-Demand instances. During nightly runs CPU utilization exceeds 80 percent and jobs slow, but daytime utilization is under 10 percent. You must boost performance and cut idle costs with minimal operations effort. Which approach meets these goals?
Increase the existing node group to forty m5.xlarge instances and enable vertical pod autoscaling for Spark executors to remove resource contention.
Install the Kubernetes Cluster Autoscaler on the EKS cluster, create a managed node group that mixes On-Demand and Spot Instances, and set CPU and memory requests for all Spark pods.
Migrate the Spark containers to Amazon ECS and enable Service Auto Scaling based on average CPU utilization across tasks.
Create an EKS Fargate profile for the Spark namespace so every Spark pod runs on Fargate while keeping the existing node group for system pods.
Using the Kubernetes Cluster Autoscaler with an EKS managed node group that contains a mix of On-Demand and Spot Instances allows the cluster to add capacity when Spark jobs need it and automatically scale in when demand falls. Defining CPU and memory requests for the Spark driver and executor pods gives the autoscaler the information it needs to schedule additional nodes only when required. This improves job throughput at night while avoiding unutilized compute during the day, and Spot Instances lower cost further. Moving all workloads to Fargate would simplify management but is typically more expensive for long-running, compute-intensive Spark jobs. Migrating to ECS changes the platform and offers no inherent cost benefit for Spark. Simply doubling the node group and relying on vertical pod autoscaling removes the performance bottleneck but increases costs and does not reclaim idle capacity.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is the Kubernetes Cluster Autoscaler?
Open an interactive chat with Bash
What are Spot Instances and how do they reduce costs?
Open an interactive chat with Bash
Why are CPU and memory requests important in Spark pods?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .