AWS Certified Solutions Architect Associate SAA-C03 Practice Question
A tech startup hosts its web application on a fleet of Amazon EC2 instances behind an Application Load Balancer. They experience unpredictable spikes in user traffic and want to ensure that their application scales automatically to maintain performance during peak times. As a Solutions Architect, which metric would you recommend they use to trigger scaling actions to handle increased load effectively?
Monitor the CPU utilization of the EC2 instances to trigger scaling actions.
Use the available memory on the EC2 instances to trigger scaling actions.
Base scaling actions on the average network latency at the Application Load Balancer.
Utilize the request count per target from the Application Load Balancer to trigger scaling actions.
Utilizing the request count per target from the Application Load Balancer to trigger scaling actions is the most effective metric in this scenario. This metric directly reflects the incoming user load and helps ensure that the number of EC2 instances scales in proportion to the demand, maintaining application performance during traffic spikes. Monitoring CPU utilization or available memory on EC2 instances might not accurately capture the need for scaling because resource utilization can vary based on many factors and may not correlate directly with user traffic. Basing scaling actions on network latency at the load balancer is also less effective because latency can be affected by factors other than user load, such as network issues, and may not provide a timely trigger for scaling actions.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is the request count per target metric?
Open an interactive chat with Bash
Why is CPU utilization not a good metric for scaling in this scenario?
Open an interactive chat with Bash
How does an Application Load Balancer help with scaling decisions?