AWS Certified Solutions Architect Associate SAA-C03 Practice Question
Which of the following statements accurately describes how AWS Lambda scales in response to increased traffic?
Lambda can only process one invocation at a time; additional requests are queued until the first finishes.
When traffic increases, Lambda launches new EC2 instances in your VPC to run the function code directly.
You must pre-allocate execution environments before Lambda can scale past 100 concurrent invocations.
Lambda automatically creates additional execution environments to handle concurrent invocations up to the configured account or function concurrency quota.
AWS Lambda automatically provisions additional execution environment instances so that multiple invocations of a function can run in parallel. This scaling continues until the function or account-level concurrency quota is reached. You do not have to launch servers or pre-allocate capacity for on-demand scaling, although you can optionally configure reserved or provisioned concurrency for stricter control. Options that describe fixed limits of 100 concurrent invocations, mandatory pre-allocation, or launching of EC2 instances are inaccurate because Lambda abstracts server management and scales well beyond 100 by default (1,000 concurrent executions per Region, unless increased) and does not rely on directly launching EC2 instances. Lambda also does not reject traffic unless quotas are exceeded or reserved concurrency is set too low.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What does it mean for AWS Lambda to scale automatically?