AWS Certified AI Practitioner AIF-C01 Practice Question

A developer is calling a large language model through Amazon Bedrock. The responses are useful but far longer than the application's chat window can display. Which inference parameter should the developer reduce to directly limit the length of each response?

  • Add additional stop sequences, such as common punctuation marks.

  • Decrease the temperature value used during sampling.

  • Reduce the Top-P (nucleus sampling) value.

  • Lower the maximum number of output tokens the model is allowed to generate.

AWS Certified AI Practitioner AIF-C01
Applications of Foundation Models
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot