AWS Certified AI Practitioner AIF-C01 Practice Question

A developer is calling a large language model through Amazon Bedrock. The responses are useful but far longer than the application's chat window can display. Which inference parameter should the developer reduce to directly limit the length of each response?

Add additional stop sequences, such as common punctuation marks.
Decrease the temperature value used during sampling.
Reduce the Top-P (nucleus sampling) value.
Lower the maximum number of output tokens the model is allowed to generate.

AWS Certified AI Practitioner AIF-C01

Applications of Foundation Models

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

AWS Certified AI Practitioner AIF-C01 Practice Question

Answer Description

Ask Bash

What is a token in the context of large language models?

How does the maxTokens parameter affect model responses?

What is the role of Temperature and Top-P in language models?

What is a token in the context of large language models?

How does the maxTokens parameter influence model outputs?

What is the role of temperature in large language models?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

AWS Certified AI Practitioner AIF-C01 Practice Question

Report Issue

Answer Description

Ask Bash

What is a token in the context of large language models?

How does the maxTokens parameter affect model responses?

What is the role of Temperature and Top-P in language models?

What is a token in the context of large language models?

How does the maxTokens parameter influence model outputs?

What is the role of temperature in large language models?

Report Issue