AWS Certified AI Practitioner AIF-C01 Practice Question
A startup is testing a new text-generation foundation model to confirm that its replies sound polite and helpful. Which evaluation method is most appropriate for measuring these subjective qualities?
Record the model's average inference latency in milliseconds.
Compute the BLEU score of each response compared with a reference answer.
Count the total number of output tokens produced per response.
Have human reviewers rate the model's responses against a qualitative rubric.
The correct choice is to have human reviewers rate the model's responses against a qualitative rubric. Politeness and helpfulness are subjective, qualitative attributes that automated metrics such as BLEU, latency, or token counts cannot reliably capture. Organizations therefore typically rely on human evaluation: reviewers read sample responses and rate them against a rubric that defines the desired qualities (for example, levels of politeness or usefulness of the information). Human judgment can account for nuance, tone, and context in ways that automated statistics cannot. BLEU measures n-gram overlap with reference text, latency measures response speed, and token counts track output length; none of these directly indicates whether an answer is polite or helpful.
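To make the contrast concrete, here is a minimal Python sketch under stated assumptions. The rubric criteria, the 1-to-5 scale, the sample responses, and the reviewer scores are all hypothetical and invented for illustration. The function `unigram_precision` is a deliberately simplified stand-in for BLEU (clipped unigram overlap only, with no higher-order n-grams or brevity penalty), not the full metric. It is meant only to suggest how an overlap score can favor a curt reply while averaged human rubric ratings favor the courteous one.

```python
from collections import Counter
from statistics import mean

# Hypothetical rubric: each criterion is rated from 1 (poor) to 5 (excellent).
RUBRIC = ("politeness", "helpfulness")


def aggregate_ratings(ratings):
    """Average each rubric criterion across reviewers for one response.

    `ratings` is a list of dicts, one per reviewer, mapping criterion -> score.
    """
    return {criterion: mean(r[criterion] for r in ratings) for criterion in RUBRIC}


def unigram_precision(candidate, reference):
    """Clipped unigram precision: a simplified stand-in for BLEU, not full BLEU."""
    cand_tokens = candidate.lower().split()
    if not cand_tokens:
        return 0.0
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(cand_tokens)
    # Each candidate word counts at most as often as it appears in the reference.
    matched = sum(min(n, ref_counts[word]) for word, n in cand_counts.items())
    return matched / len(cand_tokens)


# Hypothetical reference answer and two model responses.
reference = "please restart the router and try again"
curt = "restart the router and try again obviously"
courteous = "Happy to help! Could you kindly power-cycle your router?"

# Hypothetical scores from two human reviewers per response.
curt_ratings = [
    {"politeness": 1, "helpfulness": 4},
    {"politeness": 2, "helpfulness": 4},
]
courteous_ratings = [
    {"politeness": 5, "helpfulness": 4},
    {"politeness": 4, "helpfulness": 5},
]

if __name__ == "__main__":
    print("overlap, curt:     ", unigram_precision(curt, reference))
    print("overlap, courteous:", unigram_precision(courteous, reference))
    print("rubric, curt:      ", aggregate_ratings(curt_ratings))
    print("rubric, courteous: ", aggregate_ratings(courteous_ratings))
```

In this toy data, the curt reply shares most of its words with the reference and the courteous paraphrase shares none, so the overlap score ranks them in the opposite order from the reviewers' politeness ratings. A real evaluation would use more reviewers, a written rubric with anchored score descriptions, and some check of inter-reviewer agreement.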
Exam domain: Applications of Foundation Models