AWS Certified AI Practitioner AIF-C01 Practice Question
A development team wants a repeatable way to compare how accurately two large language models create English text summaries. Which approach provides an objective, standardized measurement of model quality?
Ask each model to critique and score its own summaries.
Fine-tune the models further and compare their final training loss values.
Run both models on an open benchmark dataset for summarization and calculate ROUGE scores.
Gather informal opinions from a small group of employees after ad-hoc testing.
Running both models on a public benchmark dataset built specifically for text summarization gives them the same inputs and the same reference summaries, so the team can score each model with an established metric such as ROUGE, which measures n-gram overlap between a generated summary and a human-written reference. Because the data and scoring procedure are fixed, results are repeatable and support a fair, quantitative comparison. Self-evaluation, informal employee feedback, and training loss alone do not yield an objective, standardized benchmark score.
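To make the scoring idea concrete, here is a minimal sketch of a ROUGE-N style comparison in plain Python. It is a simplification: it assumes lowercase whitespace tokenization and no stemming, and the function names (`rouge_n`, `mean_rouge_f1`) and the tiny data set are hypothetical illustrations, not part of any benchmark or library. A real evaluation would use a maintained ROUGE implementation and a published summarization data set.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(candidate, reference, n=1):
    """Simplified ROUGE-N: clipped n-gram overlap between two strings.

    Assumption: lowercase whitespace tokenization, no stemming.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    # Counter intersection keeps the minimum count of each shared n-gram.
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


def mean_rouge_f1(summaries, references, n=1):
    """Average ROUGE-N F1 of one model's summaries over a fixed reference set."""
    scores = [rouge_n(s, r, n)["f1"] for s, r in zip(summaries, references)]
    return sum(scores) / len(scores)


# Hypothetical stand-in for a benchmark: the same references score both models.
references = ["the cat sat on the mat", "stocks rose sharply on friday"]
model_a_summaries = ["the cat sat on the mat", "stocks rose on friday"]
model_b_summaries = ["a dog ran", "stocks fell"]

score_a = mean_rouge_f1(model_a_summaries, references)
score_b = mean_rouge_f1(model_b_summaries, references)
print(f"model A mean ROUGE-1 F1: {score_a:.3f}")
print(f"model B mean ROUGE-1 F1: {score_b:.3f}")
```

Because both models are scored against the same references with the same procedure, rerunning the comparison gives the same numbers, which is the repeatability the explanation describes.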
AWS Certified AI Practitioner AIF-C01
Applications of Foundation Models