AWS Certified AI Practitioner AIF-C01 Practice Question

A development team wants a repeatable way to compare how accurately two large language models create English text summaries. Which approach provides an objective, standardized measurement of model quality?

Fine-tune the models further and compare their final training loss values.
Gather informal opinions from a small group of employees after ad-hoc testing.
Run both models on an open benchmark dataset for summarization and calculate ROUGE scores.
Ask each model to critique and score its own summaries.
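
For readers unfamiliar with the metric named in the options above, here is a minimal sketch of how a ROUGE-N score can be computed: it counts the n-grams a candidate summary shares with a human-written reference summary and reports precision, recall, and F1. The function name `rouge_n`, the whitespace tokenization, and the sample sentences are illustrative assumptions, not part of the original question; production evaluations typically rely on an established ROUGE implementation and a published benchmark dataset.

```python
from collections import Counter


def rouge_n(candidate: str, reference: str, n: int = 1) -> dict:
    """Simplified ROUGE-N (hypothetical helper): n-gram overlap between
    a candidate summary and one reference summary."""

    def ngrams(text: str) -> Counter:
        # Assumption: lowercase + whitespace split stands in for real tokenization.
        tokens = text.lower().split()
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    # Clipped overlap: each n-gram counts at most as often as it appears in both.
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Illustrative data only: one reference summary and two model outputs.
reference = "the cat sat on the mat"
model_a_summary = "the cat sat"
model_b_summary = "a dog ran"

scores_a = rouge_n(model_a_summary, reference)
scores_b = rouge_n(model_b_summary, reference)
print("model A:", scores_a)
print("model B:", scores_b)
```

Because both models are scored against the same references with the same formula, the resulting numbers are repeatable and directly comparable, which is what distinguishes this kind of measurement from informal opinions or self-assessment.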

Applications of Foundation Models
