AWS Certified AI Practitioner AIF-C01 Practice Question
A team is selecting pre-training data for a foundation language model that must perform well across a broad range of topics. Which data-selection strategy best supports this goal?
Select only the shortest sentences available to minimize training computation.
Gather a large, diverse, high-quality corpus that spans multiple domains and languages.
Rely exclusively on AI-generated text to avoid data licensing issues.
Use only domain-specific documents from the company's industry to reduce noise.
For a general-purpose foundation model, diversity and coverage in the training corpus are critical. Using text drawn from many domains, formats, and languages exposes the model to varied vocabulary, styles, and contexts, letting it learn representations that transfer to new tasks. Limiting the data to one industry, filtering only for very short sentences, or relying solely on synthetic text all reduce real-world variety and increase the risk of bias or poor generalization.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is a foundation language model?
Open an interactive chat with Bash
Why is data diversity important in training foundation models?
Open an interactive chat with Bash
How does using synthetic text impact training a foundation model?
Open an interactive chat with Bash
Why is diversity in training data important for a language model?
Open an interactive chat with Bash
What does a foundation language model do?
Open an interactive chat with Bash
Why is relying solely on AI-generated text problematic for training data?
Open an interactive chat with Bash
AWS Certified AI Practitioner AIF-C01
Fundamentals of Generative AI
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .