CompTIA DataX DY0-001 (V1) Practice Question

A data science team is implementing a Natural Language Generation (NLG) system to create summaries of complex legal documents. They are using a state-of-the-art transformer-based large language model. A critical issue they face is "hallucination," where the model generates plausible but factually incorrect information not present in the source text. To address this issue while maintaining high linguistic quality, which of the following strategies is the most appropriate to implement?

  • Implementing a retrieval-augmented generation (RAG) architecture to ground the model's output in the source documents.

  • Switching from subword tokenization to a character-level model to capture finer grammatical details.

  • Increasing the generation temperature to encourage more novel sentence structures.

  • Fine-tuning the model on a much larger, general-purpose text corpus to improve its world knowledge.

CompTIA DataX DY0-001 (V1)
Specialized Applications of Data Science
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot