CompTIA DataX DY0-001 (V1) Practice Question

A healthcare AI startup is developing a model to predict rare disease outbreaks. Their real-world dataset is small, highly imbalanced, and contains sensitive Personally Identifiable Information (PII). To augment their training data, they require a synthetic data generation method that can model complex, non-linear data relationships to create high-fidelity samples while also providing strong, provable privacy guarantees.

Which of the following approaches best meets these requirements?

  • Creating new datasets via statistical bootstrapping with replacement from the original data.

  • Employing a standard Variational Autoencoder (VAE) to generate samples from a learned latent space.

  • Using a Generative Adversarial Network (GAN) that incorporates differential privacy.

  • Applying the Synthetic Minority Over-sampling Technique (SMOTE) to generate new minority class instances.

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot