CompTIA DataX DY0-001 (V1) Practice Question

A financial services firm is developing a sophisticated machine learning model to detect new and emerging types of fraudulent transactions. The existing dataset contains a very small number of known fraud cases, which are insufficient for training a robust model. To address this, the data science team uses a Generative Adversarial Network (GAN) trained on the existing fraud samples to generate a larger, augmented dataset of fraudulent transactions. Which of the following describes the MOST significant limitation of this approach for its intended purpose?

  • The synthetic data will not perfectly match the statistical distribution of the real fraudulent data, leading to a distribution mismatch that reduces model performance.

  • The generative model will likely fail to produce synthetic examples of novel fraud patterns that are fundamentally different from the known fraud cases in the original dataset.

  • The generative model may amplify existing biases present in the small sample of known fraud cases, leading to discriminatory model behavior against certain user groups.

  • The computational cost and time required to train the GAN and generate a large volume of high-fidelity synthetic data will be prohibitively expensive.

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot