CompTIA DataX DY0-001 (V1) Practice Question

A financial services company needs to generate a high-fidelity synthetic dataset for stress-testing its fraud detection models. The original dataset contains millions of transactions, with complex, non-linear correlations between features that must be preserved. The primary goal is to create new, unseen data that follows the same statistical distribution as the real data, rather than just re-balancing classes or masking values. Which of the following describes the most suitable creation process for this requirement?

  • Modify the original dataset by adding a small amount of random statistical noise to numerical columns and systematically swapping the values between different records in categorical columns. This process is repeated until a new dataset of the desired size is created.

  • Identify instances of the minority class within the dataset. For each instance, find its k-nearest neighbors and create new synthetic samples by interpolating between the instance and its neighbors along a randomly chosen vector.

  • Train two neural networks in an adversarial process. One network, the generator, attempts to create realistic data from a random input, while a second network, the discriminator, is trained to differentiate between the real data and the generator's output. The system is trained until the generator can produce data that consistently fools the discriminator.

  • Implement an encoder-decoder model where the encoder maps input data to a probabilistic latent space. New data is then created by taking samples from this latent space and passing them through the decoder to generate data points that follow the learned distribution.

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot