CompTIA DataX DY0-001 (V1) Practice Question

A data science team at a financial institution is tasked with developing a machine learning model to detect fraudulent transactions. The team faces two major challenges: the dataset of known fraudulent transactions is extremely small, creating a severe class imbalance, and the raw data contains sensitive personally identifiable information (PII), whose use is restricted by privacy regulations. In this context, what is the most compelling rationale for generating synthetic data?

  • To generate a completely new dataset of hypothetical future transactions, allowing the model to anticipate novel fraud patterns that have not yet occurred.

  • To augment the minority class (fraudulent transactions) and create a more balanced dataset for model training, while simultaneously ensuring that no real customer PII is exposed during development.

  • To replace the original dataset entirely, thereby reducing data storage costs and simplifying the data ingestion pipeline for faster processing.

  • To perform data obfuscation on the existing PII fields, which is a regulatory requirement before any data can be used for analytics.
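
To make the idea of augmenting a minority class concrete, here is a minimal sketch of SMOTE-style oversampling using only NumPy. It is an illustration under stated assumptions, not a method the question prescribes: the function name `smote_like_oversample`, the two toy features (amount, hour of day), and all numbers are hypothetical and randomly generated, so no real transactions or PII are involved.

```python
import numpy as np

def smote_like_oversample(X_minority, n_new, k=3, seed=0):
    """Create n_new synthetic rows by interpolating between a minority
    row and one of its k nearest minority neighbours (SMOTE-style).
    Hypothetical helper for illustration; needs at least 2 minority rows."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X_minority, dtype=float)
    n = len(X)
    k = min(k, n - 1)
    # Pairwise Euclidean distances within the minority class only.
    dists = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a row is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]

    base = rng.integers(0, n, size=n_new)   # random base rows
    pick = rng.integers(0, k, size=n_new)   # which neighbour to pair with
    partner = neighbours[base, pick]
    gap = rng.random((n_new, 1))            # interpolation weight in [0, 1)
    return X[base] + gap * (X[partner] - X[base])

# Toy, made-up numeric features (amount, hour of day); assumed values only.
rng = np.random.default_rng(42)
legit = rng.normal([50.0, 14.0], [20.0, 4.0], size=(500, 2))
fraud = rng.normal([900.0, 3.0], [150.0, 1.5], size=(10, 2))

# Generate enough synthetic fraud rows to match the majority class size.
synthetic_fraud = smote_like_oversample(fraud, n_new=len(legit) - len(fraud))
X_balanced = np.vstack([legit, fraud, synthetic_fraud])
y_balanced = np.concatenate(
    [np.zeros(len(legit)), np.ones(len(fraud) + len(synthetic_fraud))]
)
```

Interpolating between real minority rows addresses the imbalance but is not by itself a privacy guarantee, because each synthetic row is derived directly from two real records. Where PII is the concern, synthetic data is more commonly produced by a generative model fitted to the data and then checked for disclosure risk before it is shared with developers.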

Operations and Processes