CompTIA DataX DY0-001 (V1) Practice Question

A data science team at a financial institution is tasked with developing a machine learning model to detect fraudulent transactions. The team faces two major challenges: the dataset of known fraudulent transactions is extremely small, creating a severe class imbalance, and the raw data contains sensitive PII, which restricts its use due to privacy regulations. In this context, what is the most compelling rationale for generating synthetic data?

To generate a completely new dataset of hypothetical future transactions, allowing the model to anticipate novel fraud patterns that have not yet occurred.
To augment the minority class (fraudulent transactions) and create a more balanced dataset for model training, while simultaneously ensuring that no real customer PII is exposed during development.
To replace the original dataset entirely, thereby reducing data storage costs and simplifying the data ingestion pipeline for faster processing.
To perform data obfuscation on the existing PII fields, which is a regulatory requirement before any data can be used for analytics.

CompTIA DataX DY0-001 (V1)

Operations and Processes

Your Score:

SAVE $64

CompTIA DataX Voucher

v1 / DY0-001

$529.00 $465.00

Bash, the Crucial Exams Chat Bot

AI Bot

CompTIA DataX DY0-001 (V1) Practice Question

Answer Description

Ask Bash

What is synthetic data, and how is it generated?

How does synthetic data address class imbalance in machine learning?

How does synthetic data protect PII while enabling data analysis?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

CompTIA DataX DY0-001 (V1) Practice Question

Report Issue

Answer Description

Ask Bash

What is synthetic data, and how is it generated?

How does synthetic data address class imbalance in machine learning?

How does synthetic data protect PII while enabling data analysis?

Report Issue