A data scientist is working with a binary fraud-detection dataset of 1,000,000 observations, of which only 0.2% are labeled as fraud. The model of choice is a gradient-boosted decision tree. The scientist plans to mitigate the extreme class imbalance with the Synthetic Minority Over-sampling Technique (SMOTE) and to assess performance with 5-fold stratified cross-validation before evaluating on a separate, untouched test set whose class distribution mirrors production.
Which procedure is the most appropriate for oversampling in this scenario so that the minority class is strengthened without introducing optimistic validation bias or excessive overfitting?
Run SMOTE on the entire dataset first so that synthetic minority records are present in every cross-validation fold.
Inside each cross-validation fold, apply SMOTE solely to the training partition, then train the model on that augmented data and validate on the untouched fold hold-out.
Build an ensemble that draws bootstrap samples from the majority class only, keeping each minority instance exactly once in every bootstrap replica.
Before cross-validation, duplicate every minority-class record 499 times to obtain a perfectly balanced 1:1 class ratio, then train and validate on this expanded dataset.
Applying SMOTE only to the training partition inside each cross-validation fold keeps the validation data distribution unchanged and prevents synthetic records (built from information in the training partition) from leaking into the fold's validation split. This preserves an unbiased estimate of generalization performance while still enriching the minority class during learning.
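As an illustration, the sketch below shows one common way to implement this in Python with scikit-learn and imbalanced-learn: SMOTE is placed inside an imblearn Pipeline, which resamples only the data the estimator is fitted on, so each validation fold is scored untouched. The synthetic dataset and all variable names here are placeholders, not the scientist's actual data.

```python
# Sketch: fold-wise SMOTE inside 5-fold stratified cross-validation.
# Assumes scikit-learn and imbalanced-learn are installed; the generated
# X, y below stand in for the real fraud table.
from sklearn.datasets import make_classification
from sklearn.ensemble import HistGradientBoostingClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score
from imblearn.over_sampling import SMOTE
from imblearn.pipeline import Pipeline

# Placeholder data with roughly 0.2% positives.
X, y = make_classification(n_samples=100_000, n_features=20,
                           weights=[0.998, 0.002], random_state=0)

# The imblearn Pipeline applies SMOTE only when the estimator is fitted,
# i.e. only to the training partition of each fold; validation data pass
# straight to the classifier unchanged.
pipeline = Pipeline([
    ("smote", SMOTE(random_state=0)),
    ("gbdt", HistGradientBoostingClassifier(random_state=0)),
])

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(pipeline, X, y, cv=cv, scoring="average_precision")
print("PR-AUC per fold:", scores)
```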
If SMOTE is run before cross-validation (or on the whole dataset), synthetic observations derived from one original minority instance can appear simultaneously in both the training and validation partitions, inflating metrics. Simply duplicating minority rows on a massive scale carries the same leakage risk and also encourages the model to memorize exact copies, increasing overfitting. Bootstrapping only the majority class re-uses existing majority records but never increases minority support, so it fails to correct the imbalance problem.
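For contrast, the leaky pattern described above would look roughly like the following sketch (reusing the assumed X and y from the previous example): SMOTE is applied to the whole dataset before the folds are drawn, so synthetic points interpolated from a minority record can land in the validation fold while their source records sit in the training fold.

```python
# Anti-pattern sketch: resampling BEFORE cross-validation leaks information,
# so the reported scores are optimistically biased.
from imblearn.over_sampling import SMOTE
from sklearn.ensemble import HistGradientBoostingClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

X_resampled, y_resampled = SMOTE(random_state=0).fit_resample(X, y)  # whole dataset
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
leaky_scores = cross_val_score(HistGradientBoostingClassifier(random_state=0),
                               X_resampled, y_resampled, cv=cv,
                               scoring="average_precision")
print("Inflated PR-AUC per fold:", leaky_scores)  # do not trust these numbers
```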
Therefore, the fold-wise application of SMOTE to the training split is the only strategy that both corrects the skew and yields reliable validation scores.
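To round out the workflow described in the question, the same pipeline can then be refit on the full training data and scored once on the untouched test set whose class mix mirrors production. The split below is only a stand-in for that held-out set, with hypothetical variable names.

```python
# Sketch of the final evaluation step, assuming the same pipeline as above
# and a held-out test set (X_test, y_test) that was never resampled.
from sklearn.metrics import average_precision_score
from sklearn.model_selection import train_test_split

# Stratified hold-out split standing in for the real, untouched test set.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)

pipeline.fit(X_train, y_train)             # SMOTE runs on the training data only
proba = pipeline.predict_proba(X_test)[:, 1]
print("Test PR-AUC:", average_precision_score(y_test, proba))
```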