CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is building a fraud detection model for a financial institution. The historical transaction dataset is highly imbalanced, with fraudulent transactions (the minority class) accounting for only 0.5% of the data. A baseline model trained on this data shows high accuracy but has an extremely low recall for the fraud class. The scientist needs to apply a mitigation technique to rebalance the training data. Which of the following approaches best addresses the class imbalance by creating new, varied examples for the minority class, thereby reducing the specific risk of overfitting that arises from simple duplication?

  • Synthetic Minority Oversampling Technique (SMOTE)

  • Applying L2 regularization to the baseline model

  • Randomly undersampling the non-fraudulent (majority) class

  • Randomly oversampling the fraudulent (minority) class by duplication

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot