CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is tasked with building a predictive model for a rare disease, where only 1% of the patient data belongs to the positive class. To address this severe class imbalance, the data scientist decides to use the Synthetic Minority Oversampling Technique (SMOTE). Which of the following statements provides the most accurate description of the fundamental process SMOTE uses to generate new samples?

  • It duplicates existing minority class samples at random until the class distribution is balanced.

  • It removes samples from the majority class at random to create a more balanced class distribution.

  • It creates synthetic samples on the line segments that join an existing minority class instance with one of its k-nearest neighbors from the same minority class.

  • It generates new minority samples by creating a probability distribution from the features of the minority class and sampling from it.

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot