CompTIA Data+ DA0-002 (V2) Practice Question

A data analyst is preparing a 250 000-row customer data set to train a supervised churn-prediction model. The target column, Churn_Flag, contains Yes/No values for 248 700 customers. The remaining 1 300 rows have NULL in that column only; every feature in those rows is complete and within expected ranges. Exploratory checks show that dropping these 1 300 records will not materially change the class balance or the statistical power of the model. The machine-learning library being used raises an error if the target variable is missing. Which data-cleansing technique is MOST appropriate for handling the 1 300 affected rows before modeling?

Delete the 1 300 rows that have a NULL value in Churn_Flag before training the model.
Bin Churn_Flag into broader categories and keep the rows to maximize training data size.
Impute each missing Churn_Flag with the most common class so the overall distribution is preserved.
Apply min-max scaling to the numeric features so the algorithm can ignore the NULL labels.
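
To make the first option concrete, here is a minimal sketch of deleting rows whose target label is missing, written in plain Python so it runs without extra libraries. The miniature table, the feature names (customer_id, tenure_months, monthly_spend), and the helper functions are hypothetical and only illustrate the idea; None stands in for NULL. In the scenario itself, the affected rows are 1 300 of 250 000, or 0.52% of the data set.

```python
# Hypothetical miniature of the customer table; None stands in for NULL.
rows = [
    {"customer_id": 1, "tenure_months": 12, "monthly_spend": 40.0, "Churn_Flag": "Yes"},
    {"customer_id": 2, "tenure_months": 30, "monthly_spend": 55.5, "Churn_Flag": "No"},
    {"customer_id": 3, "tenure_months": 7, "monthly_spend": 20.0, "Churn_Flag": None},
    {"customer_id": 4, "tenure_months": 48, "monthly_spend": 80.0, "Churn_Flag": "No"},
    {"customer_id": 5, "tenure_months": 3, "monthly_spend": 15.0, "Churn_Flag": None},
]


def drop_missing_target(records, target="Churn_Flag"):
    """Return only the records whose target label is present."""
    return [r for r in records if r.get(target) is not None]


def class_share(records, target="Churn_Flag"):
    """Share of each label among labeled records, used to check class balance."""
    labels = [r[target] for r in records if r.get(target) is not None]
    return {label: labels.count(label) / len(labels) for label in sorted(set(labels))}


labeled = drop_missing_target(rows)
dropped = len(rows) - len(labeled)

print(f"kept {len(labeled)} rows, dropped {dropped}")
print(class_share(labeled))

# Fraction of the full data set affected in the question's scenario.
affected_fraction = 1300 / 250000
print(f"affected fraction: {affected_fraction:.2%}")
```

With a pandas DataFrame, the same filtering is usually written as `df.dropna(subset=["Churn_Flag"])`, which drops only the rows where the target column is missing and leaves rows with other gaps untouched.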

CompTIA Data+ DA0-002 (V2)

Data Acquisition and Preparation


Answer Description

Ask Bash

Why is it important to delete rows with NULL values in the target variable instead of imputing them?

What does class balance mean in the context of machine learning?

How does listwise deletion impact the statistical power of a machine learning model?
