A data analyst is preparing a 250 000-row customer data set to train a supervised churn-prediction model. The target column, Churn_Flag, contains Yes/No values for 248 700 customers, while the remaining 1 300 rows have NULL in that column only; every feature in those 1 300 rows is otherwise complete and within expected ranges. Exploratory checks show that dropping 1 300 records will not materially change the class balance or statistical power of the model. The machine-learning library being used will raise an error if the target variable is missing. Which data-cleansing technique is MOST appropriate for handling the 1 300 affected rows before modeling?
Bin Churn_Flag into broader categories and keep the rows to maximize training data size.
Impute each missing Churn_Flag with the most common class so the overall distribution is preserved.
Delete the 1 300 rows that have a NULL value in Churn_Flag before training the model.
Apply min-max scaling to the numeric features so the algorithm can ignore the NULL labels.
Because the missing values occur in the target variable-not in the predictor features-the rows cannot contribute to supervised learning. Imputing or transforming the missing target would inject fabricated labels and risk corrupting the model. Binning or scaling features does nothing to resolve the missing label, and the library will still fail. Given that the affected subset represents only 0.52 % of the data and its removal does not bias the class distribution, listwise deletion (dropping those rows) is the proper cleansing step. Imputing the mode would create false churn outcomes; scaling features leaves the NULLs untouched; and binning the target is impossible without a value to bin, so those choices are incorrect.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
Why is it important to delete rows with NULL values in the target variable instead of imputing them?
Open an interactive chat with Bash
What does class balance mean in the context of machine learning?
Open an interactive chat with Bash
How does listwise deletion impact the statistical power of a machine learning model?
Open an interactive chat with Bash
CompTIA Data+ DA0-002 (V2)
Data Acquisition and Preparation
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99 $11.99
$11.99/mo
Billed monthly, Cancel any time.
$19.99 after promotion ends
3 Month Pass
$44.99 $26.99
$8.99/mo
One time purchase of $26.99, Does not auto-renew.
$44.99 after promotion ends
Save $18!
MOST POPULAR
Annual Pass
$119.99 $71.99
$5.99/mo
One time purchase of $71.99, Does not auto-renew.
$119.99 after promotion ends
Save $48!
BEST DEAL
Lifetime Pass
$189.99 $113.99
One time purchase, Good for life.
Save $76!
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .