CompTIA DataX DY0-001 (V1) Practice Question

While building a logistic-regression model to predict loan default, your training data show that 8 % of values for the numeric attribute debt_to_income_ratio are missing. Exploratory analysis reveals that the probability of a value being missing increases for borrowers who are younger than 25 and who have less than one year of employment, but within those strata the missingness appears random. The feature is continuous, right-skewed, and has a strong influence on the target. Regulation requires that the chosen imputation technique preserve the variable's variance and explicitly propagate the extra uncertainty introduced by the missing data to any downstream parameter estimates. Which imputation type is the most appropriate to meet these constraints?

  • Single mean imputation calculated within each cross-validation fold

  • Multiple imputation with pooled estimates across several completed data sets

  • Listwise deletion of all records that lack debt_to_income_ratio

  • k-nearest-neighbors imputation using Euclidean distance on standardized predictors

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot