CompTIA DataX DY0-001 (V1) Practice Question

A data science team is developing a multiple linear regression model to forecast monthly energy consumption. Diagnostic checks show a Pearson correlation of 0.96 between the predictors "heated_area_sqft" and "total_floor_area_sqft" and Variance Inflation Factors near 25 for both variables. When the model is refit on different random training samples, the coefficients for these two predictors vary widely even though the model's R² remains stable. Domain experts require that both predictors stay in the model. Which modeling adjustment is most likely to mitigate the multicollinearity problem while retaining both variables and without creating new synthetic features?

  • Increase the training-set size by appending several additional months of data before refitting the ordinary least-squares model.

  • Standardize all predictors to mean-zero, unit-variance and refit ordinary least squares.

  • Fit a ridge regression model and tune the L2 penalty (λ) by cross-validation.

  • Add an interaction term heated_area_sqft × total_floor_area_sqft to capture their joint effect.

CompTIA DataX DY0-001 (V1)
Modeling, Analysis, and Outcomes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot