CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is developing a linear regression model to predict housing prices. An initial analysis of the residuals versus fitted values plot shows a distinct curvilinear pattern with variance increasing as the predicted value increases, indicating the presence of both non-linearity and heteroscedasticity. The dependent variable, house_price, is right-skewed and contains several valid entries that are exactly zero, representing land-only sales. The goal is to transform the house_price variable to better meet the assumptions of linear regression.

Which of the following data transformation techniques is the most appropriate and robust approach in this scenario?

  • One-hot encoding the dependent variable.

  • A Box-Cox transformation after adding a constant to the dependent variable.

  • A standard Box-Cox transformation on the dependent variable.

  • A standard logarithmic transformation on the dependent variable.

CompTIA DataX DY0-001 (V1)
Modeling, Analysis, and Outcomes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot