CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is tasked with building a linear regression model to predict housing prices. Initial exploratory data analysis reveals that the target variable, Price, is continuous, strictly positive, and exhibits significant right-skew. Additionally, the dataset includes a key predictor, Geographic_Region, which is a nominal categorical feature with five unique, non-ordinal values. To meet the assumptions of linear regression and properly incorporate the categorical data, which of the following data enrichment strategies is the most appropriate for the data scientist to implement?

  • Apply standardization to the Price variable and create cross-terms for the Geographic_Region variable.

  • Apply binning to the Price variable and normalization to the Geographic_Region variable.

  • Apply a Box-Cox transformation to the Price variable and one-hot encoding to the Geographic_Region variable.

  • Apply a log transformation to the Price variable and label encoding to the Geographic_Region variable.

CompTIA DataX DY0-001 (V1)
Modeling, Analysis, and Outcomes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot