CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is developing an ordinary least-squares model to predict daily revenue, a strictly positive continuous variable. The revenue distribution is highly right-skewed and, after an initial linear fit, the residual-versus-fitted plot shows a wedge-shaped pattern that widens as fitted values increase, indicating heteroscedasticity. The scientist needs a single data transformation on the response variable that (1) can stabilize the variance and approximate normality and (2) lets the optimal transformation be chosen from a continuum of power functions using maximum-likelihood estimation. Which transformation should be applied before refitting the model?

Apply a Box-Cox power transformation to the revenue variable.
Take the natural logarithm (ln) of the revenue variable.
Standardize the revenue variable with a z-score (mean 0, standard deviation 1).
Rescale the revenue variable to the 0-1 range with min-max normalization.
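
For reference, the "continuum of power functions chosen by maximum-likelihood estimation" described in the question can be sketched in a few lines. This is a minimal illustration, not a production implementation. It assumes an intercept-only model (in a real regression the variance term would come from the residuals of the refitted model), a simple grid search over lambda, and simulated lognormal "revenue" data. The function names `box_cox`, `profile_log_likelihood`, and `estimate_lambda` are hypothetical helpers introduced here.

```python
import math
import random


def box_cox(y, lam):
    """Box-Cox power transform for strictly positive data.

    (y**lam - 1) / lam for lam != 0, and ln(y) in the limit lam == 0.
    """
    if any(v <= 0 for v in y):
        raise ValueError("Box-Cox requires strictly positive data")
    if abs(lam) < 1e-12:
        return [math.log(v) for v in y]
    return [(v ** lam - 1.0) / lam for v in y]


def profile_log_likelihood(y, lam):
    """Profile log-likelihood of lam (up to a constant), intercept-only model."""
    n = len(y)
    z = box_cox(y, lam)
    mean_z = sum(z) / n
    var_z = sum((v - mean_z) ** 2 for v in z) / n
    # Variance term plus the Jacobian of the transformation.
    return -0.5 * n * math.log(var_z) + (lam - 1.0) * sum(math.log(v) for v in y)


def estimate_lambda(y, lo=-2.0, hi=2.0, steps=401):
    """Pick the lam on an evenly spaced grid that maximizes the likelihood."""
    grid = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return max(grid, key=lambda lam: profile_log_likelihood(y, lam))


# Simulated right-skewed, strictly positive "revenue" (an assumption for the demo).
random.seed(0)
revenue = [math.exp(random.gauss(8.0, 0.9)) for _ in range(2000)]

lam_hat = estimate_lambda(revenue)
print(f"estimated lambda: {lam_hat:.2f}")
```

Because the simulated data are lognormal, the estimated lambda should land close to 0, which is the case where Box-Cox reduces to the natural logarithm. On real revenue data the estimate may differ, which is the point of letting the data choose the power rather than fixing it in advance.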

CompTIA DataX DY0-001 (V1)

Modeling, Analysis, and Outcomes


What is heteroscedasticity and why is it problematic in regression models?

Why is the Box-Cox transformation preferred over a fixed transformation like natural logarithm?

What are the limitations of z-score standardization and min-max normalization in addressing heteroscedasticity?
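
As a rough illustration of the last question, the sketch below compares sample skewness before and after each rescaling. It assumes simulated lognormal data and uses skewness only as a simple proxy for distribution shape. The helper names `skewness`, `z_score`, and `min_max` are hypothetical. Z-score standardization and min-max normalization are linear maps (shift and scale), so they leave the shape of the distribution, and therefore the relative spread pattern of the residuals, unchanged.

```python
import math
import random


def skewness(x):
    """Sample skewness: third central moment over variance**1.5."""
    n = len(x)
    m = sum(x) / n
    m2 = sum((v - m) ** 2 for v in x) / n
    m3 = sum((v - m) ** 3 for v in x) / n
    return m3 / m2 ** 1.5


def z_score(x):
    """Linear rescaling to mean 0, standard deviation 1."""
    n = len(x)
    m = sum(x) / n
    sd = math.sqrt(sum((v - m) ** 2 for v in x) / n)
    return [(v - m) / sd for v in x]


def min_max(x):
    """Linear rescaling to the 0-1 range."""
    lo, hi = min(x), max(x)
    return [(v - lo) / (hi - lo) for v in x]


# Simulated right-skewed, strictly positive data (an assumption for the demo).
random.seed(1)
revenue = [math.exp(random.gauss(8.0, 0.9)) for _ in range(2000)]

skew_raw = skewness(revenue)
skew_z = skewness(z_score(revenue))
skew_mm = skewness(min_max(revenue))
skew_log = skewness([math.log(v) for v in revenue])

print(f"raw: {skew_raw:.3f}  z-score: {skew_z:.3f}  "
      f"min-max: {skew_mm:.3f}  log: {skew_log:.3f}")
```

The raw, z-scored, and min-max values share the same skewness up to floating-point error, while the nonlinear log transform pulls it toward zero. Only a nonlinear transformation of the response can change the shape in this way.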
