CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is analyzing a univariate time-series that contains 120 monthly observations of an online retailer's gross revenue (January 2015 - December 2024). A visualization indicates that both the mean and the variance grow over time. To verify this, the analyst runs two unit-root tests:

  • Augmented Dickey-Fuller (ADF): test statistic = -2.10, p-value = 0.23
  • KPSS (trend): test statistic = 0.95, p-value = 0.01 (critical value = 0.463)

The analyst plans to fit an ordinary least-squares (OLS) regression to the level data for forecasting.

Which statement identifies the most critical data issue and a suitable first step to address it before modeling?

  • Sparse data leading to overfitting; aggregate the monthly data to quarterly frequency and impute missing observations.

  • Non-stationarity caused by a stochastic trend and changing variance; apply a first-order differencing (and/or a variance-stabilizing transformation) to make the series stationary before modeling.

  • Multicollinearity among explanatory variables; compute variance inflation factors and drop highly correlated predictors.

  • Non-linearity in the relationship between revenue and time; replace the linear regression with a higher-order polynomial without altering the data.

CompTIA DataX DY0-001 (V1)
Modeling, Analysis, and Outcomes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot