CompTIA DataX DY0-001 (V1) Practice Question

Your analytics team is building a daily cost-forecasting model for an international e-commerce company. You receive two data sets:

  • Orders.csv - purchase orders time-stamped at 00:00 for each calendar date.
  • FX_rates.csv - daily closing foreign-exchange rates recorded at 17:00 New York time and stored under the same calendar date.

After an inner join on calendar date, you train a regression model that predicts a day's average order cost from the FX rate and other features. During testing, the model's accuracy is unrealistically high, and an audit shows that the FX rate being used actually reflects market conditions after the orders were placed.

Which data issue is present, and what should be the first corrective action during exploratory data analysis (EDA)?

  • Lagged observations are present; shift the FX_rate series back by one day to align timestamps before joining.

  • Seasonality is present; perform STL decomposition to remove periodic components from the FX_rate series.

  • The FX_rate variable is non-linear; apply a logarithmic transformation to linearize its relationship with cost.

  • Multicollinearity exists among predictors; drop highly correlated currency features based on VIF analysis.

CompTIA DataX DY0-001 (V1)
Modeling, Analysis, and Outcomes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot