CompTIA DataX DY0-001 (V1) Practice Question

A data science team has developed a sophisticated classification model to predict equipment failure in a manufacturing plant. The model was trained on a dataset comprising 95% of the available historical data, achieving an accuracy of 98.5%. However, when the model was evaluated against the remaining 5% holdout dataset, its accuracy dropped to 72%. Further testing on live production data showed similarly degraded performance.

Which of the following is the most critical conclusion the team should draw from these results?

  • The significant drop in accuracy is primarily caused by concept drift between the training and holdout datasets.

  • The model's high in-sample performance is not indicative of its poor out-of-sample generalization.

  • The model is underfitting due to insufficient feature engineering in the training phase.

  • The in-sample evaluation is the more reliable metric, suggesting the out-of-sample data is anomalous or corrupted.

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot