A machine learning engineer is tasked with building a classification model and estimating its generalization error. They use a single loop of k-fold cross-validation: in each fold, a grid search trains candidate models on the training partition, the hyperparameters that score best on that fold's validation set are selected, and the model is then scored with those same hyperparameters on that same validation set. The final performance is reported as the average score across all folds. The model performs exceptionally well during this cross-validation procedure but fails to generalize to new production data. Which of the following is the most likely cause of this discrepancy?
Standard k-fold cross-validation is only appropriate for regression models, and a stratified approach should have been used for this classification task.
The process causes information leakage, leading to an optimistic performance estimate because the validation data influences both hyperparameter selection and performance evaluation.
The model is underfitting due to the reduced size of the training partition created in each fold of the cross-validation process.
The grid search for hyperparameter tuning is computationally inefficient and likely resulted in a globally suboptimal model.
The correct answer identifies the fundamental methodological flaw in the described procedure. When the same validation data is used both to select the best-performing hyperparameters (by seeing which candidates score highest on it) and to report the model's performance, information about the validation set leaks into the model-selection process. The result is an optimistically biased performance estimate: the hyperparameters are effectively overfit to the specific folds of the dataset. The proper way to obtain an unbiased estimate of generalization performance while also tuning hyperparameters is nested cross-validation, in which an outer loop assesses performance and an inner loop, with its own data splits, performs the hyperparameter tuning.
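The following minimal sketch contrasts the two procedures using scikit-learn. The toy dataset, SVC estimator, and parameter grid are illustrative assumptions, not details from the question; the point is the structure: the biased estimate scores on the same folds that picked the hyperparameters, while the nested estimate scores only on outer folds the inner grid search never saw.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score
from sklearn.svm import SVC

# Hypothetical toy data and hyperparameter grid for illustration only.
X, y = make_classification(n_samples=300, random_state=0)
param_grid = {"C": [0.1, 1, 10], "gamma": [0.01, 0.1, 1]}

inner_cv = KFold(n_splits=5, shuffle=True, random_state=1)
outer_cv = KFold(n_splits=5, shuffle=True, random_state=2)

search = GridSearchCV(SVC(), param_grid, cv=inner_cv)

# Flawed single loop: the same folds both choose the hyperparameters and
# report the score, so best_score_ is optimistically biased.
search.fit(X, y)
print("single-loop (biased) estimate:", search.best_score_)

# Nested CV: each outer fold scores a model whose hyperparameters were
# tuned only on that fold's inner splits, so no validation data leaks
# into model selection.
nested_scores = cross_val_score(search, X, y, cv=outer_cv)
print("nested (unbiased) estimate:", nested_scores.mean())
```

On real tasks the single-loop estimate typically comes out higher than the nested one, and the nested estimate is the one that tracks production performance.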
The model is not underfitting; the scenario states that it performed exceptionally well in cross-validation, which is the opposite of underfitting. While grid search can be computationally inefficient, inefficiency does not explain the gap between high cross-validation scores and poor production performance. Finally, although stratified k-fold is often preferred for classification because it preserves class proportions in each fold, standard k-fold is still valid, and its use would not by itself produce such a large optimistic bias.
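As an aside on that last point, this small sketch (with a hypothetical, deliberately imbalanced toy dataset) shows what stratification buys you: StratifiedKFold keeps the class ratio roughly constant across test folds, whereas plain KFold on sorted data can leave most folds with no minority-class samples at all.

```python
import numpy as np
from sklearn.model_selection import KFold, StratifiedKFold

# Assumed toy data: 10% positive class, labels sorted for illustration.
y = np.array([0] * 90 + [1] * 10)
X = np.zeros((100, 1))  # features are irrelevant for this comparison

for name, cv in [("KFold", KFold(5)), ("StratifiedKFold", StratifiedKFold(5))]:
    # Fraction of positive samples in each test fold.
    ratios = [y[test].mean() for _, test in cv.split(X, y)]
    print(name, "positive-class ratio per test fold:", ratios)
```

Here plain KFold yields four folds with no positives and one fold that is half positives, while StratifiedKFold holds every fold at the overall 10% rate.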