CompTIA DataX DY0-001 (V1) Practice Question

A data science team trains an XGBoost model to predict loan default. The library's default feature-importance plot, which uses the gain metric, ranks the variable Customer_ID highest, while Age appears near the bottom. When the team computes permutation importance on a held-out validation set, Age rises to the top and Customer_ID drops sharply. Which explanation best accounts for the conflicting importance rankings?

  • The conflict arises because permutation importance for classification relies on the Gini impurity formula used in regression trees, which is incompatible with XGBoost models.

  • Gain importance tends to inflate the score of features that have many unique values or potential split points, such as an identifier; permutation importance measures the drop in validation performance and is therefore much less affected by this cardinality bias.

  • Gain importance ignores how frequently a feature is selected for splitting, so variables like Age that create large gains only a few times are hidden from the ranking.

  • Permutation importance is calculated only on the training data, so it undervalues features that generalize well and makes Customer_ID look weaker than it really is.
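The cardinality bias described above can be demonstrated empirically. Below is a minimal sketch using scikit-learn's `RandomForestClassifier` as a stand-in for XGBoost, since its impurity-based `feature_importances_` exhibit the same tendency to over-credit high-cardinality features; the synthetic data (an uninformative unique ID plus an informative `Age` column) is invented for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 2000
age = rng.integers(18, 70, size=n)
customer_id = np.arange(n)                      # unique per row: no real signal
noise = (rng.random(n) < 0.1).astype(int)       # flip 10% of labels
y = ((age < 30).astype(int) + noise) % 2        # default risk driven by Age only

X = np.column_stack([customer_id, age])
X_tr, X_va, y_tr, y_va = train_test_split(X, y, random_state=0)

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_tr, y_tr)

# Impurity-based importance (analogous bias to gain): the high-cardinality
# ID column offers many split points and can absorb much of the score.
print(dict(zip(["Customer_ID", "Age"], model.feature_importances_.round(3))))

# Permutation importance on held-out data: shuffling Age hurts validation
# accuracy noticeably, while shuffling Customer_ID barely matters.
perm = permutation_importance(model, X_va, y_va, n_repeats=10, random_state=0)
print(dict(zip(["Customer_ID", "Age"], perm.importances_mean.round(3))))
```

Running this typically shows the ID column claiming a sizeable share of the impurity-based score while its permutation importance on the validation set stays near zero, mirroring the scenario in the question.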

Domain: Machine Learning