CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is performing exploratory data analysis (EDA) on a dataset for a real estate valuation model. The dataset includes a feature named view_quality, which is rated by professional assessors on a custom scale: "No View", "Partial Obstruction", "Standard", "Good", and "Excellent". The team is debating the most appropriate way to handle this feature for a multiple linear regression model versus a gradient boosting machine (GBM).

Which of the following statements most accurately describes the view_quality feature and the implications for its use in modeling?

view_quality is a continuous variable that has been binned. For use in a linear regression model, the mid-points of the implied continuous range for each category should be calculated and used as the feature value. For a GBM, this feature can be used directly.
view_quality is an ordinal variable. For a linear regression model, treating it as a continuous integer (e.g., 0-4) assumes equidistant spacing between categories, which is likely false and could violate model assumptions. For a GBM, integer encoding is generally effective as the model can create splits at any point along the ordered values.
view_quality is a nominal variable. It must be one-hot encoded for both linear regression and GBMs to avoid introducing a false sense of order, which would negatively impact the performance of both model types.
view_quality is a discrete variable. It can be used directly in a linear regression model without transformation because the model will interpret the integer values as distinct points. For a GBM, it should be treated as a categorical feature to allow for optimal splits.

CompTIA DataX DY0-001 (V1)

Modeling, Analysis, and Outcomes

Your Score:

SAVE $64

CompTIA DataX Voucher

v1 / DY0-001

$529.00 $465.00

Bash, the Crucial Exams Chat Bot

AI Bot

CompTIA DataX DY0-001 (V1) Practice Question

Answer Description

Ask Bash

What is an ordinal variable?

Why is integer encoding problematic for ordinal variables in linear regression?

How do tree-based models like GBMs handle ordinal variables?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

CompTIA DataX DY0-001 (V1) Practice Question

Report Issue

Answer Description

Ask Bash

What is an ordinal variable?

Why is integer encoding problematic for ordinal variables in linear regression?

How do tree-based models like GBMs handle ordinal variables?

Report Issue