CompTIA Data+ DA0-002 (V2) Practice Question

You are preparing a set of numeric customer-behavior features for a k-means clustering model. One of the variables, lifetime_value, is highly right-skewed and contains several extreme outliers that would dominate Euclidean distance calculations if left untreated. You want each feature to contribute proportionally to the distance metric without letting those few large values distort the scale. Which preprocessing technique should you apply before running the clustering algorithm?

Apply a robust scaler that centers on the median and scales by the interquartile range.
Apply min-max scaling to force every feature into a 0-1 range.
Apply Z-score standardization so each feature has mean 0 and standard deviation 1.
Apply a logarithmic transformation followed by min-max scaling.
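
To compare how the four options behave, the following is a minimal sketch in Python using only NumPy. The lifetime_value sample and the helper function names are hypothetical and chosen purely for illustration; they are not part of the original question. The sketch measures how much spread the typical (non-outlier) customers retain after each transformation.

```python
import numpy as np

# Hypothetical sample: seven typical customers plus two extreme outliers.
lifetime_value = np.array(
    [120.0, 150.0, 180.0, 200.0, 220.0, 250.0, 300.0, 9000.0, 15000.0]
)

def robust_scale(x):
    """Center on the median and divide by the interquartile range (Q3 - Q1)."""
    q1, median, q3 = np.percentile(x, [25, 50, 75])
    return (x - median) / (q3 - q1)

def min_max_scale(x):
    """Map the minimum to 0 and the maximum to 1."""
    return (x - x.min()) / (x.max() - x.min())

def z_score(x):
    """Subtract the mean and divide by the (population) standard deviation."""
    return (x - x.mean()) / x.std()

def log_min_max(x):
    """Apply log(1 + x), then min-max scale the result."""
    return min_max_scale(np.log1p(x))

def typical_spread(scaled):
    """Range covered by the seven non-outlier values after scaling."""
    typical = scaled[:7]
    return typical.max() - typical.min()

for name, fn in [
    ("robust", robust_scale),
    ("min-max", min_max_scale),
    ("z-score", z_score),
    ("log + min-max", log_min_max),
]:
    print(f"{name:>14}: typical spread = {typical_spread(fn(lifetime_value)):.4f}")
```

In this sample, the median and interquartile range are computed from the middle of the distribution, so the two outliers do not change the scale applied to the typical customers. The minimum, maximum, mean, and standard deviation are all pulled by the outliers, which compresses the typical values into a narrow band. scikit-learn's RobustScaler uses the same median and IQR defaults, if that library is available.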

CompTIA Data+ DA0-002 (V2)

Data Acquisition and Preparation



