CompTIA DataX DY0-001 (V1) Practice Question

An engineering team is clustering a year's worth of telemetry from industrial boilers. Each record contains temperature (°C), pressure (kPa), and relative humidity (%). A preliminary K-means (Euclidean distance) run produces clusters that vary almost entirely by pressure, because that feature's numeric range dwarfs the others. The domain experts want to:

  1. Give every feature comparable influence in the distance calculations.
  2. Avoid compressing the effect of rare extreme readings.
  3. Convert the final cluster centroids back to the original physical units once the model is trained.

Which single preprocessing step best satisfies all three requirements before fitting the model?

  • Standardize each feature using z-score scaling (subtract the mean and divide by the standard deviation).

  • Discretize each numerical feature into decile bins and one-hot encode the resulting categories.

  • Log-transform the pressure and temperature features while leaving humidity unchanged.

  • Apply min-max normalization to map every feature linearly into the range [0, 1].
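
For reference, the first option can be sketched in a few lines of standard-library Python. The readings below are invented for illustration, and the helper names (standardize, to_original_units) are hypothetical. The sketch only shows the mechanics the scenario describes: pressure dominating raw Euclidean distance, per-feature z-scores, and mapping a point in scaled space back to °C, kPa, and %. The "centroid" here is a stand-in (the mean of two scaled points), not the output of a real K-means run.

```python
import math
import statistics

# Hypothetical boiler readings: (temperature °C, pressure kPa, relative humidity %)
readings = [
    (180.0, 1200.0, 35.0),
    (185.0, 1500.0, 40.0),
    (190.0, 900.0, 38.0),
    (175.0, 2400.0, 42.0),
]

# Per-feature mean and population standard deviation
columns = list(zip(*readings))
means = [statistics.fmean(col) for col in columns]
stds = [statistics.pstdev(col) for col in columns]


def standardize(row):
    """Z-score each feature: subtract the mean, divide by the standard deviation."""
    return tuple((x - m) / s for x, m, s in zip(row, means, stds))


def to_original_units(z_row):
    """Invert the z-score transform to recover physical units."""
    return tuple(z * s + m for z, m, s in zip(z_row, means, stds))


scaled = [standardize(row) for row in readings]

# Share of the squared raw distance between the first two readings due to pressure
raw_sq = [(a - b) ** 2 for a, b in zip(readings[0], readings[1])]
pressure_share_raw = raw_sq[1] / sum(raw_sq)

# Same share after scaling
scaled_sq = [(a - b) ** 2 for a, b in zip(scaled[0], scaled[1])]
pressure_share_scaled = scaled_sq[1] / sum(scaled_sq)

# Stand-in centroid in scaled space (mean of the first two scaled points),
# then mapped back to the original units
centroid_z = tuple(statistics.fmean(vals) for vals in zip(scaled[0], scaled[1]))
centroid = to_original_units(centroid_z)

print(f"pressure share of squared distance, raw:    {pressure_share_raw:.3f}")
print(f"pressure share of squared distance, scaled: {pressure_share_scaled:.3f}")
print("centroid in original units:", centroid)
```

Because the transform is linear, the inverse step recovers the stand-in centroid exactly as the mean of the two original readings. The same stored means and standard deviations would be applied to centroids produced by an actual clustering run.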

Modeling, Analysis, and Outcomes