CompTIA DataX DY0-001 (V1) Practice Question

A data-science team is segmenting 10 000 customers represented by 85-dimensional feature vectors. They run k-means for k = 2 through 10 and obtain the metrics below (inertia is the within-cluster sum of squared errors):

k2345678910
Inertia (×10⁔)7.95.24.13.63.33.02.82.62.5
Avg. Silhouette0.470.620.590.560.530.500.490.480.47

Inertia shows an elbow at k = 4, whereas the average silhouette width peaks at k = 3 and then declines. Management wants clusters that are internally coherent and well separated while avoiding unnecessary splits. Which value of k should be chosen based on these results and accepted best practices?

  • k = 2 - prefer fewer clusters to minimize the risk of over-fitting the data.

  • k = 4 - pick the elbow point where inertia first shows diminishing returns.

  • k = 10 - use the value that gives the smallest possible inertia.

  • k = 3 - select the cluster count that maximizes the average silhouette width.

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot