CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is using the k-means algorithm for customer segmentation. After visualizing the results, they observe that the algorithm fails to correctly partition several distinct, elongated customer groups, merging them into single, large clusters. What is the most likely underlying reason for this suboptimal clustering performance?

The value of 'k' was incorrectly chosen, and a different number of clusters would resolve the issue.
The initial placement of centroids was suboptimal, leading to convergence on a local minimum.
K-means inherently assumes that clusters are convex and isotropic, making it struggle with elongated or irregularly shaped clusters.
The algorithm is struggling due to unscaled numerical features and the presence of categorical data.

Report Issue

Answer Description

The correct answer is that k-means inherently assumes clusters are convex and isotropic. K-means clustering measures distance to a central point (the centroid) and aims to minimize the within-cluster sum of squares. This process naturally forms spherical or convex cluster shapes. When faced with data where the true underlying clusters are elongated or have irregular, non-convex shapes, k-means will often fail to identify them correctly, as described in the scenario.

Incorrectly choosing the value of 'k' would result in either splitting natural clusters or merging distinct ones, but it would not fundamentally solve the algorithm's inability to model non-spherical shapes. While suboptimal centroid initialization can lead to poor results by converging on a local minimum, this is a separate issue from the algorithm's intrinsic geometric bias, which is the direct cause for failing on elongated shapes regardless of the starting point. Similarly, the presence of unscaled features or categorical data are critical preprocessing concerns for k-means, but these issues would manifest differently, such as features with larger scales dominating the distance calculations, rather than specifically failing to model an elongated geometry.

Ask Bash

Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.

Why does k-means struggle with elongated or irregularly shaped clusters?

Open an interactive chat with Bash

What does 'convex and isotropic' mean in the context of k-means clustering?

Open an interactive chat with Bash

What other clustering methods can handle elongated or irregularly shaped clusters?

Open an interactive chat with Bash

CompTIA DataX DY0-001 (V1)

Machine Learning

Your Score:

SAVE $64

CompTIA DataX Voucher

v1 / DY0-001

$529.00 $465.00

Bash, the Crucial Exams Chat Bot

AI Bot

CompTIA DataX DY0-001 (V1) Practice Question

Answer Description

Ask Bash

Why does k-means struggle with elongated or irregularly shaped clusters?

What does 'convex and isotropic' mean in the context of k-means clustering?

What other clustering methods can handle elongated or irregularly shaped clusters?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams