CompTIA DataX DY0-001 (V1) Practice Question

A payments-security team is clustering 100 000 transaction embeddings, each represented by 128 continuous features. They believe fraudulent user rings form clusters that are highly irregular in shape, vary greatly in size, and are surrounded by many benign transactions that should be labeled as noise. Because the true number of fraud rings is unknown, the team needs an algorithm that can discover an appropriate number of clusters on its own. For scalability, they will accelerate neighborhood queries with a k-d tree and aim for an overall runtime close to O(n log n). Which unsupervised technique best satisfies these requirements?

Expectation-Maximization Gaussian mixture modeling with Bayesian information criterion (BIC) to select the number of components
k-means clustering with the elbow method to determine the value of k
Agglomerative hierarchical clustering using Ward linkage and a dendrogram cutoff
Density-Based Spatial Clustering of Applications with Noise (DBSCAN) with ε and minPts tuned on a validation subset
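
To make the density-based option concrete, here is a minimal pure-Python sketch of the DBSCAN procedure. It is illustrative only: the function names `dbscan` and `region_query`, the toy 2-D points, and the chosen `eps` and `min_pts` values are assumptions, not part of the question. The neighborhood query here is a brute-force scan, which makes the whole run O(n²); a spatial index such as a k-d tree would replace `region_query` to approach the runtime the scenario targets.

```python
from math import dist

NOISE = -1        # label for points that belong to no cluster
UNVISITED = None  # label for points not yet processed


def region_query(points, i, eps):
    """Return indices of all points within eps of points[i] (brute force)."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or NOISE (-1)."""
    labels = [UNVISITED] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            # Not a core point; may later be claimed as a border point.
            labels[i] = NOISE
            continue
        # Core point found: start a new cluster and expand it.
        cluster += 1
        labels[i] = cluster
        queue = list(neighbors)
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # former noise becomes a border point
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                queue.extend(j_neighbors)  # j is also a core point
    return labels


# Toy data (hypothetical): two dense groups and one isolated point.
points = [
    (0, 0), (0, 1), (1, 0), (1, 1),
    (10, 10), (10, 11), (11, 10), (11, 11),
    (50, 50),
]
labels = dbscan(points, eps=1.5, min_pts=3)
print(labels)  # [0, 0, 0, 0, 1, 1, 1, 1, -1]
```

Note that the number of clusters is never passed in: it falls out of the density parameters, and the isolated point receives the noise label instead of being forced into a cluster.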

CompTIA DataX DY0-001 (V1)

Machine Learning
