CompTIA DataX DY0-001 (V1) Practice Question

A payments-security team is clustering 100 000 transaction embeddings, each represented by 128 continuous features. They believe fraudulent user rings form clusters that are highly irregular in shape, vary greatly in size, and are surrounded by many benign transactions that should be labeled as noise. Because the true number of fraud rings is unknown, the team needs an algorithm that can discover an appropriate number of clusters on its own. For scalability, they will accelerate neighborhood queries with a k-d tree and aim for an overall runtime close to O(n log n). Which unsupervised technique best satisfies these requirements?

  • Agglomerative hierarchical clustering using Ward linkage and a dendrogram cutoff

  • Density-Based Spatial Clustering of Applications with Noise (DBSCAN) with ε and minPts tuned on a validation subset

  • k-means clustering with the elbow method to determine the value of k

  • Expectation-Maximization Gaussian mixture modeling with Bayesian information criterion (BIC) to select the number of components

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot