CompTIA DataX DY0-001 (V1) Practice Question

You are analyzing a behavioral-telemetry data set in which each user session is encoded as a 25,000-dimensional TF-IDF vector. After sampling 1,000 sessions, you compute the Euclidean distance from every vector to its nearest neighbor and to its farthest neighbor. The mean ratio of (distance to nearest neighbor) / (distance to farthest neighbor) is 0.98, indicating that the two distances are almost identical. Which phenomenon in high-dimensional geometry most directly explains why the nearest and farthest neighbors have nearly the same distance?

  • A heavy-tailed variance distribution created hub points that pulled average distances toward the mean.

  • All features were scaled improperly, adding the same constant to every distance calculation.

  • Only linear growth in sample size versus dimensionality obscured the neighborhood structure.

  • Distance concentration caused by the curse of dimensionality makes all pairs of points appear almost equidistant.

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot