CompTIA DataX DY0-001 (V1) Practice Question

You are integrating a k-nearest neighbors-based anomaly detector that flags points whose mean distance to their k nearest neighbors is unusually large. The raw data consist of 2 million rows with 200 numeric features. A prototype that uses brute-force neighbor search on the original features exceeds available memory and returns answers in minutes.

Which modification is most likely to reduce both memory usage and query latency without sacrificing the detector's ability to isolate outliers?

  • Build a KD-tree index on the original 200-dimensional features.

  • Apply PCA to reduce dimensionality, then build a ball-tree index on the reduced space.

  • Keep brute-force search but lower k from 20 to 5.

  • Switch the distance metric to cosine similarity and keep brute-force search.

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot