You are integrating a k-nearest neighbors-based anomaly detector that flags points whose mean distance to their k nearest neighbors is unusually large. The raw data consist of 2 million rows with 200 numeric features. A prototype that uses brute-force neighbor search on the original features exhausts available memory and takes minutes to answer queries.
Which modification is most likely to reduce both memory usage and query latency without sacrificing the detector's ability to isolate outliers?
Build a KD-tree index on the original 200-dimensional features.
Switch the distance metric to cosine similarity and keep brute-force search.
Keep brute-force search but lower k from 20 to 5.
Apply PCA to reduce dimensionality, then build a ball-tree index on the reduced space.
Principal component analysis (PCA) can project the 200-dimensional data onto a lower-dimensional subspace that captures most of the variance. Working in that subspace shortens each feature vector, so the distance calculations and the tree index require less memory and fewer floating-point operations. After the reduction, a ball-tree index is preferable to a KD-tree because ball trees partition points with hyperspheres and degrade less sharply as dimensionality grows. Building a KD-tree index on the original 200-dimensional features would be inefficient: KD-trees are generally effective only when the dimensionality is roughly below 20, and on 200 dimensions their query time approaches that of brute force. Simply lowering k or switching the distance metric while keeping brute-force search leaves the core performance problem unsolved, because every query must still compute distances to all 2 million points in the full 200-dimensional space. These latter changes have a marginal impact on memory and only a modest impact on runtime, and they may also hurt anomaly-detection fidelity.
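The recommended pipeline can be sketched with scikit-learn, assuming it is available; the dataset here is a small synthetic stand-in (5,000 rows instead of 2 million) with a few injected outliers, so sizes and the 95% variance threshold are illustrative choices, not values from the question:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 200))  # stand-in for the 2M x 200 dataset
X[:10] += 8.0                     # inject 10 obvious outliers

# 1. Project onto a subspace capturing ~95% of the variance.
pca = PCA(n_components=0.95)
X_red = pca.fit_transform(X)

# 2. Build a ball-tree index on the reduced features.
k = 20
nn = NearestNeighbors(n_neighbors=k + 1, algorithm="ball_tree")
nn.fit(X_red)

# 3. Anomaly score = mean distance to the k nearest neighbors
#    (drop the first column, which is each point's zero self-distance).
dist, _ = nn.kneighbors(X_red)
scores = dist[:, 1:].mean(axis=1)

# The injected outliers should receive the largest scores.
top = np.argsort(scores)[-10:]
print(sorted(top.tolist()))
```

Passing a float between 0 and 1 as `n_components` tells PCA to keep just enough components to explain that fraction of the variance, which is usually far fewer than 200 and is what shrinks both the index and each distance computation.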