Your team is building a real-time recommendation service that must match a shopper's free-text query against more than 10 million product descriptions in under 50 ms. The catalog can be processed offline, and several gigabytes of pre-computed representations may be kept in memory, but the online request path should perform at most one neural-network forward pass per query. Relevance should be judged by semantic rather than purely lexical similarity. Which modelling strategy best satisfies the latency, scale, and semantic-matching requirements?
Pre-encode all item descriptions with a Siamese/bi-encoder transformer, store the vectors in an ANN index, and encode the query once at inference to retrieve nearest neighbours.
Run a transformer cross-encoder that concatenates the query with every candidate description and scores each pair on the fly.
Represent each text as a sparse TF-IDF vector and rank candidates with BM25 scoring over an inverted index.
Compute the Levenshtein edit distance between the query string and every item title, selecting the items with the smallest distances.
A dual-encoder (also known as a bi-encoder or Siamese network) is the best approach. Each catalog item is embedded offline, and the resulting vectors are stored in an approximate-nearest-neighbour (ANN) index. At runtime, the query is encoded exactly once, and a fast vector search retrieves the closest items, giving semantic matching in sub-linear (often near-logarithmic) search time and keeping the request path within the 50 ms budget. A cross-encoder would require a separate forward pass for every query-candidate pair, making it far too slow for a catalog of 10 million items. BM25 over an inverted index is computationally efficient, but like character-level edit distance (Levenshtein), it relies on lexical overlap of surface tokens or characters rather than semantic meaning, so both fail the semantic-relevance requirement. The dual-encoder with an ANN index is the only strategy that satisfies all the stated constraints.
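A minimal sketch of this pattern, assuming the sentence-transformers and faiss-cpu packages; the model name, three-item catalog, and query are illustrative placeholders standing in for a production bi-encoder and the full 10M-item catalog:

```python
import faiss
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # any bi-encoder checkpoint works

# --- Offline: encode every item once and build the ANN index ---
catalog = [
    "waterproof hiking boots for rough terrain",
    "insulated stainless steel water bottle",
    "wireless noise-cancelling headphones",
]  # placeholder for the 10M+ product descriptions
item_vecs = model.encode(catalog, normalize_embeddings=True)  # (N, d) float32

# HNSW graph index; with unit-normalised vectors, L2 ranking matches cosine.
index = faiss.IndexHNSWFlat(item_vecs.shape[1], 32)
index.add(item_vecs)

# --- Online: exactly one encoder forward pass, then a fast vector search ---
query_vec = model.encode(["bottle that keeps drinks cold"], normalize_embeddings=True)
distances, ids = index.search(query_vec, 3)  # top-3 nearest neighbours
print([catalog[i] for i in ids[0]])  # the insulated bottle should rank first
```

Note that the query shares almost no tokens with "insulated stainless steel water bottle", which is exactly where semantic matching beats BM25 or edit distance. In production, the encoding and index-building step would run as an offline batch job, with the resulting few-gigabyte vector store held in memory, as the question's constraints allow.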