CompTIA DataX DY0-001 (V1) Practice Question

Your team is building a real-time recommendation service that must match a shopper's free-text query against more than 10 million product descriptions in under 50 ms. The catalog can be processed offline, and several gigabytes of pre-computed representations may be kept in memory, but the online request path should perform at most one neural-network forward pass per query. Relevance should be judged by semantic rather than purely lexical similarity. Which modelling strategy best satisfies the latency, scale, and semantic-matching requirements?

Compute the Levenshtein edit distance between the query string and every item title, selecting the smallest distances.
Run a transformer cross-encoder that concatenates the query with every candidate description and scores each pair on-the-fly.
Represent each text as a sparse TF-IDF vector and rank candidates with BM25 scoring over an inverted index.
Pre-encode all item descriptions with a Siamese/bi-encoder transformer, store the vectors in an ANN index, and encode the query once at inference to retrieve nearest neighbours.

CompTIA DataX DY0-001 (V1)

Specialized Applications of Data Science

Your Score:

SAVE $64

CompTIA DataX Voucher

v1 / DY0-001

$529.00 $465.00

Bash, the Crucial Exams Chat Bot

AI Bot

CompTIA DataX DY0-001 (V1) Practice Question

Answer Description

Ask Bash

What is an ANN index, and why is it useful in this modeling strategy?

What is the difference between a dual-encoder and a cross-encoder?

Why don’t traditional methods like BM25 or Levenshtein satisfy the semantic matching requirement?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

CompTIA DataX DY0-001 (V1) Practice Question

Report Issue

Answer Description

Ask Bash

What is an ANN index, and why is it useful in this modeling strategy?

What is the difference between a dual-encoder and a cross-encoder?

Why don’t traditional methods like BM25 or Levenshtein satisfy the semantic matching requirement?

Report Issue