CompTIA DataX DY0-001 (V1) Practice Question

A fashion e-commerce company wants to roll out multimodal search so that shoppers can type a natural-language query such as "red leather ankle boots" and instantly retrieve the most relevant product images from a catalog of 50 million pictures. Design constraints include:

  • End-to-end latency must stay below 100 ms.
  • Queries are open-ended, not limited to a fixed set of classes.
  • The image catalog will be stored as dense vectors in an approximate nearest-neighbor (ANN) index. Which modeling strategy should the data-science team choose to satisfy all of these requirements while preserving strong semantic alignment between text and images?
  • Fine-tune a large language model on product captions only and use its [CLS] token embedding as the representation for both queries and images.

  • Train a contrastive dual-encoder (two-tower) model on paired caption-image data so that the text and image encoders produce vectors in the same embedding space, then pre-compute and ANN-index the image embeddings.

  • Deploy separate image and text classifiers and average their softmax probability outputs at query time (late fusion) to rank results.

  • Generate synthetic captions for every product image with an image-captioning model and index those captions with a TF-IDF bag-of-words search engine.

CompTIA DataX DY0-001 (V1)
Specialized Applications of Data Science
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot