CompTIA DataX DY0-001 (V1) Practice Question

In a neural text-classification pipeline, tokens are first converted to integer IDs with a string-indexing layer. The engineer reserves index 0 for a padding token, index 1 for an out-of-vocabulary (OOV) token, and assigns the remaining vocabulary sequentially from index 2 upward. What is the main advantage of reserving these low consecutive indices for the two special tokens when the data later flows into an Embedding layer?

  • It lets the text-vectorization step compute TF-IDF weights for the special tokens automatically, eliminating manual weighting.

  • It enables the framework to mask or zero-out the padding and OOV rows efficiently, preventing those tokens from affecting gradients during training or predictions.

  • It reduces the dimensionality of the embedding space by two, which significantly lowers memory consumption.

  • It forces the model to treat padding and OOV tokens as high-frequency words, accelerating convergence on small datasets.

CompTIA DataX DY0-001 (V1)
Specialized Applications of Data Science
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot