CompTIA DataX DY0-001 (V1) Practice Question

A machine learning engineer is developing a predictive model for housing prices using a neural network. The dataset includes a zip_code feature with over 30,000 unique values. The engineer is concerned that using one-hot encoding for this high-cardinality feature will lead to extreme sparsity and the curse of dimensionality. To address this, the engineer implements an embedding layer for the zip_code feature. What is the primary advantage of using embeddings in this specific scenario?

  • It creates a dense, lower-dimensional vector representation that captures latent relationships between zip codes based on their association with the target variable (housing prices).

  • It performs principal component analysis (PCA) on the one-hot encoded vectors to reduce the feature space to a smaller set of linear components.

  • It replaces each zip_code with a single numerical value representing its frequency or its average target value, reducing dimensionality without a neural network.

  • It converts each unique zip_code into a unique integer, preserving all original information in a format that is directly usable by the model.

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot