CompTIA DataX DY0-001 (V1) Practice Question

You are preparing a 250,000 × 150,000 TF-IDF document-term matrix stored in CSR sparse format. The downstream model is an SGD-optimized linear classifier that applies an L2 penalty and assumes all numeric features are on comparable scales. Because of memory limits, centering the data is not an option-any operation that alters zero entries would densify the matrix and make the process infeasible. Which scaling technique is the most appropriate to meet the model's requirements while preserving sparsity?

  • Apply standard z-score scaling that subtracts the mean and divides by the standard deviation for every feature.

  • Scale each column with a MaxAbs scaler so its maximum absolute value becomes 1 while zeros remain unchanged.

  • Use a Robust scaler that subtracts the median and divides by the interquartile range of each feature.

  • Transform the data with a MinMax scaler to map every feature into the interval .

CompTIA DataX DY0-001 (V1)
Modeling, Analysis, and Outcomes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot