You are preparing a 250,000 × 150,000 TF-IDF document-term matrix stored in CSR sparse format. The downstream model is an SGD-optimized linear classifier that applies an L2 penalty and assumes all numeric features are on comparable scales. Because of memory limits, centering the data is not an option: any operation that alters zero entries would densify the matrix and make the process infeasible. Which scaling technique is the most appropriate to meet the model's requirements while preserving sparsity?
Apply standard z-score scaling that subtracts the mean and divides by the standard deviation for every feature.
Scale each column with a MaxAbs scaler so its maximum absolute value becomes 1 while zeros remain unchanged.
Use a Robust scaler that subtracts the median and divides by the interquartile range of each feature.
Transform the data with a MinMax scaler to map every feature into the interval [0, 1].
MaxAbs scaling divides every feature by its maximum absolute value, mapping it into the range [-1, 1] without shifting the data. Because it performs no centering, all explicit zeros remain zeros, so the matrix stays sparse and the transformation is memory-efficient, which is exactly what very large TF-IDF inputs require. Standard z-score scaling with mean removal would break sparsity by inserting non-zero offsets into every row. Robust scaling with centering enabled also destroys sparsity and cannot be fitted directly to sparse inputs. MinMax scaling subtracts the per-feature minimum; even when that minimum is zero for most TF-IDF columns, any non-zero minimum introduces a shift that densifies those columns. Unlike MaxAbs scaling, it also maps features into a non-negative range such as [0, 1], which may hurt the convergence of algorithms that expect zero-centered data.
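The sparsity-preserving behavior described above can be verified on a toy matrix. The sketch below uses scikit-learn's `MaxAbsScaler` on a small CSR matrix (the values are illustrative, not real TF-IDF weights) and checks that the number of stored non-zeros is unchanged after scaling:

```python
import numpy as np
from scipy.sparse import csr_matrix, issparse
from sklearn.preprocessing import MaxAbsScaler

# Small sparse TF-IDF-like matrix (3 documents x 4 terms); values are illustrative.
X = csr_matrix(np.array([
    [0.0, 2.0, 0.0, 4.0],
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 4.0, 3.0, 0.0],
]))

# MaxAbsScaler divides each column by its maximum absolute value and
# accepts sparse input directly, since it performs no centering.
scaler = MaxAbsScaler()
X_scaled = scaler.fit_transform(X)

# The result is still sparse: no explicit zeros were turned into non-zeros.
print(issparse(X_scaled))       # True
print(X_scaled.nnz == X.nnz)    # True: sparsity preserved
print(X_scaled.toarray())
# Column 1 (max 4.0) becomes [0.5, 0.0, 1.0]; every column maximum is now 1.
```

A z-score scaler, by contrast, would have to subtract a per-column mean from every entry, turning most zeros into non-zeros and densifying the matrix.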