You are building a text-clustering workflow that starts with an extremely sparse 1 000 000 × 50 000 term-document matrix X. Because the matrix will not fit in memory when densified, constructing the covariance matrix XᵀX for a standard principal component analysis (PCA) is not an option. Instead, you choose to apply a truncated singular value decomposition (t-SVD) to reduce the dimensionality of X prior to clustering.
Which statement best explains why t-SVD is generally preferred over covariance-based PCA for this scenario?
t-SVD guarantees that the resulting singular vectors are both orthogonal and sparse, making clusters easier to interpret than those obtained from PCA.
t-SVD forces all components of the lower-dimensional representation to be non-negative, so the projected features can be read as probabilities without any post-processing.
t-SVD automatically scales every column of X to unit variance, eliminating the need for TF-IDF or other term-weighting schemes.
t-SVD can be computed with iterative methods (e.g., randomized SVD or Lanczos) that multiply X by vectors without ever materializing XᵀX, allowing the decomposition to run efficiently on the sparse matrix.
Truncated SVD operates directly on the original sample matrix, so iterative solvers such as randomized SVD or Lanczos need only matrix-vector products with X. This avoids forming or storing the dense 50 000 × 50 000 covariance matrix, lets the solver stream over the sparse data structure, and greatly reduces both memory usage and runtime. Covariance-based PCA, in contrast, first centers the data (which by itself destroys sparsity, since subtracting nonzero column means fills in the zero entries) and then computes XᵀX (or XXᵀ), which is prohibitive for a huge, sparse term-document matrix. The other options describe properties that t-SVD does not provide: it does not automatically normalize term frequencies, it does not guarantee sparsity of the singular vectors (and PCA's principal directions are orthogonal as well), and it does not enforce non-negativity on the embedded features.
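The memory-friendly workflow can be sketched with scikit-learn's `TruncatedSVD`, which accepts sparse input and, with the default randomized solver, never materializes XᵀX. The dimensions and density below are stand-ins chosen so the example runs quickly; a real term-document matrix would come from a vectorizer such as `TfidfVectorizer`.

```python
import numpy as np
from scipy.sparse import random as sparse_random
from sklearn.decomposition import TruncatedSVD

# Small stand-in for the huge term-document matrix: 1000 docs x 500 terms,
# ~0.5% nonzero, stored in CSR format -- the matrix is never densified.
X = sparse_random(1000, 500, density=0.005, format="csr", random_state=0)

# The randomized solver only needs matrix-vector products with X,
# so it works directly on the sparse structure.
svd = TruncatedSVD(n_components=50, algorithm="randomized", random_state=0)
X_reduced = svd.fit_transform(X)

print(X_reduced.shape)  # (1000, 50) -- dense low-dimensional embedding
```

The resulting `X_reduced` is the low-rank representation you would feed into a clustering algorithm such as k-means; in text applications this pipeline is known as latent semantic analysis.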