CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is evaluating several classifiers for a large-scale e-mail filtering project. The feature set is a 750 000 × 120 000 bag-of-words matrix stored in compressed-sparse-row (CSR) format with fewer than 1 % non-zero values. Training MultinomialNB and LinearSVC completes quickly and stays below 4 GB of RAM, but running GaussianNB on the same matrix causes the Python process to allocate more than 60 GB before the job is killed.

Which property of sparse-matrix handling in this scenario best explains why the GaussianNB run exhausts memory while the other two models do not?

  • GaussianNB requires integer word-count features, so it duplicates the sparse matrix as a separate float array before fitting.

  • GaussianNB computes an all-pairs Euclidean distance matrix and therefore materializes a full n × n distance table in memory.

  • GaussianNB implicitly converts the CSR matrix to a dense array in order to calculate feature means and variances, causing all zero entries to be stored explicitly.

  • GaussianNB applies kernel density estimation that adds synthetic features, dramatically increasing dimensionality when the input is sparse.
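The memory gap at the heart of this scenario, CSR storage versus an explicit dense array, can be sketched with a small scipy example. The dimensions below are hypothetical, scaled-down stand-ins for the 750 000 × 120 000 matrix; at the full size, a dense float64 copy would need 750 000 × 120 000 × 8 B ≈ 720 GB, consistent with the process being killed after allocating 60 GB.

```python
import numpy as np
from scipy import sparse

# Hypothetical scaled-down stand-in for the 750 000 x 120 000 CSR matrix
# in the question (the real matrix is far too large to densify).
X = sparse.random(1_000, 2_000, density=0.01, format="csr",
                  dtype=np.float64, random_state=0)

# CSR stores only the non-zero values plus two index arrays.
csr_bytes = X.data.nbytes + X.indices.nbytes + X.indptr.nbytes

# A dense array stores every zero explicitly, one float64 per cell,
# which is what computing per-feature means/variances on a contiguous
# array effectively demands.
dense_bytes = X.shape[0] * X.shape[1] * np.dtype(np.float64).itemsize

print(f"CSR:   {csr_bytes:,} bytes")
print(f"Dense: {dense_bytes:,} bytes")
```

At 1 % density the dense copy is tens of times larger than the CSR representation, and the ratio grows as density falls, which is why models that consume CSR directly stay within a few gigabytes while a densifying estimator exhausts RAM.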

Domain: Modeling, Analysis, and Outcomes