While preparing word embeddings from a 20-million-sentence corpus, a data scientist decides to use the GloVe algorithm rather than a predictive approach such as skip-gram with negative sampling. Which characteristic of GloVe's learning objective distinguishes it from those purely predictive models?
It relies on hierarchical softmax to approximate the full softmax over a large vocabulary during negative sampling.
It factorizes a TF-IDF term-document matrix with truncated singular value decomposition to obtain low-rank word vectors.
It maximizes the conditional probability of each context word given a target word using a full or sampled softmax output layer.
It minimizes a weighted least-squares loss so that the dot product of a word and a context vector equals the logarithm of their co-occurrence count, thereby preserving ratios of co-occurrence probabilities.
GloVe first constructs a global word-word co-occurrence matrix and then learns two sets of vectors by minimizing a weighted least-squares loss that forces the dot product of a word vector and a context vector to approximate the logarithm of their co-occurrence count (or, equivalently, the log of the co-occurrence probability). Because the logarithm of a ratio is the difference of logarithms, this objective makes the resulting vectors explicitly encode ratios of co-occurrence probabilities, information that is not captured when a model simply maximizes the conditional likelihood of a context word given a target word. Predictive models such as skip-gram rely on softmax-based maximum-likelihood objectives (often approximated with negative sampling or hierarchical softmax) rather than on a global least-squares formulation, and latent semantic analysis (LSA) factorizes a term-document matrix, not a word-context co-occurrence matrix. Therefore, the only statement that correctly identifies what makes GloVe unique is the weighted least-squares formulation that ties vector dot products to the log of co-occurrence counts.
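To make the objective concrete, here is a minimal NumPy sketch of that weighted least-squares loss. The co-occurrence counts, vocabulary size, and embedding dimension below are invented for illustration; the weighting-function constants `x_max = 100` and `alpha = 0.75` are the values reported in the original GloVe paper, but everything else is a toy setup rather than a production implementation.

```python
import numpy as np

# Toy symmetric co-occurrence count matrix X for a 4-word vocabulary
# (values invented for illustration only).
X = np.array([
    [0, 8, 3, 1],
    [8, 0, 5, 2],
    [3, 5, 0, 9],
    [1, 2, 9, 0],
], dtype=float)

V, d = X.shape[0], 16                        # vocabulary size, embedding dim
rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(V, d))       # word vectors
W_ctx = rng.normal(scale=0.1, size=(V, d))   # context vectors
b = np.zeros(V)                              # word biases
b_ctx = np.zeros(V)                          # context biases

def weight(x, x_max=100.0, alpha=0.75):
    """GloVe weighting function f(x): down-weights rare pairs and
    caps the influence of very frequent ones."""
    return np.where(x < x_max, (x / x_max) ** alpha, 1.0)

def glove_loss(W, W_ctx, b, b_ctx, X):
    """Weighted least-squares objective over observed co-occurrences:
    sum over (i, j) of f(X_ij) * (w_i . w~_j + b_i + b~_j - log X_ij)^2."""
    i, j = np.nonzero(X)                     # only pairs that co-occur
    pred = (W[i] * W_ctx[j]).sum(axis=1) + b[i] + b_ctx[j]
    err = pred - np.log(X[i, j])
    return np.sum(weight(X[i, j]) * err ** 2)

print(glove_loss(W, W_ctx, b, b_ctx, X))
```

Note that the sum runs only over nonzero entries of X, which is what lets GloVe train on global corpus statistics without ever evaluating a softmax over the full vocabulary, in contrast to the predictive objectives described in the other options.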