As part of implementing a single-output neural network layer, you define the pre-activation value as z = Wᵀx + b with W, x ∈ ℝⁿ and compute the prediction y_hat = σ(z), where σ is the logistic sigmoid. For one training sample you use the squared-error loss function
L = ½ (y_hat − y)²
Using the multivariate chain rule, which expression gives the gradient ∂L/∂W?
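For concreteness, here is a minimal NumPy sketch of the forward pass described above (the function and variable names are illustrative, not part of the original question):

```python
import numpy as np

def forward(W, x, b, y):
    """Forward pass for the single-output layer: returns z, y_hat, and the loss."""
    z = W @ x + b                      # pre-activation: z = Wᵀx + b
    y_hat = 1.0 / (1.0 + np.exp(-z))   # logistic sigmoid: y_hat = σ(z)
    loss = 0.5 * (y_hat - y) ** 2      # squared-error loss: L = ½(y_hat − y)²
    return z, y_hat, loss
```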
By the multivariate chain rule, ∂L/∂W = (∂L/∂y_hat) · (∂y_hat/∂z) · (∂z/∂W). Evaluating each factor:
∂L/∂y_hat = y_hat − y (derivative of the ½ (y_hat − y)² loss with respect to the prediction).
∂y_hat/∂z = y_hat (1 − y_hat) (derivative of the sigmoid activation).
∂z/∂W = x (because z = Wᵀx + b is linear in W).
Multiplying the three factors gives ∂L/∂W = (y_hat − y) y_hat (1 − y_hat) x. The correct option includes all three factors and preserves the sign; the other choices omit one of the factors or place terms in the wrong order, so they do not represent the full chain-rule derivative.
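As a quick sanity check of this result, the sketch below (with illustrative values that are not part of the original question) compares the analytic gradient (y_hat − y) y_hat (1 − y_hat) x against a central finite-difference estimate of ∂L/∂W:

```python
import numpy as np

def loss(W, x, b, y):
    """L = ½(σ(Wᵀx + b) − y)² for a single training sample."""
    y_hat = 1.0 / (1.0 + np.exp(-(W @ x + b)))
    return 0.5 * (y_hat - y) ** 2

rng = np.random.default_rng(0)
W, x = rng.normal(size=3), rng.normal(size=3)   # illustrative sample values
b, y = 0.1, 1.0

# Analytic gradient from the chain rule: (y_hat − y) · y_hat(1 − y_hat) · x
y_hat = 1.0 / (1.0 + np.exp(-(W @ x + b)))
grad_analytic = (y_hat - y) * y_hat * (1 - y_hat) * x

# Central finite-difference estimate of ∂L/∂W, one component at a time
eps = 1e-6
grad_numeric = np.array([
    (loss(W + eps * e, x, b, y) - loss(W - eps * e, x, b, y)) / (2 * eps)
    for e in np.eye(3)
])

print(np.allclose(grad_analytic, grad_numeric))  # expected: True
```

The agreement between the two estimates confirms that all three factors are needed; dropping any one of them would break the check.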