CompTIA DataX DY0-001 (V1) Practice Question

You are preparing the feature request_latency_ms for a regression model. Exploratory analysis shows that only the upper tail is problematic: about 2% of records exceed 3,000 ms, while the lower tail appears clean. You must preserve all rows but limit the influence of those extreme high values. Using SciPy's winsorize, cap only the top 2% of observations and leave the lower tail completely untouched.

import numpy as np
from scipy.stats.mstats import winsorize

latency = np.load('latency.npy')
latency_clean = winsorize(latency, limits=______)  # fill in

Which tuple correctly replaces ______ to meet the requirement?

(0.02, None)
(0.0, 0.02)
(None, 0.02)
(0.02, 0.02)

CompTIA DataX DY0-001 (V1)

Operations and Processes


Answer Description

What is Winsorization, and why is it used?

How does the `limits` argument in `scipy.stats.mstats.winsorize` work?

Why is `(None, 0.02)` better than `(0.0, 0.02)` in this case?
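
As a rough illustration of the questions above, the sketch below applies `limits=(None, 0.02)` to a small synthetic array. The array of values 1 to 100 is an assumption standing in for latency.npy, not data from the question. The sketch relies on the documented meaning of the tuple: the first entry is the fraction to winsorize at the low end, the second the fraction at the high end, and `None` leaves that side open.

```python
import numpy as np
from scipy.stats.mstats import winsorize

# Synthetic stand-in for latency.npy (hypothetical values): 1, 2, ..., 100 ms
latency = np.arange(1, 101, dtype=float)

# None leaves the lower tail open; 0.02 caps the top 2% of observations
capped = winsorize(latency, limits=(None, 0.02))

# winsorize returns a masked array; convert to a plain array for comparison
changed = int((np.asarray(capped) != latency).sum())

print("rows preserved:", len(capped) == len(latency))
print("min before/after:", latency.min(), capped.min())
print("max before/after:", latency.max(), capped.max())
print("values changed:", changed)
```

With 100 observations, the two largest values are replaced by the next-largest value, the minimum stays the same, and no rows are dropped.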
