CompTIA DataX DY0-001 (V1) Practice Question

You are preparing the feature request_latency_ms for a regression model. Exploratory analysis shows that only the upper tail is problematic: about 2 % of records exceed 3 000 ms, while the lower tail appears clean. You must preserve all rows but limit the influence of those extreme high values using SciPy's winsorize so that only the top 2 % of observations are capped and the lower tail is left completely untouched.

import numpy as np
from scipy.stats.mstats import winsorize

latency = np.load('latency.npy')
latency_clean = winsorize(latency, limits=______)  # fill in

Which tuple correctly replaces ______ to meet the requirement?

  • (0.02, None)

  • (0.02, 0.02)

  • (0.0, 0.02)

  • (None, 0.02)

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot