CompTIA DataX DY0-001 (V1) Practice Question

A data scientist at a streaming service is building a logistic regression model to predict whether a subscriber will churn in the next quarter. The raw dataset contains two continuous variables:

  • TotalSpent - total amount (USD) the customer paid in the last 12 months
  • MonthlyBudget - the customer's self-reported discretionary budget (USD)

MonthlyBudget ranges from about 50 USD to 10,000 USD, so absolute spending values span more than two orders of magnitude. Exploratory analysis shows that customers who spend at least 60% of their budget rarely churn, regardless of how many dollars they actually spend.

Which transformation best encodes this insight and minimizes the influence of raw scale differences when training the model?

  • Apply Z-score standardization separately to TotalSpent and MonthlyBudget so that each has mean 0 and variance 1.

  • Create a new feature spend_to_budget_ratio = TotalSpent / (12 * MonthlyBudget) and use this ratio in place of the two original variables.

  • Introduce an interaction term by multiplying the two variables (TotalSpent * MonthlyBudget) and keep both originals.

  • Replace TotalSpent with log10(TotalSpent) to reduce skewness while leaving MonthlyBudget unchanged.
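
To make the candidate transformations concrete, the following minimal sketch computes three of them for two hypothetical customers who both spend 75% of their annualized budget, one at the low end of the budget range and one at the high end. The customer values and the helper name spend_to_budget_ratio are assumptions for illustration only and do not come from the question's dataset. Z-score standardization is omitted because it depends on the full sample's mean and variance.

```python
import math

# Hypothetical customers (illustrative values, not from any real dataset).
# Both spend 75% of their annualized budget, at very different dollar scales.
customers = [
    {"TotalSpent": 450.0, "MonthlyBudget": 50.0},
    {"TotalSpent": 90_000.0, "MonthlyBudget": 10_000.0},
]


def spend_to_budget_ratio(total_spent, monthly_budget):
    """Share of the annualized budget actually spent (unitless)."""
    return total_spent / (12 * monthly_budget)


for c in customers:
    # Ratio feature: 12-month spend divided by 12 months of budget.
    c["ratio"] = spend_to_budget_ratio(c["TotalSpent"], c["MonthlyBudget"])
    # Interaction term: product of the two raw variables.
    c["interaction"] = c["TotalSpent"] * c["MonthlyBudget"]
    # Log transform of spending only; the budget is not used.
    c["log10_spent"] = math.log10(c["TotalSpent"])

for c in customers:
    print(c)
```

In this sketch the ratio is identical for both customers, while the interaction term and the log-transformed spend still differ widely between them. A real pipeline would also need to decide how to handle a MonthlyBudget of zero or a missing value before dividing.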
