CompTIA DataX DY0-001 (V1) Practice Question

You are validating a 10-million-row transaction table exported from a retail point-of-sale system. The quantity_sold column must hold non-negative integers, but a quick scan shows that about 0.08% of the rows contain strings such as "ten", "five", or "three", the result of occasional cashier keying errors. No store, cashier, or date is consistently affected.

Which remediation best addresses this idiosyncratic data error while preserving the analytic usefulness of the column?

  • Map the spelled-out numerals to integers with a dictionary (e.g., Series.replace()) and then cast quantity_sold to an integer dtype.

  • Delete every record whose quantity_sold value is not already numeric to enforce column integrity.

  • Overwrite the entire quantity_sold column with its global median so every row shares a consistent numeric value.

  • Convert the whole quantity_sold column to string so it can store both numeric and text values unchanged.
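
For illustration, the dictionary-mapping option could be sketched in pandas roughly as follows. This is a minimal sketch under stated assumptions, not a reference solution: the lookup table WORD_TO_INT and the helper clean_quantity are hypothetical names, the column is assumed to arrive as text (as from a CSV export), and Series.map() combined with pd.to_numeric() stands in for the Series.replace() call named in the option so that any unmapped value is reported instead of passing through silently.

```python
import pandas as pd

# Hypothetical lookup table: extend it with whatever spellings the scan finds.
WORD_TO_INT = {"three": 3, "five": 5, "ten": 10}


def clean_quantity(series: pd.Series) -> pd.Series:
    """Return quantity_sold as int64, translating spelled-out numerals."""
    # Normalize case and whitespace so " Ten" matches "ten".
    text = series.astype(str).str.strip().str.lower()

    # Values that already look numeric parse here; everything else becomes NaN.
    numeric = pd.to_numeric(text, errors="coerce")

    # Spelled-out numerals come from the dictionary; everything else becomes NaN.
    mapped = text.map(WORD_TO_INT)

    combined = numeric.fillna(mapped)

    # Refuse to guess: report anything neither numeric nor in the dictionary.
    unresolved = series[combined.isna()]
    if not unresolved.empty:
        raise ValueError(f"Unmapped values: {sorted(set(unresolved.astype(str)))}")

    # Enforce the column rule from the question: non-negative integers only.
    if ((combined < 0) | (combined % 1 != 0)).any():
        raise ValueError("quantity_sold must contain non-negative integers")

    return combined.astype("int64")


raw = pd.Series(["2", "ten", "1", "five", "three", "7"], name="quantity_sold")
cleaned = clean_quantity(raw)
print(cleaned.tolist())
```

Because the affected rows are repaired rather than dropped or overwritten, every transaction stays in the table, and the final cast gives the column a single integer dtype.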

Operations and Processes