You are preparing a large e-commerce transactions table for a sales-forecasting model. A validation query reveals that every record whose client_app_version equals "3.2.1-legacy" shows order_amount values roughly 100× larger than comparable orders (for example, a typical $75 purchase is stored as 7,500).
The mobile engineering team confirms that this specific app version sent monetary values in cents instead of dollars; no other rows are affected.
To correct the data while preserving information and maintaining data-lineage metadata, which data-wrangling action should you take?
Replace the affected order_amount values with NULL and later impute them with the overall median order value.
Drop all rows generated by client_app_version = '3.2.1-legacy' to remove the corrupted records completely.
Treat the inflated values as idiosyncratic errors and winsorize order_amount at the 99th percentile across the entire dataset to cap extreme values.
Identify the issue as a scale-factor systematic error and divide order_amount by 100 only for rows where client_app_version = '3.2.1-legacy', recording the transformation in the pipeline metadata.
The inflation affects all rows from one known app version in a consistent, proportional way, so it is a scale-factor systematic error. Because both the cause and the factor (100) are known, the most accurate fix is a deterministic transformation that rescales only the affected subset. Winsorizing, deleting, or null-and-impute approaches would either distort valid observations or discard usable data, and none of them addresses the systematic nature (a consistent proportional bias) of the error. Dividing the amounts by 100 for the identified rows corrects the measurements, retains every transaction, and documenting the step in the pipeline metadata satisfies the data-lineage requirement.
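A minimal sketch of this correction in pandas is shown below. The DataFrame, column names, and the structure of the lineage entry are illustrative assumptions; a production pipeline might instead write the lineage record to a metadata store or a tool such as OpenLineage.

```python
import pandas as pd

# Illustrative data: legacy-version rows carry amounts in cents.
orders = pd.DataFrame({
    "order_id": [1001, 1002, 1003],
    "client_app_version": ["3.2.1-legacy", "4.0.0", "3.2.1-legacy"],
    "order_amount": [7500.0, 75.0, 12000.0],
})

# Identify only the rows produced by the faulty app version.
affected = orders["client_app_version"] == "3.2.1-legacy"

# Deterministic rescale: cents -> dollars for the affected subset only.
orders.loc[affected, "order_amount"] = orders.loc[affected, "order_amount"] / 100

# Record the transformation as a lineage entry (structure is an assumption).
lineage_log = [{
    "step": "scale_factor_correction",
    "column": "order_amount",
    "filter": "client_app_version == '3.2.1-legacy'",
    "operation": "divide by 100 (cents to dollars)",
    "rows_affected": int(affected.sum()),
}]

print(orders)
print(lineage_log)
```

Keying the fix on the boolean mask rather than on the inflated values themselves ensures that legitimately large orders from other app versions are left untouched.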