AWS Certified Data Engineer Associate DEA-C01 Practice Question
A company ingests daily sales data into Amazon S3 as CSV files. In Amazon SageMaker Data Wrangler, a data engineer must add a repeatable step that (1) flags, but does not remove, any rows whose total_price value is more than three standard deviations above or below the column mean and (2) lets analysts later calculate the percentage of rows that contain such values. Which built-in Data Wrangler transform meets these requirements with the least custom code?
Standard Deviation Numeric Outliers (Fix method: Invalidate)
The Handle outliers group contains the transform Standard Deviation Numeric Outliers. When the engineer sets the threshold to three standard deviations and chooses the Fix method Invalidate, Data Wrangler creates a new output column in which only the outlier values are converted to invalid (NaN) entries. The rows remain in the dataset, so analysts can count invalid values to determine the percentage of outliers. Other options either delete the rows, handle missing data, or merely summarize statistics, so they do not satisfy both requirements.
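To make the logic concrete, the following is a minimal sketch in plain Python of what "invalidate values beyond three standard deviations, then count them" amounts to. It is not Data Wrangler's implementation: the helper names `invalidate_outliers` and `percent_invalid`, the sample rows, and the use of the sample standard deviation are all assumptions made for illustration.

```python
import math
import statistics


def invalidate_outliers(rows, column, n_std=3.0):
    """Return copies of rows with outlier values in `column` replaced by NaN.

    A value counts as an outlier when it lies more than n_std standard
    deviations from the column mean. No rows are dropped.
    """
    values = [row[column] for row in rows]
    mean = statistics.mean(values)
    std = statistics.stdev(values)  # sample standard deviation (an assumption)
    result = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows stay unchanged
        if abs(row[column] - mean) > n_std * std:
            new_row[column] = math.nan
        result.append(new_row)
    return result


def percent_invalid(rows, column):
    """Percentage of rows whose `column` value is NaN."""
    invalid = sum(1 for row in rows if math.isnan(row[column]))
    return 100.0 * invalid / len(rows)


# Hypothetical data: 19 ordinary sales and one extreme value.
sales = [{"order_id": i, "total_price": 10.0} for i in range(1, 20)]
sales.append({"order_id": 20, "total_price": 1000.0})

flagged = invalidate_outliers(sales, "total_price")
outlier_pct = percent_invalid(flagged, "total_price")
print(len(flagged), outlier_pct)
```

The row count is unchanged after the step, which is what distinguishes invalidating from removing. Note that counting invalid values gives the outlier percentage only if the column had no missing values beforehand; otherwise the earlier missing values would be counted as well.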
Data Operations and Support