Your team is building a BigQuery ML regression model with CREATE MODEL and a TRANSFORM clause. Column promotion_code is a STRING with roughly 40 unique values. Column order_total is a NUMERIC field that contains many extreme outliers. You must 1) convert promotion_code into individual binary indicator features and 2) rescale order_total so that the model is less sensitive to outliers without clipping zeros. Which combination of manual preprocessing functions satisfies both requirements?
Use ML.ONE_HOT_ENCODER on promotion_code and ML.MAX_ABS_SCALER on order_total
Use ML.ONE_HOT_ENCODER on promotion_code and ML.ROBUST_SCALER on order_total
Use ML.FEATURE_CROSS on promotion_code and ML.NORMALIZER on order_total
Use ML.BUCKETIZE on promotion_code and ML.ROBUST_SCALER on order_total
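The correct combination is ML.ONE_HOT_ENCODER on promotion_code and ML.ROBUST_SCALER on order_total. ML.ONE_HOT_ENCODER turns each distinct string value in promotion_code into its own binary indicator feature, which is the standard way to represent a low-cardinality categorical variable (about 40 values here) for a linear model. ML.ROBUST_SCALER subtracts the column's median and divides by its interquartile range. Because the median and the quartiles barely move when a few extreme values are present, the scaling stays stable despite outliers, and no value is clipped. Note that a zero stays at zero only when the median is zero. The other options fall short: ML.MAX_ABS_SCALER divides by the largest absolute value, so a single outlier sets the scale; ML.NORMALIZER rescales an array to unit norm and is likewise dominated by large values; ML.FEATURE_CROSS builds combinations of categorical features rather than per-value indicators; and ML.BUCKETIZE bins a numeric expression and does not one-hot encode a string.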
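To make the arithmetic behind the explanation concrete, here is a minimal plain-Python sketch of one-hot encoding, robust scaling, and max-abs scaling. It is an illustration only, not BigQuery ML's implementation: the helper names and the sample values are made up, and BigQuery's own output format and quantile computation may differ.

```python
from statistics import median, quantiles


def one_hot(values):
    """Map each distinct string to its own 0/1 indicator column."""
    categories = sorted(set(values))
    rows = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, rows


def robust_scale(xs):
    """Subtract the median, then divide by the interquartile range (Q3 - Q1)."""
    q1, _, q3 = quantiles(xs, n=4, method="inclusive")
    med = median(xs)
    return [(x - med) / (q3 - q1) for x in xs]


def max_abs_scale(xs):
    """Divide by the largest absolute value, so one outlier sets the scale."""
    peak = max(abs(x) for x in xs)
    return [x / peak for x in xs]


# Hypothetical sample data: five ordinary orders and one extreme outlier.
order_total = [10.0, 20.0, 30.0, 40.0, 50.0, 10000.0]
promotion_code = ["SPRING", "NONE", "SPRING"]

categories, indicators = one_hot(promotion_code)
robust = robust_scale(order_total)
max_abs = max_abs_scale(order_total)

print(categories, indicators)
print([round(v, 3) for v in robust])
print([round(v, 3) for v in max_abs])
```

With this sample, robust scaling spreads the five ordinary orders between -1.0 and 0.6 and leaves the outlier large but unclipped, while max-abs scaling squeezes the ordinary orders into a range no wider than 0.005 because the outlier defines the scale.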
GCP Professional Data Engineer
Preparing and using data for analysis