GCP Professional Data Engineer Practice Question

You are building a churn-propensity model in BigQuery ML. The training table contains a numeric column named total_spend that ranges from a few cents to several thousand US dollars, and its distribution is extremely skewed. Business analysts want the model to treat spend as four ordered categories-"< 25", "25-100", "100-500", and ">= 500"-so that coefficients are learned per range and the same transformation is applied when the model is used for prediction. Inside the CREATE MODEL statement you plan to express this logic in a TRANSFORM clause. Which BigQuery ML manual preprocessing function should you use to implement the required transformation?

Apply ML.ROBUST_SCALER() to normalize total_spend using its interquartile range.
Apply ML.BUCKETIZE() with the split points in the TRANSFORM clause.
Apply ML.MAX_ABS_SCALER() to rescale total_spend between -1 and 1 before training.
Apply ML.FEATURE_CROSS() to create four spend category indicators from total_spend.

GCP Professional Data Engineer

Preparing and using data for analysis

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

GCP Professional Data Engineer Practice Question

Answer Description

Ask Bash

How does ML.BUCKETIZE work in BigQuery ML?

What is the difference between ML.BUCKETIZE and ML.FEATURE_CROSS?

When should I use ML.MAX_ABS_SCALER or ML.ROBUST_SCALER instead of ML.BUCKETIZE?

What does ML.BUCKETIZE() do in BigQuery ML?

How is ML.BUCKETIZE() different from scaling techniques like ML.MAX_ABS_SCALER?

When should ML.ROBUST_SCALER and ML.FEATURE_CROSS be used instead of ML.BUCKETIZE?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

GCP Professional Data Engineer Practice Question

Report Issue

Answer Description

Ask Bash

How does ML.BUCKETIZE work in BigQuery ML?

What is the difference between ML.BUCKETIZE and ML.FEATURE_CROSS?

When should I use ML.MAX_ABS_SCALER or ML.ROBUST_SCALER instead of ML.BUCKETIZE?

What does ML.BUCKETIZE() do in BigQuery ML?

How is ML.BUCKETIZE() different from scaling techniques like ML.MAX_ABS_SCALER?

When should ML.ROBUST_SCALER and ML.FEATURE_CROSS be used instead of ML.BUCKETIZE?

Report Issue