🔥 40% Off Crucial Exams Memberships — Deal ends today!

3 hours, 1 minute remaining!

GCP Professional Data Engineer Practice Question

You are building a BigQuery ML logistic-regression model on table prod.customers, which contains nullable numeric columns (usage_minutes, tenure_days) and a high-cardinality STRING column plan_type. Analysts will later call ML.PREDICT directly on the raw table from BI dashboards. You need to guarantee that missing numeric values are mean-imputed and that plan_type is one-hot encoded during both model training and every subsequent prediction, without requiring any additional preprocessing SQL in the dashboards. What should you do?

  • Apply only numeric normalization in the TRANSFORM clause and instruct dashboard developers to one-hot encode plan_type within their ML.PREDICT queries.

  • Create a materialized view that performs the imputing and one-hot encoding, train the model on that view, and require dashboards to invoke ML.PREDICT against the view instead of the raw table.

  • Run a scheduled Dataflow pipeline that writes a fully preprocessed feature table; instruct dashboards to join to this table before calling ML.PREDICT so that the model receives clean features.

  • Specify a TRANSFORM clause when you CREATE MODEL, using ML.IMPUTER for the numeric columns and ML.ONE_HOT_ENCODER for plan_type; BigQuery ML will reuse these transformations automatically during ML.PREDICT.

GCP Professional Data Engineer
Preparing and using data for analysis
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot