🔥 40% Off Crucial Exams Memberships — Deal ends today!

54 minutes, 4 seconds remaining!

GCP Professional Data Engineer Practice Question

You are designing the vector store layer of a retrieval-augmented generation (RAG) pipeline for 10 million technical-manual paragraphs. Each paragraph will be embedded with ML.GENERATE_EMBEDDING using the Vertex AI model textembedding-gecko. At query time, a user question must be embedded once and matched against the stored vectors with millisecond-level latency directly inside BigQuery. Future product manuals will arrive daily, so the solution must keep the index current without manual rebuilds and avoid unnecessary I/O when the RAG query runs. Which approach best meets these requirements?

  • Store each paragraph and its ARRAY embedding in a partitioned, clustered BigQuery table, then create a VECTOR index on the embedding column for approximate nearest-neighbor search.

  • Build a materialized view that calls ML.GENERATE_EMBEDDING at query time so vectors are not stored; scan the view with a WHERE clause that compares embeddings using the COSINE_DISTANCE function.

  • Export embeddings as JSON to Cloud Storage, ingest them into Vertex AI Matching Engine, and perform the similarity search there, returning IDs to BigQuery for a join.

  • Write embeddings to an external BigLake table over Cloud Storage files and invoke ML.ANN_SEARCH with a full brute-force scan on every query.

GCP Professional Data Engineer
Preparing and using data for analysis
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot