GCP Professional Data Engineer Practice Question

An e-commerce retailer stores all order data in an on-premises Oracle 12c database. Analysts need this data available in BigQuery for near-real-time reporting, while the engineering team wants every change captured in Cloud Storage for replay if pipelines fail. The solution must

  • stream new and updated rows within seconds,
  • avoid installing agents or creating excessive read locks on the source database, and
  • automatically propagate future schema changes to the cloud targets.

Which Google Cloud service should you choose to implement the data movement with minimal operational overhead?

  • Database Migration Service with continuous replication from Oracle to Cloud SQL followed by federated queries from BigQuery.

  • Datastream, to stream Oracle change data capture records into Cloud Storage and automatically ingest them into BigQuery.

  • BigQuery Data Transfer Service configured with an Oracle connector to load tables on a daily schedule.

  • A custom Apache Beam pipeline on Dataflow that polls the Oracle database and writes results to BigQuery and Cloud Storage.

GCP Professional Data Engineer
Designing data processing systems