GCP Professional Data Engineer Practice Question

A retail company is implementing a streaming Dataflow pipeline that aggregates purchase events into fixed 60-minute windows keyed by store ID. Analysts need rolling insights: interim counts every 5 minutes while the window is still open, and one final, complete result as soon as the event-time watermark passes the window end. Events that arrive more than 10 minutes late must be silently dropped. Which Beam trigger configuration and pane accumulation setting best satisfy these requirements with minimal custom logic?

  • Use AfterWatermark past end-of-window with early firings every 5 minutes based on processing time, set allowed lateness to 10 minutes, and enable accumulating fired panes.

  • Use AfterWatermark past end-of-window without early firings, add late firings up to 10 minutes, and keep discarding fired panes.

  • Use Repeatedly(AfterCount(1000)) with accumulating fired panes and unlimited allowed lateness.

  • Use Repeatedly(AfterProcessingTime.pastFirstElement(5 minutes)) with allowed lateness 10 minutes and discarding fired panes.

GCP Professional Data Engineer
Ingesting and processing the data
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot