Microsoft Fabric Data Engineer Associate DP-700 Practice Question

You are using Apache Spark Structured Streaming in a Microsoft Fabric lakehouse to process click-stream data. Business analysts need rolling, device-level counts of clicks over the last 10 minutes, refreshed every 2 minutes. Late-arriving events (out-of-order by timestamp) must be accepted if they arrive within three minutes of their event time. Which aggregation statement meets all requirements?

  • .withWatermark("eventTime","3 minutes").groupBy(window(col("eventTime"), "10 minutes"), col("deviceId")).count()

  • .withWatermark("eventTime","3 minutes").groupBy(window(col("eventTime"), "10 minutes", "2 minutes"), col("deviceId")).count()

  • .groupBy(window(col("eventTime"), "10 minutes"), col("deviceId")).count()

  • .withWatermark("eventTime","10 minutes").groupBy(window(col("eventTime"), "2 minutes", "10 minutes"), col("deviceId")).count()

Microsoft Fabric Data Engineer Associate DP-700
Ingest and transform data
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot