Microsoft Fabric Data Engineer Associate DP-700 Practice Question

You work in a Microsoft Fabric lakehouse. The Sales table has about 500 million rows, and the ProductSubcategory and ProductCategory tables each have fewer than 1 000 rows. You must build a daily Gold-layer table that denormalizes Sales with subcategory and category attributes while minimizing network shuffle and keeping the join in memory. Which Spark technique should you apply before running the joins?

Combine the three DataFrames with unionByName() and apply filters afterward.
Disable Adaptive Query Execution so that Spark resorts to default shuffle hash joins.
Repartition the Sales DataFrame to a single partition, then perform the joins sequentially.
Use the Spark broadcast() function (or BROADCAST join hint) on the two small lookup DataFrames before joining them to Sales.

Microsoft Fabric Data Engineer Associate DP-700

Ingest and transform data

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

Microsoft Fabric Data Engineer Associate DP-700 Practice Question

Answer Description

Ask Bash

What is a Spark broadcast join?

What is Adaptive Query Execution (AQE) in Spark?

How does network shuffle affect performance in Spark?

Monthly

$19.99 $11.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99 $26.99

One time purchase of $26.99,
Does not auto-renew.

Annual Pass

$119.99 $71.99

One time purchase of $71.99,
Does not auto-renew.

Lifetime Pass

$189.99 $113.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

Microsoft Fabric Data Engineer Associate DP-700 Practice Question

Report Issue

Answer Description

Ask Bash

What is a Spark broadcast join?

What is Adaptive Query Execution (AQE) in Spark?

How does network shuffle affect performance in Spark?

Report Issue