Microsoft Fabric Data Engineer Associate DP-700 Practice Question
You manage a Microsoft Fabric lakehouse that ingests micro-batches into a Delta table named Sales. After several months, the table contains thousands of very small data files, and analysts report that queries filtering on the OrderDate column now take much longer to finish. With minimal code changes, which action should you perform to most effectively improve scan and query performance on the Sales table?
Use COPY INTO to export the data to a single large Parquet file and replace the table.
Increase the notebook session's driver memory to provide more Spark cache capacity during queries.
Execute OPTIMIZE Sales ZORDER BY (OrderDate); to compact small files and cluster rows on the filter column.
Run VACUUM Sales RETAIN 0 HOURS; to delete obsolete data files from the table.
The Delta Lake OPTIMIZE command rewrites the table into a smaller number of larger files, which reduces metadata overhead and improves scan efficiency. Adding the ZORDER BY clause on a frequently filtered column such as OrderDate also colocates rows with similar values in the same data files, so file-level min/max statistics let the engine skip more files and read less data when the filter is applied. VACUUM only deletes obsolete files that the table no longer references and does not reorganize current data. Replacing the table with a single plain Parquet file would give up Delta functionality, and a single file may not be practical for large data volumes. Increasing notebook driver memory may speed up some Spark jobs but does not address the root cause, which is file fragmentation in storage, so it yields little benefit for downstream queries executed by other engines.
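The effect of compaction plus clustering on file skipping can be sketched with a toy simulation. This is plain Python, not Delta Lake: each "file" is reduced to the min/max OrderDate it holds, sorting stands in for clustering on a single column, and the row counts, file sizes, date range, and helper names (make_rows, write_files, files_scanned) are all assumptions chosen for illustration.

```python
import random
from datetime import date, timedelta


def make_rows(n, days, seed=0):
    """Generate n OrderDate values in random arrival order (hypothetical data)."""
    rng = random.Random(seed)
    start = date(2024, 1, 1)
    return [start + timedelta(days=rng.randrange(days)) for _ in range(n)]


def write_files(rows, rows_per_file):
    """Split rows into 'files' and keep only each file's (min, max) OrderDate."""
    files = []
    for i in range(0, len(rows), rows_per_file):
        chunk = rows[i:i + rows_per_file]
        files.append((min(chunk), max(chunk)))
    return files


def files_scanned(files, lo, hi):
    """Count files whose min/max range overlaps the filter lo <= OrderDate <= hi."""
    return sum(1 for fmin, fmax in files if fmax >= lo and fmin <= hi)


rows = make_rows(10_000, 365)

# Micro-batch ingestion: many tiny files, dates mixed within each file.
fragmented = write_files(rows, 10)

# After compaction and clustering: few large files, each covering a narrow date range.
compacted = write_files(sorted(rows), 1000)

lo, hi = date(2024, 6, 1), date(2024, 6, 7)
frag_scanned = files_scanned(fragmented, lo, hi)
comp_scanned = files_scanned(compacted, lo, hi)

print(f"fragmented: {frag_scanned} of {len(fragmented)} files must be read")
print(f"compacted:  {comp_scanned} of {len(compacted)} files must be read")
```

In the fragmented layout almost every small file spans a wide date range, so min/max statistics rule out very few of them. Once rows are grouped by OrderDate, a one-week filter overlaps only the file or two that actually contain that week.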
Monitor and optimize an analytics solution