AWS Certified Data Engineer Associate DEA-C01 Practice Question

A company is building a data lake on Amazon S3 and wants analysts in multiple AWS accounts to easily discover and query new CSV and Parquet files as soon as they arrive. The solution must automatically capture schema and partition information and store it in a centralized catalog that Lake Formation permissions can secure. Which action should the data engineer take first to create the required data catalog?

  • Use AWS DataSync to copy the files to an Amazon EMR HDFS cluster and rely on Hive metastore auto-creation during Spark jobs.

  • Export an Amazon S3 Inventory report to CSV and load the object list into external tables using Amazon Redshift Spectrum.

  • Run daily MSCK REPAIR TABLE statements in Amazon Athena to add partitions without using a catalog.

  • Create an AWS Glue database, then configure a Glue crawler with an IAM role permitted by Lake Formation to crawl the S3 prefixes on a schedule.
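To make the Glue-based option concrete, here is a minimal sketch of what creating a Glue database and a scheduled crawler might look like with boto3. The crawler name, role ARN, database name, bucket path, region, and hourly schedule are all hypothetical placeholders, not values from the question. The request is built as a plain dictionary first so it can be inspected without AWS credentials.

```python
from __future__ import annotations

from typing import Any


def build_crawler_request(
    name: str,
    role_arn: str,
    database: str,
    s3_paths: list[str],
    schedule: str = "cron(0 * * * ? *)",  # assumed: run at the top of every hour
) -> dict[str, Any]:
    """Build the keyword arguments for the Glue CreateCrawler call."""
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        # One S3 target per prefix that holds the CSV/Parquet files.
        "Targets": {"S3Targets": [{"Path": path} for path in s3_paths]},
        "Schedule": schedule,
        # Update table definitions when the schema changes; only log deletions.
        "SchemaChangePolicy": {
            "UpdateBehavior": "UPDATE_IN_DATABASE",
            "DeleteBehavior": "LOG",
        },
    }


def create_catalog(request: dict[str, Any], region: str = "us-east-1") -> None:
    """Create the Glue database, then the crawler that populates it.

    Requires boto3 and AWS credentials; the region is an assumption.
    create_database raises an error if the database already exists.
    """
    import boto3  # imported here so the request builder works without boto3

    glue = boto3.client("glue", region_name=region)
    glue.create_database(DatabaseInput={"Name": request["DatabaseName"]})
    glue.create_crawler(**request)


# Hypothetical values for illustration only.
request = build_crawler_request(
    name="sales-lake-crawler",
    role_arn="arn:aws:iam::123456789012:role/GlueCrawlerRole",
    database="sales_lake",
    s3_paths=["s3://example-data-lake/sales/"],
)
```

Calling `create_catalog(request)` would then register the database and crawler. In a Lake Formation setup, the crawler's role would also need data location access to the S3 prefixes and permission to create tables in the database; that grant is not shown here.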

Data Store Management