AWS Certified Data Engineer Associate DEA-C01 Practice Question

An analytics team is setting up a new Amazon S3 data lake and needs a centralized catalog so analysts can query the data with Amazon Athena and Amazon Redshift Spectrum. The catalog should automatically discover schemas and partitions from S3 and require no infrastructure management. Which approach is the most cost-effective solution?

  • Use AWS Lake Formation to create resource links for the S3 location and configure an external Apache Ranger catalog for metadata storage.

  • Deploy an Apache Hive metastore on an Amazon EMR cluster and use HCatalog to register the S3 data.

  • Store schema information in an Amazon DynamoDB table and build AWS Lambda functions to update partition metadata.

  • Create an AWS Glue crawler for the S3 bucket and use the AWS Glue Data Catalog as the metastore for Athena and Redshift Spectrum.

AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot