AWS Certified Data Engineer Associate DEA-C01 Practice Question
A company stores compressed CSV files at s3://datalake/transactions/year=YYYY/month=MM/day=DD/. Analysts run ad-hoc SQL with Amazon Athena. The engineering team must (1) keep partitions current as new daily folders land, (2) attach business metadata such as data owner and sensitivity, and (3) let governance users search by that metadata. Which approach delivers these capabilities with the least operational effort?
Schedule an AWS Glue crawler for the S3 prefix and store business metadata as Data Catalog tags or table properties in the AWS Glue Data Catalog.
Create a Lake Formation Blueprint to catalog the data and save business metadata as user-defined object metadata on the S3 files.
Use a daily Lambda function to run MSCK REPAIR TABLE on an Athena external table and keep business metadata in AWS Secrets Manager.
Ingest the data into Amazon Redshift, create an external schema with Redshift Spectrum, and place business metadata in Redshift COMMENT statements.
An AWS Glue crawler can be scheduled against the S3 prefix. On each run it infers the schema of the CSV files and registers any new year/month/day folders as partitions of the same table, so no manual repair step such as MSCK REPAIR TABLE is needed.
Business metadata can live in the AWS Glue Data Catalog itself. Data Catalog resources (databases, tables, and columns) support key-value tags, which Lake Formation calls LF-Tags, and tables also accept custom table properties. Both can be assigned programmatically or in the console to record details such as data owner or sensitivity. The governance team can then search the catalog by those values or build Lake Formation tag-based policies on the tags.
The other options fall short. A daily Lambda function that runs MSCK REPAIR TABLE is custom partition maintenance that the team must own, and AWS Secrets Manager is a secrets store, not searchable catalog metadata. Moving the data into Amazon Redshift adds cost and ETL work. User-defined S3 object metadata is not searchable inside the catalog. A scheduled AWS Glue crawler combined with Data Catalog tags or table properties is therefore the simplest and most operationally efficient approach.
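The approach in the explanation can be sketched with boto3. This is a minimal illustration under stated assumptions, not a reference implementation: the crawler name, IAM role ARN, database name, table name, 02:00 UTC schedule, and the property keys data_owner and sensitivity are all hypothetical. The helper functions only build request payloads, so they can be inspected without AWS access; apply_to_aws makes the actual calls. Whether SearchText matches a given table property value is worth verifying in your own account.

```python
# Sketch: scheduled Glue crawler plus business metadata stored as table properties.
# All resource names and property keys below are hypothetical placeholders.

# Subset of the fields that TableInput accepts. get_table returns extra
# read-only fields (for example DatabaseName, CreateTime) that update_table rejects.
TABLE_INPUT_KEYS = {
    "Name", "Description", "Owner", "Retention",
    "StorageDescriptor", "PartitionKeys", "TableType", "Parameters",
}


def crawler_request(name, role_arn, database, s3_path,
                    schedule="cron(0 2 * * ? *)"):
    """Build keyword arguments for glue.create_crawler.

    The default schedule (assumed here) runs once a day at 02:00 UTC.
    """
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
        "Schedule": schedule,
    }


def table_input_with_metadata(table, metadata):
    """Turn a get_table result into a TableInput with metadata merged
    into Parameters (the table properties), keeping existing properties."""
    table_input = {k: v for k, v in table.items() if k in TABLE_INPUT_KEYS}
    table_input["Parameters"] = {**table.get("Parameters", {}), **metadata}
    return table_input


def apply_to_aws(database, table_name, metadata, search_text):
    """Create the crawler, attach metadata, and search the catalog."""
    import boto3  # imported lazily so the helpers above work without boto3

    glue = boto3.client("glue")
    glue.create_crawler(**crawler_request(
        name="transactions-crawler",                              # hypothetical
        role_arn="arn:aws:iam::123456789012:role/GlueCrawlerRole",  # hypothetical
        database=database,
        s3_path="s3://datalake/transactions/",
    ))

    # The table exists only after the crawler has run at least once.
    table = glue.get_table(DatabaseName=database, Name=table_name)["Table"]
    glue.update_table(
        DatabaseName=database,
        TableInput=table_input_with_metadata(table, metadata),
    )

    # Free-text search over catalog table metadata.
    found = glue.search_tables(SearchText=search_text)
    return [t["Name"] for t in found["TableList"]]
```

A call such as apply_to_aws("datalake_db", "transactions", {"data_owner": "finance", "sensitivity": "confidential"}, "confidential") would need valid AWS credentials and a table that the crawler has already created. A crawler's default schema change policy may affect whether manually added properties survive later runs, so that setting deserves a check before relying on this pattern.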