AWS Certified Data Engineer Associate DEA-C01 Practice Question
A media company stores terabytes of Parquet files in an Amazon S3 data lake, which is queried daily by Amazon Athena and Amazon Redshift Spectrum. New regulations require that only approved columns and specific rows be visible to different analytics teams. Additionally, updates to the data must be transactionally consistent. The company wants a managed solution using open table formats that minimizes both ongoing maintenance of IAM policies and data duplication. Which solution best meets these requirements?
Convert the S3 location to an AWS Lake Formation governed table and grant the analytics teams access through Lake Formation permissions to enforce transactional consistency and access control.
Use an AWS Glue ETL job to convert the data to Apache Iceberg format in Amazon S3. Define row and column filters in AWS Lake Formation and grant the analytics teams access through Lake Formation permissions.
Load the data into Amazon Redshift tables. Create distinct schemas with column-level privileges for each team and use federated queries to access any remaining S3 data.
Use AWS Glue ETL jobs to create separate S3 copies of the data for each team, removing unauthorized columns, and restrict access to each copy using S3 bucket policies.
The best solution is to convert the data to the Apache Iceberg open table format and govern it with AWS Lake Formation. Iceberg provides ACID transactions directly on Amazon S3: each write is committed as an atomic table snapshot, so concurrent readers always see a consistent view of the data and no second copy is needed. Lake Formation integrates with Iceberg tables registered in the AWS Glue Data Catalog and enforces centralized column-level and row-level permissions for query engines such as Athena and Redshift Spectrum. Access is managed through Lake Formation grants rather than per-team IAM or bucket policies, which keeps the solution managed, scalable, and based on an open standard.
The other options are less suitable. Creating per-team copies with AWS Glue duplicates data, raising storage costs and management overhead, and it relies on S3 bucket policies. Loading the data into Amazon Redshift moves it out of cost-effective data lake storage and away from an open table format, and the option describes only column-level privileges. Lake Formation governed tables are not viable because the feature has been deprecated.
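As a rough illustration of the row and column filtering described above, the sketch below builds the request payloads for the Lake Formation `create_data_cells_filter` and `grant_permissions` API calls as exposed by boto3. The account ID, database, table, column names, filter expression, and role ARN are all hypothetical placeholders, and the sketch assumes the Iceberg table is already registered in the Glue Data Catalog. The payload builders are plain Python; only `apply_filter_and_grant` touches AWS, and it is not called here.

```python
from typing import Any, Dict, Sequence


def build_cells_filter(catalog_id: str, database: str, table: str, name: str,
                       columns: Sequence[str], row_expression: str) -> Dict[str, Any]:
    """Build the TableData payload for a Lake Formation data cells filter.

    `columns` lists the approved columns; `row_expression` is the
    row-level predicate applied to every query through this filter.
    """
    return {
        "TableCatalogId": catalog_id,
        "DatabaseName": database,
        "TableName": table,
        "Name": name,
        "ColumnNames": list(columns),
        "RowFilter": {"FilterExpression": row_expression},
    }


def build_select_grant(principal_arn: str, table_data: Dict[str, Any]) -> Dict[str, Any]:
    """Build grant_permissions arguments giving SELECT on the filter only."""
    return {
        "Principal": {"DataLakePrincipalIdentifier": principal_arn},
        "Resource": {
            "DataCellsFilter": {
                "TableCatalogId": table_data["TableCatalogId"],
                "DatabaseName": table_data["DatabaseName"],
                "TableName": table_data["TableName"],
                "Name": table_data["Name"],
            }
        },
        "Permissions": ["SELECT"],
    }


def apply_filter_and_grant(table_data: Dict[str, Any], grant: Dict[str, Any]) -> None:
    """Send both requests to Lake Formation (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builders work without boto3 installed

    lakeformation = boto3.client("lakeformation")
    lakeformation.create_data_cells_filter(TableData=table_data)
    lakeformation.grant_permissions(**grant)


# Hypothetical example: one team sees three columns, EMEA rows only.
emea_filter = build_cells_filter(
    catalog_id="111122223333",        # placeholder account ID
    database="media_lake",            # assumed Glue database name
    table="viewing_events",           # assumed Iceberg table name
    name="emea_analytics_filter",
    columns=["event_id", "title_id", "watch_seconds"],
    row_expression="region = 'EMEA'",
)
emea_grant = build_select_grant(
    "arn:aws:iam::111122223333:role/EmeaAnalytics",  # placeholder role
    emea_filter,
)
```

Because the grant targets the filter rather than the table, adding a new team in this sketch means one more filter and one more grant, with no per-team copy of the data and no new bucket policy.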
Data Store Management