AWS Certified Data Engineer Associate DEA-C01 Practice Question
An e-commerce company runs an Amazon Redshift RA3 cluster. Every day the data engineering team loads click-stream files from Amazon S3. Each file is a JSON array whose event attributes change frequently as the web application evolves. The team must ensure that newly added attributes are immediately queryable in Redshift with minimal ongoing maintenance while still delivering good analytic performance. Which solution meets these requirements?
Stream the files into an Amazon RDS PostgreSQL database with AWS DMS and query the data from Redshift using federated queries.
Create a Redshift table that contains a single SUPER column, run COPY with FORMAT JSON 'noshred', and use PartiQL to query new attributes.
Run an AWS Glue crawler before each load to update a Glue Data Catalog table on S3 and query it from Redshift Spectrum.
Maintain a JSONPaths file listing all attributes, use COPY with the JSONPaths option, and ALTER TABLE ADD COLUMN for every new attribute.
Storing the raw records in a column of type SUPER lets Amazon Redshift ingest the full JSON document without pre-defining every attribute. A COPY command with FORMAT JSON 'noshred' loads each record as a single SUPER object, so no table alteration or JSONPaths maintenance is needed when new fields appear. PartiQL and Redshift JSON functions can then query any attribute inside the SUPER object, while the data remains compressed and columnar for good performance.
Running a Glue crawler with Redshift Spectrum still requires executing the crawler before every load and can add query-time overhead. Maintaining a JSONPaths file and altering the table for every new attribute is labor-intensive. Moving the data to Amazon RDS and using federated queries introduces unnecessary complexity and doesn't solve the schema-evolution problem inside Redshift.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is a SUPER column in Amazon Redshift?
Open an interactive chat with Bash
What does FORMAT JSON 'noshred' mean in the COPY command?
Open an interactive chat with Bash
What is PartiQL, and how does it work with Amazon Redshift?
Open an interactive chat with Bash
What are SUPER columns in Amazon Redshift?
Open an interactive chat with Bash
What is the benefit of using FORMAT JSON 'noshred' during a COPY operation?
Open an interactive chat with Bash
How does PartiQL work with Redshift SUPER columns?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .