AWS Certified Solutions Architect Associate SAA-C03 Practice Question
A scientific research institute needs to offload a large collection of genomic data sets from its on-premises servers to AWS. The data sets are seldom accessed, but when they are, a delay of several hours is acceptable. The institute requires a highly cost-effective solution for storing and retrieving these data sets, with a strong focus on minimizing storage costs. What method represents the MOST cost-optimized approach to store this data?
Leverage S3 Intelligent-Tiering to automatically optimize costs between frequent and infrequent access tiers for the data sets.
Implement a Storage Gateway with stored volumes to gradually move the data sets into Amazon S3 over a direct connection.
Store the genomic data sets using the S3 Glacier Deep Archive storage class after initial upload completion.
Utilize the Hadoop Distributed File System (HDFS) on Amazon EMR with occasional syncing to S3 for the data sets not actively in use.