AWS SAA-C03 - Practice Test #3
65 complete questions for AWS Solutions Architect Associate exam preparation.
An analytics company stores petabytes of data in Amazon S3. Data scientists access these files unpredictably; some datasets are queried heavily for weeks and then ignored for months, while others are read sporadically. The finance team demands a drastic reduction in the AWS bill. What is the MOST cost-effective solution for this scenario?
Category: Design Cost-Optimized Architectures
Explanation
Detailed breakdown of the correct answer
Amazon S3 Cost Optimization
When data access patterns are unknown or unpredictable, S3 Intelligent-Tiering is the ideal choice. It automatically moves objects between frequent, infrequent, and archive access tiers based on actual access patterns, without incurring retrieval charges.
Therefore, the correct answer is: Use the Amazon S3 Intelligent-Tiering storage class for all objects.
The option that says: Configure an S3 Lifecycle Policy... to S3 Standard-IA is incorrect because S3 Standard-IA charges a data retrieval fee. Since access is unpredictable, they could incur high retrieval costs if data is suddenly accessed.
The option that says: Move all data to Amazon S3 Glacier Flexible Retrieval... is incorrect because Glacier is designed for archiving and does not support the interactive, millisecond-latency access needed by data scientists.
The option that says: Store the data in Amazon S3 One Zone-IA... is incorrect because, in addition to charging for retrieval (like Standard-IA), it compromises durability by storing data in a single availability zone, which does not align with a Data Lake retention strategy unless the data is easily reproducible.