We store files in S3 but we may need different types of storage based on our needs. Like some files that are infrequently accessed some that are needed once a year so paying the same charges as a normal storage does not make sense for these. So, based on our needs AWS has given us various types of storage based on needs.There are 6 types of storage types in AWS S3
- S3 Standard for general-purpose storage of frequently accessed data
- S3 Intelligent-Tiering for data with unknown or changing access patterns
- S3 Standard-Infrequent Access (S3 Standard-IA)for long-lived, but less frequently accessed data
- S3 One Zone-Infrequent Access (S3 One Zone-IA) for long-lived, but less frequently accessed data
- Amazon S3 Glacier (S3 Glacier) for archives
- Amazon S3 Glacier Deep Archive (S3 Glacier Deep Archive) for long-term archive and digital preservation.
All these storage classes varies on the availability of our objects in our S3 buckets for usage. Let us study each one of them in detail
S3 Standard
This is the storage type we have been using until now,it is the default for our S3 buckets. It has low latency and high availability. This is used when the data has a very frequent access. It has very high availability and AWS is bound by the SLA agreement on this which makes sure its always highly available. It is used for dynamic websites, content distribution, mobile and gaming applications, and big data analytics.
S3 Standard-Infrequent Access
This has 99.9% percent availability. Suitable for data when is accessed less but requires immediate access when needed. It requires less money for storage but whenever we retrieve the objects under this class we have to pay a fee so its not good for frequent access. It is mainly used for long-term storage, backups, and as a data store for disaster recovery files.
S3 Intelligent-Tiering
If you don't know where to put your data in standard access or Infrequent Access. We have this storage class in which S3 automatically stores data according to pour retrieval rates. If data is frequently accessed then it is put in standard storage other wise in Infrequent Access . This is the most cost optimized class if you don't want to think in which class to put your data. It is the ideal storage class for data sets with unknown storage access patterns, like new applications, or unpredictable access patterns, like data lakes.
S3 One Zone-Infrequent Access
One Zone-Infrequent Access it has all the same properties of Infrequent access except all the above storage are in 3 availability zones but this storage class is only in one AZ so is prone to disasters. It is ideal for customers who want a lower-cost option for infrequently accessed data but do not require the availability and resilience of S3 Standard or S3 Standard-IA. It’s a good choice for storing secondary backup copies of on-premises data or easily re-creatable data. You can also use it as cost-effective storage for data that is replicated from another AWS Region using S3 Cross-Region Replication. It is normally used for data that you want ti copy across regions as copying it across 3 AZ may not make sense.
Amazon S3 Glacier
S3 glacier is a very low cost storage available on S3. Normally data stored here is kept for years. you normally use it for data archiving. You can reliably store any amount of data at costs that are competitive with or cheaper than on-premises solutions. It has 3 sub classes of data retrievals available in S3 glacier
- Expedited - Data retrieval in 1 to 5 min
- Standard - Data retrieval in 3 to 5 hours
- Bulk - Data retrieval in 5 to 12 hours
Amazon S3 Deep Glacier
S3 Glacier Deep Archive is Amazon S3’s lowest-cost storage class and supports long-term retention and digital preservation for data that may be accessed once or twice in a year. It is designed for customers — particularly those in highly-regulated industries, such as the Financial Services, Healthcare, and Public Sectors — that retain data sets for 7-10 years or longer to meet regulatory compliance requirements. S3 Glacier Deep Archive can also be used for backup and disaster recovery use cases, and is a cost-effective and easy-to-manage alternative to magnetic tape systems, whether they are on-premises libraries or off-premises services. S3 Glacier Deep Archive complements Amazon S3 Glacier, which is ideal for archives where data is regularly retrieved and some of the data may be needed in minutes. It has 3 sub classes of data retrievals available in S3 deep glacier
- Standard - Data retrieval in 12 hours
- Bulk - Data retrieval in 48 hours
S3 Standard | S3 Intelligent-Tiering* | S3 Standard-IA | S3 One Zone-IA† | S3 Glacier | S3 Glacier Deep Archive | |
---|---|---|---|---|---|---|
Designed for durability | 99.999999999% (11 9’s) | 99.999999999% (11 9’s) | 99.999999999% (11 9’s) | 99.999999999% (11 9’s) | 99.999999999% (11 9’s) | 99.999999999% (11 9’s) |
Designed for availability | 99.99% | 99.9% | 99.9% | 99.5% | 99.99% | 99.99% |
Availability SLA | 99.9% | 99% | 99% | 99% | 99.9% | 99.9% |
Availability Zones | ≥3 | ≥3 | ≥3 | 1 | ≥3 | ≥3 |
Minimum capacity charge per object | N/A | N/A | 128KB | 128KB | 40KB | 40KB |
Minimum storage duration charge | N/A | 30 days | 30 days | 30 days | 90 days | 180 days |
Retrieval fee | N/A | N/A | per GB retrieved | per GB retrieved | per GB retrieved | per GB retrieved |
First byte latency | milliseconds | milliseconds | milliseconds | milliseconds | select minutes or hours | select hours |
Storage type | Object | Object | Object | Object | Object | Object |
Lifecycle transitions | Yes | Yes | Yes | Yes | Yes | Yes |
In this article we studies about the various S3 classes in the next article we will study how to use them and move various things into different storage classes.
Top comments (8)
Thanks,
Could you explain this:
"One Zone-Infrequent Access it has all the same properties of Infrequent access except all the above storage are in 3 availability zones but this storage class is only in one AZ so is prone to disasters." ?
What do you mean by all others types are in >=3 availability zones?
Does it mean that there are always synced s3 buckets over >=3 zones by default?
So we get 3 synced replicas by default? Each one stands as backup with primary one?
Yes as you can see in the chart by default other classes are stored in 3 or more Availability zones by default in AWS. One Zone-Infrequent Access but in one zone IF its only stored in a single availability zone which makes it a cheaper option
So all 3 others buckets operate as replicas or standalone backups which do not accept connection?
Could not understand you
All 3 replicas operate and accept connection, or they act as backup?
AWS is bound by the sla if the fail you to provide high availability they may have to return billions of dollars to customers. So, they automatically copy it over 3 AZ
Ok so these 3 do not operate as read replicas for example. By default they stand for backup?
I don't know this is a managed service by AWS and this is the info they give so basically they manage everything without you knowing and I don't know how they implemented it